Gene Haur_3068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3068 
Symbol 
ID5734940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3873148 
End bp3875013 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content55% 
IMG OID641280212 
Productserine/threonine protein kinase 
Protein accessionYP_001545834 
Protein GI159899587 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[S] Function unknown
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000268642 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACTG TTATGAACGC ATTAGCGTGT ACAAATTGTC AGATGTTGCT AGCACCAGGT 
GCACGATTCT GCTCGCATTG TGGAACGGTT GCGCCAACGC CACCAGCTAT CAACCTTCCA
CGCGAAGCAA GCGGCACGCT GCGACGTGGG GTGATTCTCC AAGAGCGTTA TAAAATTATG
ACCTTTATTG GAGCTGGTGG TTTTAGTCGG GTTTATGGTG CAATCGATTT GCGGCTCAAA
ACGCCCTGCG CCATCAAAGA AAACAATGAA TTTGATCAAT CGGCTCATGC TCAGTTTTTG
GCCGAAGCCC AATTGTTGGC GCGTTTGCGC CACCCCAGCA TGACCAAAGT CACCGATTAT
TTCACTGATC CCAGCGGGGC GCAGTTTTTG GTGATGGAAT ATGCGCCTGG TAAAGATCTC
GAAAAGATGA TGGATGAAGC CGAAGAGATG GTTTCATGGC GCAAGGTGGC CGAATGGGGT
CAAATTGTCT GTGATGTACT GACCTATTTG CACACCCAAG AGCCGCCAAT TATCCACCGC
GACATTAAAC CCGCTAATTT GCGTTTGACT CCCCACGGCG ACTTGATGGT CATCGACTTG
GGAATTGCTA AAGAATATCG CGATGGCTCA GCCACAACTC GCGCCGCCCA AGCATATTCG
GGCGGCTACT CACCAATCGA GCAATATTTG GGGCAAGGCA CTGACCCACG TTCAGATTTA
TATGCCTTGG GTGCAACGCT CTATCATTTG TTGGTTGGCA AGATGCCACC CGAAGCACCA
AACCGCTTGC GCGGCATTTC GATGCAAACC GTTGAGCAAG CTCGCCCCGA TATTCCAGTG
TTGTTGGCGC GGGCAATCGA TCGCGCGATG GCGATTGAGC CTGATCATCG CCCACCGAGT
GCTGCGGCTT TGCATATGGT CTTCGAACGA GTGCTTGAGC AAGATCAAGC GGTCAGCATC
AGCCAACCTG CGGTGATTGC CCAAGCGGCA ACCGCCCGAA CCCTGCAAGC TGCGGCGCAT
GCGGTTGCTA CGCCTGTGGC AATCAGTCAA CCACGCATCG GCACGCAACC GCGCTCAACC
ACCAATGCTG GTCAACCCAG CCGCCCATTA CATGTGCCAA GCGAACTGCG TGCTGAACGC
TTGCCGCATG GCATTATTTG GGAAGGTGAT CGGCGTGAAA TGGTGCGGGT TATGGCAGGG
CCAATGCCAA TGGGCAGCGA GGGCAATGAT CCCGATGAAG TTCCAGTGCA TCGGCTTGAC
CTCAAAACGT TTTTGATCGA CCGCTTTCCG GTGACATGTG CCGATTATGC GCGTTTTGTG
CAGGAAACCG GAACAACACC GCCACGGTAT TGGGGTGGCC CATTGCCGCC GCATATGATC
GAAGATCATC CAGTCGTCGA AATAACGCAC GACGAAGCGC GAGCTTACGC ACGTTGGGCT
GGCAAGCGCC TACCAACCGA AGCCGAATGG GAAAAAGCGG CAACATGGGA TGCCACAACT
GGGCACAAGC GGGTTTATCC TTGGGGCGAC CAATGGGATG AGCATCGCGC CAATGCGCGG
GAAGGCGGAG CTGGCGGAAC CGTGCCAATT GGCGCGTATT CACCACAGGG CGATAGCCCA
TGTGGCGCAG CAGAAATGGC TGGCAATATT TGGGAGTGGA CAGATACGGC CTATAAGCGC
TATCCTTATG ACCCAAGTGA TGGCCGCGAT TTCCCCAAGA ATGCTGGCTT GCGCGTCACC
CGTGGTGGGT CGTGGTCGTG TTCGCCGGAT GCATTACGCG GAGCGAATCG CAACATAGCT
GCTGCCAACG ATGCCGATTT TGAGGTGGGC TTCCGTTGTG CCGCTGATGT ACGCGAGGAT
TGGTAA
 
Protein sequence
MSTVMNALAC TNCQMLLAPG ARFCSHCGTV APTPPAINLP REASGTLRRG VILQERYKIM 
TFIGAGGFSR VYGAIDLRLK TPCAIKENNE FDQSAHAQFL AEAQLLARLR HPSMTKVTDY
FTDPSGAQFL VMEYAPGKDL EKMMDEAEEM VSWRKVAEWG QIVCDVLTYL HTQEPPIIHR
DIKPANLRLT PHGDLMVIDL GIAKEYRDGS ATTRAAQAYS GGYSPIEQYL GQGTDPRSDL
YALGATLYHL LVGKMPPEAP NRLRGISMQT VEQARPDIPV LLARAIDRAM AIEPDHRPPS
AAALHMVFER VLEQDQAVSI SQPAVIAQAA TARTLQAAAH AVATPVAISQ PRIGTQPRST
TNAGQPSRPL HVPSELRAER LPHGIIWEGD RREMVRVMAG PMPMGSEGND PDEVPVHRLD
LKTFLIDRFP VTCADYARFV QETGTTPPRY WGGPLPPHMI EDHPVVEITH DEARAYARWA
GKRLPTEAEW EKAATWDATT GHKRVYPWGD QWDEHRANAR EGGAGGTVPI GAYSPQGDSP
CGAAEMAGNI WEWTDTAYKR YPYDPSDGRD FPKNAGLRVT RGGSWSCSPD ALRGANRNIA
AANDADFEVG FRCAADVRED W