Gene Haur_4041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4041 
Symbol 
ID5735903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5158531 
End bp5160855 
Gene Length2325 bp 
Protein Length774 aa 
Translation table11 
GC content53% 
IMG OID641281192 
Productserine/threonine protein kinase 
Protein accessionYP_001546801 
Protein GI159900554 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[S] Function unknown
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG1520] FOG: WD40-like repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.224391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTTAA TCAAACGAGG TGGGACATCG CGTTTAACCC CAATCGATGG GCCAGATGCT 
GGCTCGTCAA GCGGGCAGGC CGATGATGCA ACACGTACTT TCTCTCCACA ACCAACCAGC
GATGACTCGA CTCGCGGCTT AGATGCAACC CACACGATTG CTCCAGCCGA TGCCGATTCG
ACTCGTGGCT TAATCGCTGC GGGCAATGAT TCAACTCGCA GCATCGCTCC GCCGGCCAAT
CCTGATGCCA CCCGTGGGAT CAACCAAAAC CCCGATGCCA CCCGTGGAAT CAATCAAAAT
CCTGATGCTA CCCGTAGCGT TAGCAATCGT GTTGGCTCAA TTTTACGCAA TCAAGGGGTT
TCACGCTTAG GCACCGAGGA TATTCGCAAT CGTGCGCCTA ACCCCGATGC CACCCGTTTG
CAGCAACCGG GAGCCAAACC TGCCGAGCCA ACTAATGCCC AATCCAGTTC GCCAGCGCTC
AAAGCTGGCA CGGTACTCGA AGGGCGCTAT GTGGTTGAAG GCGTTTTAGG CATCGGCGGG
ATGAGCGTGG TCTACAAGGG CCGCGACACC CGTTTTAAAG ATGTAACCCG TTTTTGCGCG
ATCAAAGAGA TGTTCCAAAG CTCGCTCGAT TCGCAAACCC GTTTATTGAG CTTGAAGAAC
TTCGAACGCG AAGCTGGGTT ATTGGCGACC CTCAACCATC CAGCTATTCC CAAAGTCTAC
GACTTCTTCG AGGAAGCAGG CCGCGCCTAT TTGGTGATGG AATTAATCGA AGGTCATGAT
CTCGAAACCG TGCTCGAAAA AGCCAACGGC CCTTTGGAAG AGCAACAAGT TGGGCGTTGG
GCAATTCAAC TCTGCGATGT GCTGAATTAT CTGCATGGCC ATGAACCTGA GCCAATTATC
TTCCGCGACT TAAAACCATC GAATATAATT GTGACTCCAA CTGATCGGAT TGTGCTGATT
GACTTCGGGA TTGCGCGGGT CTTTACCCGC ACCGATAAAA AAGGCACGAT GATTGGCACG
GAAGGTTACT CGCCGCCTGA ACAATATCGT GGGGTTGCCG AAGCACGGGG TGATGTCTAC
GCTTTGGGCG CAACCTTGCA CCATTTGCTG ACCAATATTG ACCCGCGGTT GGAAACCCCA
TTTACCTTCG GTGATCGCCC AATTCGTCAA TTGAACCCAA CTGTTTCGCC AGAAGTTGAA
GCGATTGTGA TGAAATCGCT GGAATACGAT ATGGCCAATC GTTGGGGTTC GGCGGCTGAA
TTCCAGTCGG TCTTGTTGAG CGTGCCAGGC TTTGCCCCCG CTGGGGTTGC AGTGGCAGCA
CCAGCAACTC CGGCCTTTGG TGCCGCTATT CGCCGTGGCG GCAAAAATGC CGAGGTGCTT
TGGAAGTTCC GCTGTGAAGA TGAGGTGCGT TCATCGCCAT TTGTGCGCAA TGGTACGCTC
TACATCGGTT GCTACGATAC CAACCTCTAT GCAATTGATA CCAAGCGCGG GGAATTTCGC
TGGAAGAAGC CAACCGAGGG CGGTATTTCA TCAACACCCA CAGTTTGGGA TGATATAGTG
ATTGTTGGCT CGGATGATGG CAATGTCTAT GCCTTTGATA CCCGTGCTGG CACGCAACGC
TGGGTCTTTC GTACCGAAAA ACCGGTGCGC TCATCGCCCC GCGTCCAAGA TCGCTTGGTC
TATTTTGGCT CCGACGATTA CCACTTGTAT GCGGTTGATG CCACCAATGG TCGCCAGATT
TGGCGCTATC GCGGCTGGCA ATGGATTCGT TCCTCGCCCT GTTTGACCAG TAATATGGTG
ATTTTTGGCT CGGGCGATGG CAGCATTGTG GCGCTTGATC TCTTCAAGGG CGGCATTCGT
TGGAAGCAAA AACTCCAAGG TGGGATTGTT TCAAGCGCCA CGGCTAACGA TAAGATGGTG
CTCGTTGGCT GTATGGATAA TAATTTGCAT GCGCTCGACC TTGAAGGTGG TTGGCCGATC
TGGAAATTCC GCACCAGCCA CTACGTCAAT TCGTCGCCAA TTATCATTGG TAATCGAGCT
TTTGTGGGCG GGATTGACGG CAATATTTAT GCAGTCGATC TCAAAAATGG CAAACAAGTT
TGGCAATATA ATACCGGAGC ACAAATCGTT TCATCACCCG TTGCCGATTC AGGCCGAATC
TACATTGGCG CGGCTGATGG CACGGTTTAT TGTCTCGACG CTGGATCGGG TACACCTGTT
TGGACGCATA CCTGCGAAGG CCCAATCGTT TCGACCCCAG CGGTTGTTGA AGGGGTCGTC
TATATCGGGT CGATGGATCA CCAGGTCTAT GCCTTACGTG CTTAG
 
Protein sequence
MALIKRGGTS RLTPIDGPDA GSSSGQADDA TRTFSPQPTS DDSTRGLDAT HTIAPADADS 
TRGLIAAGND STRSIAPPAN PDATRGINQN PDATRGINQN PDATRSVSNR VGSILRNQGV
SRLGTEDIRN RAPNPDATRL QQPGAKPAEP TNAQSSSPAL KAGTVLEGRY VVEGVLGIGG
MSVVYKGRDT RFKDVTRFCA IKEMFQSSLD SQTRLLSLKN FEREAGLLAT LNHPAIPKVY
DFFEEAGRAY LVMELIEGHD LETVLEKANG PLEEQQVGRW AIQLCDVLNY LHGHEPEPII
FRDLKPSNII VTPTDRIVLI DFGIARVFTR TDKKGTMIGT EGYSPPEQYR GVAEARGDVY
ALGATLHHLL TNIDPRLETP FTFGDRPIRQ LNPTVSPEVE AIVMKSLEYD MANRWGSAAE
FQSVLLSVPG FAPAGVAVAA PATPAFGAAI RRGGKNAEVL WKFRCEDEVR SSPFVRNGTL
YIGCYDTNLY AIDTKRGEFR WKKPTEGGIS STPTVWDDIV IVGSDDGNVY AFDTRAGTQR
WVFRTEKPVR SSPRVQDRLV YFGSDDYHLY AVDATNGRQI WRYRGWQWIR SSPCLTSNMV
IFGSGDGSIV ALDLFKGGIR WKQKLQGGIV SSATANDKMV LVGCMDNNLH ALDLEGGWPI
WKFRTSHYVN SSPIIIGNRA FVGGIDGNIY AVDLKNGKQV WQYNTGAQIV SSPVADSGRI
YIGAADGTVY CLDAGSGTPV WTHTCEGPIV STPAVVEGVV YIGSMDHQVY ALRA