Gene Haur_0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0023 
Symbol 
ID5736857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp28950 
End bp31190 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content54% 
IMG OID641277144 
Productserine/threonine protein kinase 
Protein accessionYP_001542803 
Protein GI159896556 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATG ATCTGATTGG GAAACAACTC GCCAACTTCC GCATCGACCG CGTTCTAGGC 
CGTGGTGGCA TGGGCTTGAT TTACGTTGGC CACGATATCA AGCTTGATCG GCCTGTGGCA
ATCAAGGTGA TCGATGGTCG TTATCGTCAA GTGCCAAGCT ATGCCGAGCG TTTTGTGCGC
GAAGCCCGCG CTATTGCCAC GTGGCGGCAC GAAAATATTG TGCAGATCTA CTACGCCGAC
GATACTGACA ACCTCTATTA TTTTGTCATG GAATATATCG ATGGCGTTGA TCTCTCGAAA
GTTTTAGCCC AATATACCAG CACCAACAGC CTCTTGCCTC ATAGCGCCGT GATGCACATT
GCCCGTAGTT TGGCCAGTGC GCTCGATTAT GCCCATAAAA AGGGCGTAAT TCACCGCGAT
ATCAAGCCAT CGAATGTGAT GGTGGCCCAT GATCAGCGGG TCGTGTTGAT GGATTTTGGC
TTGGCGCTCG ATACTCAGCT TGGCTCGCAG GGCGAAGTTT TTGGCACGGC CCACTATATC
GCCCCGGAGC AAGCACGGCG TTCGGCTGAT GCCATCCCCC AATCAGATTT ATATGCCTTG
GGCGTGATGT TGTATGAATT GTTGACTGGC TCGGTGCCAT TCGATGATCC TTCGGCAACC
AGCGTAGCCT TGCAACATCT TACTCAGCCG CCGCCACCGC CCCAAGAGAA AAACCCCAAG
CTGAATAATG CTACAGCAGC AGTGTTGCTC CAAGCTTTAG CTAAAGCCCC AGCCGAGCGC
TTTCAAACGG GCGCAGAGCT GATCGCAGCC CTAGAACAGG CTTTGGGGAT CACAGGCAGT
CATCAACTTG CGGCCAATGC CAATGCTGCG CCTGGAGTGC GACCACCTTC GCAACCAGGA
GCCTTGGGCT TTGCCACTTT CGACCCCGAA AAACCTTTGG AAGGCCAATA TCTCGATGGC
TATCGGGTTG AAGCCTTGCT GGGTCGTGGT GGCATGGCCA ATATCTTTCG AGGGGTTGAT
GTGCGGCTCA ATCGCACCGT CGCGATCAAA GTGATCGACA CGCCCTTCCG CAACGATCAA
AGCTATGCCG AGCGTTTCAA TCGTGAAGCC CAAGCGATTG CCCAACTTGA GCATCCCAAC
ATTGTGCGGC TCTATCACTA TGGCGATTCA TATGGCCTGT TGTATATGGT GATGGAGTAT
ATCGAGGGCG ATAATCTGCA CAAGGTGATC GAGCAAGTGC GGACTAGCGC CCAAGAATGG
ACTCCACGCA CACTCTGCCG CATGATTCGT GAAGTTTGTG CTGCTTTGGA TTATGCCCAT
AGCAAAGGCG TTATCCATCG TGATGTCAAA CCATCGAATA TTATGATCAA CAAAGATGGA
CGGGCGATTT TGGCCGACTT TGGCTTGGCT TTGCTGACCG AAGTTGGCAC ACGCGGTGAG
ATTTTCGGTT CACCACACTA TGTCGCGCCT GAACAAGCGA TTTCTTCGGC CAAAGTTGTG
GCTCAAACTG ACCTCTACAG CTTGGGCGTA ATTTTGTATG AGCTTTGGAC GGGCCAAGTA
CCATTTGACG ACGCTGATCC ATTGGCGATT GCGATGTTGC ATATGGCCGA GCCACCCAAA
GCCCCGCGCT CGATCAATCC GGCGATTTCG GCCCAACTTG AGGCGGTGAT TCTCAAAATG
TTGGCCAAAG ATCCGGCTGA GCGCTTCCCA ACTGGTTTGG ATTTAGCCGA AGCGCTCGAA
GCAGCCTTGG GCATTTCATC GACTGGCACT GGCGAATATG TGCGCAGCCA AATCACGCCG
GTGCTGGATC AAGCGGTCAA TGCCCCCAGC ACCCCGGCAA TTTCAAACCC AACGCCGCCA
GTGGTCGCCA GCGAGCCAAA AACTCCAACC ACTCCGACTG CGCCTAAAAC GGCCAGCCAA
CCGATTCCCA ACGTCATTCC TGCAGAGTCG GTGCCAACGC CGCCAACCCC CGCGACCAAA
TATGAAAAAG CGTCTTCTGA GCCAACTAAA CGGCGGGTAC AGCCATTGCC GCCAACCCCA
GCCGCCGTGG CCAAAGCCCC AGAGCCAACC AGTGCGCCAA CCATGCCCAT GCCACCCGCC
GATAACACCA TTCGCGAAGC ACCAGCGCCG CGTTTGTATA CCAATCGTCA ACAACGCAGT
AATCTGCTGA TTATGGCGAT TGGGGTGATT GTGCTGATTA CGTTCATTGT GATTATGTGG
CAGTTGATGA GTATGATCTA G
 
Protein sequence
MTDDLIGKQL ANFRIDRVLG RGGMGLIYVG HDIKLDRPVA IKVIDGRYRQ VPSYAERFVR 
EARAIATWRH ENIVQIYYAD DTDNLYYFVM EYIDGVDLSK VLAQYTSTNS LLPHSAVMHI
ARSLASALDY AHKKGVIHRD IKPSNVMVAH DQRVVLMDFG LALDTQLGSQ GEVFGTAHYI
APEQARRSAD AIPQSDLYAL GVMLYELLTG SVPFDDPSAT SVALQHLTQP PPPPQEKNPK
LNNATAAVLL QALAKAPAER FQTGAELIAA LEQALGITGS HQLAANANAA PGVRPPSQPG
ALGFATFDPE KPLEGQYLDG YRVEALLGRG GMANIFRGVD VRLNRTVAIK VIDTPFRNDQ
SYAERFNREA QAIAQLEHPN IVRLYHYGDS YGLLYMVMEY IEGDNLHKVI EQVRTSAQEW
TPRTLCRMIR EVCAALDYAH SKGVIHRDVK PSNIMINKDG RAILADFGLA LLTEVGTRGE
IFGSPHYVAP EQAISSAKVV AQTDLYSLGV ILYELWTGQV PFDDADPLAI AMLHMAEPPK
APRSINPAIS AQLEAVILKM LAKDPAERFP TGLDLAEALE AALGISSTGT GEYVRSQITP
VLDQAVNAPS TPAISNPTPP VVASEPKTPT TPTAPKTASQ PIPNVIPAES VPTPPTPATK
YEKASSEPTK RRVQPLPPTP AAVAKAPEPT SAPTMPMPPA DNTIREAPAP RLYTNRQQRS
NLLIMAIGVI VLITFIVIMW QLMSMI