Gene Teth514_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_0159 
Symbol 
ID5875649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp159361 
End bp160563 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content33% 
IMG OID641540502 
ProductROK family protein 
Protein accessionYP_001661814 
Protein GI167038829 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAACAG CGGATCAATT ATTAGTTAAG CAAATAAATA AATCAATAGT GCTCAATACC 
ATAAGAAAAA AAGGGAATAT TTCAAGAGCG GATATTGCAC ATATTACAGG TTTAAACAAG
TCAACTGTCT CTTTCCTTGT TGATGAACTT ATAAACGAAG GGTTTGTAAG GGAAGAAGGG
CCTGGTGCAT CAAAAGGTGG TAGAAAACCA ATCATATTAA GCATCAATAA CAATGCTGGC
TGTATAATAG GTATTGATTT GGATGTAAAC TATATATTAA TTGTTTTGAC TGATTTAATG
GCAAATGTCA TTTGGGAAAA GAAAGTCGAT ATAAAAATTG GTGAAAGTCA ACAGGCTATT
ATTGAGCGTT TAATAGAATT GATTGACGAA GCAATTTTAA ATGCACCTAA CACTATTAGG
GGAATTTTAG GTATTGGGAT TGGTGTGCCA GGCATTGTGG ATTACAAAAA AGGCAGTATA
TTGATGGCGC CGAATTTAAA ATGGCAGGAT GTGCCTCTTA AAGAAATCAT AGAAAATAAA
TTCAAAATTA AAGTACACAT AGATAACGAA GCAAATGTAG GAGCAATAGG TGAGAAGTGG
TTTGGCACTG GAATTAAATA CAACAATTTC GTATATGTAA GTGCTGGAAT TGGTATTGGT
ACAGGAATAA TAATCAATGG AGAACTGTAT AGAGGGACAG TAGGGCTGGC AGGAGAAATG
GGACATATGA CAATAAATAT ACATGACCAT CAATGTAGTT GTGGAAATAC AGGTTGTTGG
GAAAATTATG CATCAGAAAA AGCACTATTC GATTATATAC ATACACAGTT GATAATGGGA
AAATCTGATA ATTATATAAA TAAAGATAAT TTTAATACAC TTAGTGCTCT TGATATTATA
AATTATGCTC AAAAAGGAAG CGAAATAGCA GTAGAGGCTT TAAAAGAAAT TGGAAGGAAA
TTGGGTGTGG GAATTGTCAA TGTAATAAAT ACATTTAACC CGGAACTTGT GATTATTGGA
AATACGTTGT CTTTAGCAGG AGATTTGATT TTGGATGAGG TTTTGAAAGA AGTAGAAAAA
AAGTGTCTTG TGTATAGATA TTATAAAGTA AAAATCAAAA CCTCTAAACT TCAATTTCAT
GCAGGAGCAA TTGGAGCAGT ATCGCTTGTT ATTTCAGAGT TGTTTGCATA TCCTGGACTT
TAA
 
Protein sequence
MITADQLLVK QINKSIVLNT IRKKGNISRA DIAHITGLNK STVSFLVDEL INEGFVREEG 
PGASKGGRKP IILSINNNAG CIIGIDLDVN YILIVLTDLM ANVIWEKKVD IKIGESQQAI
IERLIELIDE AILNAPNTIR GILGIGIGVP GIVDYKKGSI LMAPNLKWQD VPLKEIIENK
FKIKVHIDNE ANVGAIGEKW FGTGIKYNNF VYVSAGIGIG TGIIINGELY RGTVGLAGEM
GHMTINIHDH QCSCGNTGCW ENYASEKALF DYIHTQLIMG KSDNYINKDN FNTLSALDII
NYAQKGSEIA VEALKEIGRK LGVGIVNVIN TFNPELVIIG NTLSLAGDLI LDEVLKEVEK
KCLVYRYYKV KIKTSKLQFH AGAIGAVSLV ISELFAYPGL