Gene Teth514_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_2201 
Symbol 
ID5876179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp2201890 
End bp2203152 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content37% 
IMG OID641542556 
Productextracellular solute-binding protein 
Protein accessionYP_001663809 
Protein GI167040824 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.455559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AACTTATAAG CATCTTTGTA TTGACGATCT TTGTATTAGC TACTGTTTTA 
GCTGGTTGTT CATCCAGTAA AAATAATACT TCCAGTGCCA ATGAGACAAA TACACAAAAA
CAAGAGACAG CAAAACCAGT TACTATAAAA TTAGGCATGT GGTCTTCATC TCCAGCAGAA
AAGAAGATAG TGGATGACCA AATAGCTAAG TTTAAAGAAA AATATCCAAA TATAGATGTG
CAAATTGAGA CAATTGTGGG AGATTACATG CAAAAATTAC AAACAGAACT GGCGTCAAAT
ACAGCACCAG ACATATTCTA TCTTGACAGC ATGCCGGCAC CACAGCTTAT GTCTTCAGGA
GTTTTAGAGC CATTAGATGA TTATATTAAG AAATACAATG TGGATGTAAA TGATTTCGAG
CCAGCATTGC TTTCCGCTTT TCAGTGGGAT GGAAAAACTT ATGGTTTACC AAAGGATTTC
AATACTCTAG CTTTGTTTTA CAACAAAGAC ATGTTTAAAG CGGCTGGAAT AAATGAGCCT
CCAAAAACAT GGGAGGAATT AAGAGATGTA GCTAAAAAAT TGACAAAAGA CGGTGTCAAA
GGTTTGGTTT TATCAGCAGA CCTTGCAAGA TTTGATGCTT TTATAAATCA AAATGGTGGT
TCAGTATATC AAGATGGAAA AGTTACTTTA AATCTGCCAG AGAATGCACA AGCTCTTGAT
TTTTATGTGA GCCTCATCAC AAAAGACAAA GTTGCTGACA CACCACAAAA CATGGGAGAA
GGCTGGAATG GAGATGCTTT TGCTGCTAAA AAAGCTGCAA TGGCAATAGA AGGTGGCTGG
ATGATACCAT TCCTCAAAGA AAAAGCTCCT GATTTAAACT ATGGTATAGC AGAGCTTCCA
GCAGGAAAGC AAAAATCTAC AATGGCTTTC ACTGTTGCAT ATGTGATGAA TAAAAACAGC
AAACATAAAG ATGAAGCCTT TAAACTTATT GAATTTTTAA CCGGTAAAGA AGGACAGCAA
TTTGTAGTAG ATTCAGGCCT TGCACTTCCA TCGAGAAAGT CTATGCAAGA AGGATTTAAG
GAGAAATATC CTGAAAGAGC TGCCTTTGTA GATGGTGCTT CTTATGCGGT ACCATGGCAA
TTCGGTTTGT ATGGCACAAA GGTAGTAGAT GCGGCTAATA AAGCCTGTGA AGCATTAATA
ATGAAGCAAA TAAGTAGTGC TCAGCAAGCT CTTGACAACG CACAAAAGGA AGTTGGACAA
TAA
 
Protein sequence
MSKKLISIFV LTIFVLATVL AGCSSSKNNT SSANETNTQK QETAKPVTIK LGMWSSSPAE 
KKIVDDQIAK FKEKYPNIDV QIETIVGDYM QKLQTELASN TAPDIFYLDS MPAPQLMSSG
VLEPLDDYIK KYNVDVNDFE PALLSAFQWD GKTYGLPKDF NTLALFYNKD MFKAAGINEP
PKTWEELRDV AKKLTKDGVK GLVLSADLAR FDAFINQNGG SVYQDGKVTL NLPENAQALD
FYVSLITKDK VADTPQNMGE GWNGDAFAAK KAAMAIEGGW MIPFLKEKAP DLNYGIAELP
AGKQKSTMAF TVAYVMNKNS KHKDEAFKLI EFLTGKEGQQ FVVDSGLALP SRKSMQEGFK
EKYPERAAFV DGASYAVPWQ FGLYGTKVVD AANKACEALI MKQISSAQQA LDNAQKEVGQ