Gene Teth514_1099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1099 
Symbol 
ID5876681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1140141 
End bp1141514 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content38% 
IMG OID641541453 
Productextracellular solute-binding protein 
Protein accessionYP_001662733 
Protein GI167039748 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGTAAAA CAAAAAAGTA TCTTTCGTTG ATGGTAGTTA TTGTTTTTGC ACTCACGATA 
ATGTTGGCAG GGTGTGGAGG ACAAAAAACT TCCCAGTCAT CTGAAGGAGC TCCAGCGCCT
CAAACAGAAA CTTCACAAAA GAAAGTAGAA GTTGTTTTCT GGCATAATAT GAAGGTAGTG
ACTGATAGAC AGTCTATTGA AGAAGCAGTT CAAATGTTTA ACAAAGAACA TCCGAACATT
GAAGTAAAAG CGGTATTAGT TCCAGGTGAC GAAACAGATG CAACGAAATT AATGACAGCT
GTTGCAGCAG GGGAAGGACC TGACGTTTAT TATCTTGATA GATTTACAGT AGCACAAAGA
GCTCATGCAG GCATGTTAGA ACCCTTAGAA GATTATCTAA CACAATTGGG TACAAATATT
GATGACTTAA AGAGTAAATT TTTCGATTTT GCAATTGAAG AAGCAACTTA TAAAGGAAAA
CTTTATGCAC TTCCGTGGGA TACTGATGCA CGTGTTTTAT ATTATAATAA AAAATTATTT
AAAGAAGCAG GATTAGATCC GGAAAGACCA CCACAAACTA TATCAGAACT CGATGAATAT
GCTAACAAAC TCACGAAAGT GCAAGGGGGA AAAATTTTGC AAATAGGTTT TATTCCTTGG
CGTGGTCAAG GATGGCCTTA CACTTGGGGT TGGGTATTTG GAGGAAAATT TTATGATCAT
GAAACCAAAA AATTTACTTT TGCAGATGAT CCTAAGATTG TTGCTTCACT AGAATGGCAG
AAAACCTATG CTGATAAATA TGGAATAAAA AATATTGATT CTTTCTTTGC TGCTTTTGGG
GATGGTGGAG GAGCAGAGCC TGTTGATCCA TTTATGATGG GAAAAGAAGC TATGAGGATA
GATGGTAACT GGTTTTTGAG CACGCTGAGA AAATTTGCAG ATCCTAAAGT ATGTGAATGG
GGAATAGCTC CAATACCATA TCCTGGAGGT AGGGAAAAAG ATTCAACTTG GGCAGGAGGT
TGGAGTTTAG TTATTCCAAA AGGTGCAAAA CATCCAAAAG AAGCAGCTGA ATTTATTCAA
TGGATGGCTA CAAAGGGTGC TATAAAATAT GCCAAAGATA CTGCTCATTT TTCTGCAATT
AAAGAAGGTA CATTGGAGGT TGTAAAAGAA GATCCAGATC AAAAATTGTT TTATGAACTT
TTAAATGGCC CCAATGCTCA CAGCCGCCCT GTTGTACCAG TAGGAGCACT TGTTTGGGAT
GAGTTAGTAA GGGCTAGAGA TGACGCACTT TATGGTAAAA AAGTACCTCA ACAGGCTTTG
AAAGAAGCAC AAGAGAAAGT TCAAAAAGAA CTTGATAAAG CTCTGAGTGA ATAA
 
Protein sequence
MGKTKKYLSL MVVIVFALTI MLAGCGGQKT SQSSEGAPAP QTETSQKKVE VVFWHNMKVV 
TDRQSIEEAV QMFNKEHPNI EVKAVLVPGD ETDATKLMTA VAAGEGPDVY YLDRFTVAQR
AHAGMLEPLE DYLTQLGTNI DDLKSKFFDF AIEEATYKGK LYALPWDTDA RVLYYNKKLF
KEAGLDPERP PQTISELDEY ANKLTKVQGG KILQIGFIPW RGQGWPYTWG WVFGGKFYDH
ETKKFTFADD PKIVASLEWQ KTYADKYGIK NIDSFFAAFG DGGGAEPVDP FMMGKEAMRI
DGNWFLSTLR KFADPKVCEW GIAPIPYPGG REKDSTWAGG WSLVIPKGAK HPKEAAEFIQ
WMATKGAIKY AKDTAHFSAI KEGTLEVVKE DPDQKLFYEL LNGPNAHSRP VVPVGALVWD
ELVRARDDAL YGKKVPQQAL KEAQEKVQKE LDKALSE