Gene Cthe_2128 details


Gene Information

Locus tag: Cthe_2128
Symbol:
ID: 4811175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2526898
End bp: 2528280
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 37%
IMG OID: 640107534
Product: extracellular solute-binding protein
Protein accession: YP_001038527
Protein GI: 125974617
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
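
The coordinate and length fields above are mutually consistent, and a minimal Python sketch can check that. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length
    includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2526898, 2528280)
print(length)                  # 1383
print(protein_length(length))  # 460
```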


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TTATAACAAT GGTGTTAAGC TTAATTATTT TAAGCATTTT ATTTCTGGCA 
ACGTCTTGTT CTGGCAGCCG GGATGACACG CAAGATGTTA TAGGTGGTAA AATAGTGATG
TATGCGGCAC CCGGCGACAA TGTTCAGTCA GAAATAAGAA ATATAGTAAG AAGCAAGTAT
CCAAATGTGG AGTTTCAAGT GGTTTCGTTC AATAATGCTG ACGAATTCAA AAGCAGACTT
TTAACTGAAT TGATGGCGGG AGAAGGTCCG GATGTTATTG TTTTAAGCCC ATCCACCAAA
AAGGGTTCAA TTACAATAGA AACTATGAGA AAGTTGGTAG AATCAGGAGT TTTCTGTGAT
CTGGAGCCAT ATATATCGAA GGATGAGAGT ATAAATTTGT CAGAGTATAA TGAGACTGTT
TTAAACAGCG GTGTTATAAA CGGCAAAAGA TACTTTATTC CCATAGCCTA TGATGTACCT
ATTTTTTGGA CGGCTAACTC CATTCTTGAG GAAAACAATA TAAAGGATGA AATAGCAAAC
TGGACGTTGA AGGACATGGC TGATTTTGCA GTTCAGTTTA AAGAAAAGAA TTCTGATAAT
TACCTCTTTG GCTATGGTGA CGGATTTATC AGAAATATTA TGTATGCGAA CTGGAGAGAA
TTTGTTGATT ACGAGAATAA GCAGGCAAGC TTTGACAGTC AAGAGTTTGT TGAATTTTTG
GAGGCAATTG GAGCTATTGA AAAAGCAGGC ATTTGTGATG AAAAACTTAT TAAAGAATAT
ACGGGGATGG AGTTTGAAGC TCTAAAGCAT GGGAAAATTA CTTTGATAAG CAGTACTGAG
TATCCCATAA ATCCTTGGGA ATTATGGTAT CGCAATTCGC ACATAAATTA CTATTTTAAT
CCGGATAGCA TAAGGCTTTC AAAATTTCCT ACATTTGGGG ACTTGGGCAG AATAGTGGCG
CATCCTACAG ATATAGTAGC GATAAACAAA AACAGCAAAA ATAAAGCAAC TGCATATGAG
GTGCTGAAAG TTTTTTTGTC AAAAGAAATT CAAAGTTCCC AACAATTTCG CGATAGAATG
GGAATACCGG TTAATGATGA GGCGATAAGA GAACTCATAG AGAAATATTC AGGAGAAGAA
GGAAAGACCA CCCTTCCTGT GGGAATGACC ATTAACGAAA CTATGGATAC CGTACCGTTA
CCGGAATCTG TAGTGGCGGA ATACAATTCA ATAATAAACG GAGTAACTGA ATGTGTACTG
GTTGACGAGC AAATAATTGA TTTTATGATT GAAGGATTCA ATGAATACAA AAACGGCAAA
ATGTCTGCTA AAGACGCAGC TCGGATGGTA CAGCAAAAAG TAAATTTGTT TTTAAATGAG
TAA
 
Protein sequence
MKKFITMVLS LIILSILFLA TSCSGSRDDT QDVIGGKIVM YAAPGDNVQS EIRNIVRSKY 
PNVEFQVVSF NNADEFKSRL LTELMAGEGP DVIVLSPSTK KGSITIETMR KLVESGVFCD
LEPYISKDES INLSEYNETV LNSGVINGKR YFIPIAYDVP IFWTANSILE ENNIKDEIAN
WTLKDMADFA VQFKEKNSDN YLFGYGDGFI RNIMYANWRE FVDYENKQAS FDSQEFVEFL
EAIGAIEKAG ICDEKLIKEY TGMEFEALKH GKITLISSTE YPINPWELWY RNSHINYYFN
PDSIRLSKFP TFGDLGRIVA HPTDIVAINK NSKNKATAYE VLKVFLSKEI QSSQQFRDRM
GIPVNDEAIR ELIEKYSGEE GKTTLPVGMT INETMDTVPL PESVVAEYNS IINGVTECVL
VDEQIIDFMI EGFNEYKNGK MSAKDAARMV QQKVNLFLNE
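
The protein sequence can be related back to the gene sequence with a short Python sketch. It translates codons using the amino acid assignments of NCBI translation table 11 (the same assignments as the standard code) and computes GC content. The helper names are hypothetical, alternative start codons are not handled, and only the first block of the gene sequence is used as input here.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


first_block = "ATGAAAAAAT TTATAACAAT GGTGTTAAGC"
print(translate(first_block))  # MKKFITMVLS
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and GC content fields listed in this record.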