Gene Cthe_1862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1862 
Symbol 
ID4809413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2207255 
End bp2208367 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content41% 
IMG OID640107281 
ProductABC transporter related protein 
Protein accessionYP_001038276 
Protein GI125974366 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGCG TTAAACTTAA AGGTGTGTAC AAAAGATATC CAGGTGGGGT TACTGCGGTA 
AACGACTTTA ATTTGGATAT TGAAGACAAG GAATTTATTA TATTGGTAGG ACCGTCTGGA
TGTGGAAAAA CTACAACATT GAGAATGGTT GCCGGATTGG AAGAAATTAC GGAAGGTGAG
CTTTATATAG GTGACAAACT GGTCAACGAC GTGGCACCTA AAGATAGAGA TATAGCGATG
GTTTTCCAGA ACTACGCTTT GTATCCGCAT ATGTCTGTGT TTGACAACAT GGCATTTGGA
TTGAAGCTTA GAAAAGTTCC CAAAGATGAG ATTAAGAGGA GAGTTTTGGA GGCTGCAAAG
ATTCTTGACA TAGAACACTT GCTGGAAAGA AAGCCGAAGG CATTGTCCGG AGGTCAGAGA
CAGAGGGTTG CGCTTGGACG TGCCATAGTT CGTAATCCTA AGGTATTCTT GATGGATGAG
CCTCTGTCAA ACCTTGACGC AAAACTCAGA GTTCAGATGA GAACCGAAAT CAGCAAGCTG
CACCAGAGAC TTCAGACAAC ATTCATCTAC GTTACTCACG ACCAGACAGA AGCTTTGACG
ATGGGTACAA GAATTGTTGT TATGAAAGAC GGATACATTC AACAGGTTGA TACTCCTACA
AATCTTTATG AGAGACCTTG CAACATGTTC GTAGCAGGAT TTATCGGAAG CCCGCAGATG
AACTTTGTAA ATGCAAGAAT TGAAAAACGC GGGGATGAAA TGCACCTTCT GTTTGGAAAA
CAGGATATTA AACTTCCGGA AGGAAAGGCA AAGAAGCTTG AGTCCAGCGA ATATGTGGGC
AGAGAAGTGG TAATGGGTAT ACGTCCTGAA AACATTCGTG ATGAAGAGAT TTATCTTGAA
TCAATGTCTG AGAATGTTGT AGAGGGAAGA GTTGAAGTTG TTGAAATGCT CGGTTCCGAA
ACATTGATTT ACATGGTAAT AGATGACTTT GAGTTTACTG CAAGAGTTAA TCCGAGATCA
AAGGCTAGAC CGGGCGATGT GATTAAGGTT GCTTTTGATG CCAACAAGAT TCATCTCTTT
GACAAGGAAA CTGAAAAAAC AATAATGAAC TAA
 
Protein sequence
MASVKLKGVY KRYPGGVTAV NDFNLDIEDK EFIILVGPSG CGKTTTLRMV AGLEEITEGE 
LYIGDKLVND VAPKDRDIAM VFQNYALYPH MSVFDNMAFG LKLRKVPKDE IKRRVLEAAK
ILDIEHLLER KPKALSGGQR QRVALGRAIV RNPKVFLMDE PLSNLDAKLR VQMRTEISKL
HQRLQTTFIY VTHDQTEALT MGTRIVVMKD GYIQQVDTPT NLYERPCNMF VAGFIGSPQM
NFVNARIEKR GDEMHLLFGK QDIKLPEGKA KKLESSEYVG REVVMGIRPE NIRDEEIYLE
SMSENVVEGR VEVVEMLGSE TLIYMVIDDF EFTARVNPRS KARPGDVIKV AFDANKIHLF
DKETEKTIMN