Gene Cthe_2382 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2382 
Symbol 
ID4811034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2847193 
End bp2848593 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content40% 
IMG OID640107795 
Productmajor facilitator transporter 
Protein accessionYP_001038777 
Protein GI125974867 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.321963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAG GTAAAATGGA TTTTAGGACT GTTTATACTG TTTTTATTTT TATTTTTCTT 
GCTTCCCTCG ACTACGCCGT CGTCGGTCTT TTTCCGCCTT TGTTTTCGTC CATTGCCAAA
AGCTTAAATG TGCATATATC GGCCATGGGA AGTGTTTCAG CCGTAACAAT ACTCTTTACT
GCTTTGTCAA GCATCGTTTG GGGTTACCTT GCAGACAAAG GTAACAGAAA AAGACTTATA
ATTATGGGTA CTTTAATCTG GTCATTATTC CTTTTCCTTA CCTCTTTGAG CCAAAGTTAC
CTTCATCTGA TTATCTTTCA GATTTTTACC GGACTGGGGC TTGGGTGCAA TAGCTCTATT
GGCTTCAGTG TACTGACGGA TTATGTGCCC AAGAAATATC TTGGAACTGT TATGAGTTTG
TGGGGGCTTT CGCAAGGCTT TGGCTGCATT GCAGGATCAA TCATGGCCCC CATTGTCTCA
TCCAGGCTTG GATGGCGCAT GCCTTTTATA ATTATATCTT CCTTAGGTGC AGTTTTCATA
TTTATGTATT TTTTTATAAA GGAACCGGTA AAAGGCGCTG CAGAGCCTGA ACTTGGAGGG
ATTGCACTTA AAAGTTACAA CTACAATATA AGCCTTTCAA GTGTTAAAAG CATTTTAACA
AAGAAAAGTA ATTTTTGGCT TATGATTCAA GGTTTTTTTC TTAATATAAC TCTCGGCACA
CTTTTATGGC TTCCCACGTT GTATGCAGCC AAAATAGAAG CCCAGGGTTA CAGTGAAGAA
ACTTCCCTGA TTGCGGCAAG CTATTTTTAT GCCCTCTTTC AGCTGGGCGG ACTTTCATCG
ACTTACTTTG GGTATCTGGG CGACAGGCTT CAGAAAAAAA CTTTGAGAGC CAGAGCACTT
CTTACGGGTT CTCTCATCTT TTTTATGATG CCGTTTTACA TTTTGGTTTT TATAATTCCA
TTAAACAACC TTACTCTGCC CGATGGCGGC AGCGCATTTT CCATTTTGCT TTCTTTGCTG
GGACAAATTT GTTTAAATCC ATGGATACTG AGTGTTTTTC TTCTTTCAAT ATTTGCATCC
GCAGTCCAAT CAGCCAATAT TCCCAACTGG CTTGCTCTTA TTACTGATGT CAATCTTCCC
GAGCACAGGG CTACAAGTTT CAGTATATCC AATTTTATAA ATGGAATCGC CCGTTCGTGC
GGAAATGCTT TCATGGGTAT AGCGTTGGGT ATAGTGTCTT CCTTCTTCGG CGAACCTGAC
AATTACATAG TTGCGATGGC AACATTTCAG CTTTTTGTTA TTCCTTCGGT ATTTGCTTAC
TACAAAGTTT CAGAGAACAG TAAAAGGGAT ATAACAAAAA TGAAGTCAAT TCTGCGAAAG
AGAGCAAAAA ATATGCCGTA A
 
Protein sequence
MAKGKMDFRT VYTVFIFIFL ASLDYAVVGL FPPLFSSIAK SLNVHISAMG SVSAVTILFT 
ALSSIVWGYL ADKGNRKRLI IMGTLIWSLF LFLTSLSQSY LHLIIFQIFT GLGLGCNSSI
GFSVLTDYVP KKYLGTVMSL WGLSQGFGCI AGSIMAPIVS SRLGWRMPFI IISSLGAVFI
FMYFFIKEPV KGAAEPELGG IALKSYNYNI SLSSVKSILT KKSNFWLMIQ GFFLNITLGT
LLWLPTLYAA KIEAQGYSEE TSLIAASYFY ALFQLGGLSS TYFGYLGDRL QKKTLRARAL
LTGSLIFFMM PFYILVFIIP LNNLTLPDGG SAFSILLSLL GQICLNPWIL SVFLLSIFAS
AVQSANIPNW LALITDVNLP EHRATSFSIS NFINGIARSC GNAFMGIALG IVSSFFGEPD
NYIVAMATFQ LFVIPSVFAY YKVSENSKRD ITKMKSILRK RAKNMP