Gene Cthe_2967 details


Gene Information

Locus tag: Cthe_2967
Symbol:
ID: 4810855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3485676
End bp: 3486854
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 43%
IMG OID: 640108389
Product: major facilitator transporter
Protein accession: YP_001039357
Protein GI: 125975447
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0738] Fucose permease
TIGRFAM ID:
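
The coordinate and length fields in this record are consistent with one another, which a short sketch can confirm. It assumes that the start and end coordinates are 1-based and inclusive (a common convention, not stated in the record) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 3485676
END_BP = 3486854

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = END_BP - START_BP + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1179
print(protein_length_aa)  # 392
```

Both results match the Gene Length and Protein Length fields.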


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATTCGT TGTTGTTGGC GCTTATTTAT CTCGCTTTTA TCAGTTTGGG ACTGCCCGAT 
GCACTTCTTG GCTCGGCCTG GCCTACCATG TACCCCGTGC TGGAAGTACC TGTTTCTTTT
GCAGGCATAA TATCTATGAT TATTGCAGGG GGTACCATTG TCTCGAGTTT AAATACAAAC
CGGGTGGTTC GCAAATTTGG CACCGGACTT GTGACTGCTG TCAGTGTATT GATGACGGCG
GTGGCTCTGT TTGGGTTTTC CGTTTCGAAA ACTTTTTGGA TGCTTTGCCT TTGGTCTATT
CCATACGGTC TTGGAGCAGG TGCGGTGGAT TCGGCCCTGA ACAATTTTGT GGCACTTCAT
TATGCTTCCA GGCACATGAG TTGGCTGCAC TGTTTTTGGG GTATAGGTGC GTCTGTTGGA
CCTTATATCA TGAGTTATTG TCTGACAGTT AAGAACAGTT GGGAAAGTGG TTATATGACA
GTAGGTGCTT TTCAAATCGT ATTAACAGTT ATTCTCTTTT TCAGTCTGCC TGTTTGGAAC
AAGCAGGCAA AGATCAAGAA GGAGAGTCAA ACAGAGGAAC CAAAGCATTT GAAAATTCAT
GAGGCATTAA AAATTAAAGG GGTTAAGCAG GTACTAATAG CGTTTTTCTC CTATTGTGCT
CTCGAGACCA CTGCAGGCTT GTGGGCCAGC AGTTATCTCG TGCTGCATCA GGGAATTGAG
GCAAAAGTGG CTGCAAGATG GGCTTCTTTG TTTTATTTAG GTATTACTTT CGGGCGCTTT
CTGAACGGTT TTGTTACTGA CAAATTAGGG AACCGCAATA TGATTCGCAT AGGACTAGGT
ATTATAACCA TAGGATTGGC AGCGGTGATT TTGCCGGTGC AAATTGAACT TGTAACGTTG
GCAGGTTTGG TTTTAATCGG CATAGGATGC GCTCCTATCT ATCCTTGCAT CATACATGAG
ACACCAAAGA ATTTTGGAGC GGAGAATTCT CAGGCTATTA TCGGAATTCA GATGGCAAGT
GCTTATACCG GTTCAACATT CATGCCGCCT ATATTTGGTG TGCTGGCAAA ATTTACAACG
ATTTCTTTAT ATCCGGTTTA TTTGACATTC TTCCTGATTT TGATGATAGT AATGACGGAA
AGGCTTAATC GTCTTGTAGT AAGTAAAGAG AGCAGATAA
 
Protein sequence
MYSLLLALIY LAFISLGLPD ALLGSAWPTM YPVLEVPVSF AGIISMIIAG GTIVSSLNTN 
RVVRKFGTGL VTAVSVLMTA VALFGFSVSK TFWMLCLWSI PYGLGAGAVD SALNNFVALH
YASRHMSWLH CFWGIGASVG PYIMSYCLTV KNSWESGYMT VGAFQIVLTV ILFFSLPVWN
KQAKIKKESQ TEEPKHLKIH EALKIKGVKQ VLIAFFSYCA LETTAGLWAS SYLVLHQGIE
AKVAARWASL FYLGITFGRF LNGFVTDKLG NRNMIRIGLG IITIGLAAVI LPVQIELVTL
AGLVLIGIGC APIYPCIIHE TPKNFGAENS QAIIGIQMAS AYTGSTFMPP IFGVLAKFTT
ISLYPVYLTF FLILMIVMTE RLNRLVVSKE SR
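
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Amino acids for the 64 codons, with bases ordered T, C, A, G at each position.
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, then its last nine bases.
print(translate("ATGTATTCGT TGTTGTTGGC GCTTATTTAT"))  # MYSLLLALIY
print(translate("AGCAGATAA"))                          # SR
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above. Applied to the full 1179 bp gene sequence, the same function would be expected to yield the complete 392 aa protein.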