Gene Cthe_1202 details

Gene Information

Locus tag: Cthe_1202
Symbol:
ID: 4809894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1432762
End bp: 1433937
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 40%
IMG OID: 640106625
Product: major facilitator transporter
Protein accession: YP_001037627
Protein GI: 125973717
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00882] oligosaccharide:H+ symporter
            [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes)
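
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1432762
end_bp = 1433937

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1176
print(protein_length)  # 391
```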


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.207324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTTGG ATAATCTTAA ACGTTCAGAA AGATACCCGT TCTCTTTTAT TCTGTTTTAT 
TCCTTGTTTT ATATGGGCCT TGCGGTATTT GGCGTGTTTA TGCCTGTGTA TTTGGAAGGG
CTGGGCTATG ACAATACGGA TATAGGAACA TTTCTTTCAA TCAGTTCGTT TGTCGGCCTG
TTTGCACAGC CCATTTGGGG TGTCATAAGT GACCGGGCAA AATCCAAAAA CAATGTGCTG
AAAATGTTGG TGCTTTTCAG CAGCATTGCC ATTTTTATGT TTCATCTCTC GGGCAACTAT
TACTATATAT TTGCGGTAAT GGTTGTTTAT GCCTTTTTCC AAACGCCCAT TACTCCGATA
GGTGATGCGA TTACATTGGA GTATATTACT GACACAAAAT GGAAGTATGG CCCGATAAGG
CTTGCCGGTG CATTGGGATA TGCGGTGATG GCATTTATCG GAGGGGCATT GACAAGAAAA
AATATCAACG CTATTTTCTT TATATGCTTT GTCATAGGTA TTATGTCTTT GATTACAGTA
TTTAGAATGC CAACGGTAAA AGGACATCAA TCGGACGGAA ACAAGCTTTC CATTTTAGAA
GTTTTCAAAA ACAGCGAACT TGTGCTGCTT ATGGGATTTA CACTTGTTAT TCATACTACC
ATGGGTTTTT ATAATACTTT CTTTCCGATT TACTATAAAA ACATGGGTGC TGACAACACC
ATTCTGGGAT TGGCGGTGTT TATCGGCTCG GCGAGTGAAA TAATCTTCCT TGTTTTCGGC
GACAGGATAA TAAAACGTTT GGGAATCAAG TTTACGCTGT TCGGTGCAGC GGTTGTTGCA
GTTGTACGGT GGGCAAGTTT GGGATTGATT AACAATATTT TTGCAGTGCT TGCACTCCAA
ATTCTCCATG GTTTTATATT CATTGTTTTG GCCTACTCCA TGGCAACATA TATCAATAAT
GAGATGCCAC CTGAATTGAA GGCCTCAGGA CAGACGGTAA ACTCCGTCAT AGGTTTGGGT
ATTTCCAGGA TAATTGGAAG TACAGGCGGC GGTGTGATAA GTGATTTAAT CGGAATCAGG
CAGGTATTCT TTTTAAATTC GGTTATTGTT CTTGCTTCAA TTGTCATTTT TGGCGCAATA
TTTTTGGTAA GAAGACAAAA AATTACAGGA CAATAG
 
Protein sequence
MILDNLKRSE RYPFSFILFY SLFYMGLAVF GVFMPVYLEG LGYDNTDIGT FLSISSFVGL 
FAQPIWGVIS DRAKSKNNVL KMLVLFSSIA IFMFHLSGNY YYIFAVMVVY AFFQTPITPI
GDAITLEYIT DTKWKYGPIR LAGALGYAVM AFIGGALTRK NINAIFFICF VIGIMSLITV
FRMPTVKGHQ SDGNKLSILE VFKNSELVLL MGFTLVIHTT MGFYNTFFPI YYKNMGADNT
ILGLAVFIGS ASEIIFLVFG DRIIKRLGIK FTLFGAAVVA VVRWASLGLI NNIFAVLALQ
ILHGFIFIVL AYSMATYINN EMPPELKASG QTVNSVIGLG ISRIIGSTGG GVISDLIGIR
QVFFLNSVIV LASIVIFGAI FLVRRQKITG Q
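
As a sanity check, the protein sequence can be re-derived from the gene sequence. The sketch below assumes translation table 11 (the bacterial, archaeal and plant plastid code), whose codon-to-amino-acid assignments match the standard code and which differs from it only in the permitted start codons. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last few codons of the gene sequence are used for illustration.

```python
# Codon table built from the conventional T, C, A, G ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence above.
first_codons = "ATGATTTTGG ATAATCTTAA ACGTTCAGAA"
last_codons = "ACAGGACAATAG"

print(translate(first_codons))  # MILDNLKRSE
print(translate(last_codons))   # TGQ
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` applied to the same string can be compared against the reported GC content.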