Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1202 |
Symbol | |
ID | 4809894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1432762 |
End bp | 1433937 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106625 |
Product | major facilitator transporter |
Protein accession | YP_001037627 |
Protein GI | 125973717 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00882] oligosaccharide:H+ symporter [TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.207324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTGG ATAATCTTAA ACGTTCAGAA AGATACCCGT TCTCTTTTAT TCTGTTTTAT TCCTTGTTTT ATATGGGCCT TGCGGTATTT GGCGTGTTTA TGCCTGTGTA TTTGGAAGGG CTGGGCTATG ACAATACGGA TATAGGAACA TTTCTTTCAA TCAGTTCGTT TGTCGGCCTG TTTGCACAGC CCATTTGGGG TGTCATAAGT GACCGGGCAA AATCCAAAAA CAATGTGCTG AAAATGTTGG TGCTTTTCAG CAGCATTGCC ATTTTTATGT TTCATCTCTC GGGCAACTAT TACTATATAT TTGCGGTAAT GGTTGTTTAT GCCTTTTTCC AAACGCCCAT TACTCCGATA GGTGATGCGA TTACATTGGA GTATATTACT GACACAAAAT GGAAGTATGG CCCGATAAGG CTTGCCGGTG CATTGGGATA TGCGGTGATG GCATTTATCG GAGGGGCATT GACAAGAAAA AATATCAACG CTATTTTCTT TATATGCTTT GTCATAGGTA TTATGTCTTT GATTACAGTA TTTAGAATGC CAACGGTAAA AGGACATCAA TCGGACGGAA ACAAGCTTTC CATTTTAGAA GTTTTCAAAA ACAGCGAACT TGTGCTGCTT ATGGGATTTA CACTTGTTAT TCATACTACC ATGGGTTTTT ATAATACTTT CTTTCCGATT TACTATAAAA ACATGGGTGC TGACAACACC ATTCTGGGAT TGGCGGTGTT TATCGGCTCG GCGAGTGAAA TAATCTTCCT TGTTTTCGGC GACAGGATAA TAAAACGTTT GGGAATCAAG TTTACGCTGT TCGGTGCAGC GGTTGTTGCA GTTGTACGGT GGGCAAGTTT GGGATTGATT AACAATATTT TTGCAGTGCT TGCACTCCAA ATTCTCCATG GTTTTATATT CATTGTTTTG GCCTACTCCA TGGCAACATA TATCAATAAT GAGATGCCAC CTGAATTGAA GGCCTCAGGA CAGACGGTAA ACTCCGTCAT AGGTTTGGGT ATTTCCAGGA TAATTGGAAG TACAGGCGGC GGTGTGATAA GTGATTTAAT CGGAATCAGG CAGGTATTCT TTTTAAATTC GGTTATTGTT CTTGCTTCAA TTGTCATTTT TGGCGCAATA TTTTTGGTAA GAAGACAAAA AATTACAGGA CAATAG
|
Protein sequence | MILDNLKRSE RYPFSFILFY SLFYMGLAVF GVFMPVYLEG LGYDNTDIGT FLSISSFVGL FAQPIWGVIS DRAKSKNNVL KMLVLFSSIA IFMFHLSGNY YYIFAVMVVY AFFQTPITPI GDAITLEYIT DTKWKYGPIR LAGALGYAVM AFIGGALTRK NINAIFFICF VIGIMSLITV FRMPTVKGHQ SDGNKLSILE VFKNSELVLL MGFTLVIHTT MGFYNTFFPI YYKNMGADNT ILGLAVFIGS ASEIIFLVFG DRIIKRLGIK FTLFGAAVVA VVRWASLGLI NNIFAVLALQ ILHGFIFIVL AYSMATYINN EMPPELKASG QTVNSVIGLG ISRIIGSTGG GVISDLIGIR QVFFLNSVIV LASIVIFGAI FLVRRQKITG Q
|
| |