Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1111 |
Symbol | |
ID | 4811409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1321854 |
End bp | 1322891 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106533 |
Product | bile acid:sodium symporter |
Protein accession | YP_001037536 |
Protein GI | 125973626 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAA ATAAAGGATT AGGTTTTTTT GAAAAGTACC TTACAGTATG GGTAGCAGTA TGCATTATAG TAGGAGTTGC AATAGGACAA TTAGTTCCTT CAATCCCTGA AACTTTAAGC AAATTTGAAT ATGCAAATGT ATCAATTCCT GTTGCTATTC TCATATGGCT AATGATTTAC CCAATGATGC TGAAAATTGA TTTTTCAAGC ATTGTCAGGG CAACAAAAAA ACCGAAGGGA CTAATAGTTA CTTGTGTAAC AAACTGGCTT ATCAAGCCTT TTACAATGTA TCTTATTGCA GCGTTTTTCT TGAAAGTAGT GTTCAGTAGG TGGATTGGTC CGGATTTAGC GACAGACTAT CTTGCAGGTG CAGTATTATT AGGAGCCGCA CCATGTACCG CTATGGTATT CGTATGGAGT TATCTGACAA AAAGCGACCC TGCTTATACA TTAGTGCAGG TAGCAGTGAA TGACCTGATA ATATTGTTTG CATTTACACC AATTGTTGCA TTCCTATTAG GGGTAAGTAA TGTGACCGTT CCTTATGACA CGCTGATATT ATCAACAATC CTGTTTGTTG TTATTCCATT GGCAGGAGGG TACCTTACTA GAAGGAACAT CATTAAACAT AAGAGTATAG AGTATTTCGA GAACATTTTT CTCAAGAAAT TTGATAATGT AACAATCGTA GGTTTGCTTC TCACTTTAGT AATTATTTTC TCGTTCCAGG GTGAAATAAT TTTAAGTAAT CCCTTGCATA TTATATTAAT TGCCATACCA TTAATTATCC AGACATTCTT TATATTCTTC ATTGCTTATG GATGGGCAAA GATATGGAAA CTTCCCCATG ATATTGCAGC ACCTGCGGGA ATGATTGGAG CAAGCAATTT CTTTGAACTT GCAGTTGCAG TGGCAATTTC ACTCTTTGGA CTGGAATCTG GAGCCGCTCT TGCAACAGTT GTAGGGGTAT TGGTTGAAGT CCCGGTCATG CTTACATTGG TCAGGATTGC AAATAGTACA AGGCATTGGT TTCAATAA
|
Protein sequence | MQENKGLGFF EKYLTVWVAV CIIVGVAIGQ LVPSIPETLS KFEYANVSIP VAILIWLMIY PMMLKIDFSS IVRATKKPKG LIVTCVTNWL IKPFTMYLIA AFFLKVVFSR WIGPDLATDY LAGAVLLGAA PCTAMVFVWS YLTKSDPAYT LVQVAVNDLI ILFAFTPIVA FLLGVSNVTV PYDTLILSTI LFVVIPLAGG YLTRRNIIKH KSIEYFENIF LKKFDNVTIV GLLLTLVIIF SFQGEIILSN PLHIILIAIP LIIQTFFIFF IAYGWAKIWK LPHDIAAPAG MIGASNFFEL AVAVAISLFG LESGAALATV VGVLVEVPVM LTLVRIANST RHWFQ
|
| |