Gene Cthe_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1111 
Symbol 
ID4811409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1321854 
End bp1322891 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content37% 
IMG OID640106533 
Productbile acid:sodium symporter 
Protein accessionYP_001037536 
Protein GI125973626 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAAA ATAAAGGATT AGGTTTTTTT GAAAAGTACC TTACAGTATG GGTAGCAGTA 
TGCATTATAG TAGGAGTTGC AATAGGACAA TTAGTTCCTT CAATCCCTGA AACTTTAAGC
AAATTTGAAT ATGCAAATGT ATCAATTCCT GTTGCTATTC TCATATGGCT AATGATTTAC
CCAATGATGC TGAAAATTGA TTTTTCAAGC ATTGTCAGGG CAACAAAAAA ACCGAAGGGA
CTAATAGTTA CTTGTGTAAC AAACTGGCTT ATCAAGCCTT TTACAATGTA TCTTATTGCA
GCGTTTTTCT TGAAAGTAGT GTTCAGTAGG TGGATTGGTC CGGATTTAGC GACAGACTAT
CTTGCAGGTG CAGTATTATT AGGAGCCGCA CCATGTACCG CTATGGTATT CGTATGGAGT
TATCTGACAA AAAGCGACCC TGCTTATACA TTAGTGCAGG TAGCAGTGAA TGACCTGATA
ATATTGTTTG CATTTACACC AATTGTTGCA TTCCTATTAG GGGTAAGTAA TGTGACCGTT
CCTTATGACA CGCTGATATT ATCAACAATC CTGTTTGTTG TTATTCCATT GGCAGGAGGG
TACCTTACTA GAAGGAACAT CATTAAACAT AAGAGTATAG AGTATTTCGA GAACATTTTT
CTCAAGAAAT TTGATAATGT AACAATCGTA GGTTTGCTTC TCACTTTAGT AATTATTTTC
TCGTTCCAGG GTGAAATAAT TTTAAGTAAT CCCTTGCATA TTATATTAAT TGCCATACCA
TTAATTATCC AGACATTCTT TATATTCTTC ATTGCTTATG GATGGGCAAA GATATGGAAA
CTTCCCCATG ATATTGCAGC ACCTGCGGGA ATGATTGGAG CAAGCAATTT CTTTGAACTT
GCAGTTGCAG TGGCAATTTC ACTCTTTGGA CTGGAATCTG GAGCCGCTCT TGCAACAGTT
GTAGGGGTAT TGGTTGAAGT CCCGGTCATG CTTACATTGG TCAGGATTGC AAATAGTACA
AGGCATTGGT TTCAATAA
 
Protein sequence
MQENKGLGFF EKYLTVWVAV CIIVGVAIGQ LVPSIPETLS KFEYANVSIP VAILIWLMIY 
PMMLKIDFSS IVRATKKPKG LIVTCVTNWL IKPFTMYLIA AFFLKVVFSR WIGPDLATDY
LAGAVLLGAA PCTAMVFVWS YLTKSDPAYT LVQVAVNDLI ILFAFTPIVA FLLGVSNVTV
PYDTLILSTI LFVVIPLAGG YLTRRNIIKH KSIEYFENIF LKKFDNVTIV GLLLTLVIIF
SFQGEIILSN PLHIILIAIP LIIQTFFIFF IAYGWAKIWK LPHDIAAPAG MIGASNFFEL
AVAVAISLFG LESGAALATV VGVLVEVPVM LTLVRIANST RHWFQ