Gene Cthe_1849 details


Gene Information

Locus tag: Cthe_1849
Symbol:
ID: 4809395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2194563
End bp: 2196062
Gene length: 1500 bp
Protein length: 499 aa
Translation table: 11
GC content: 43%
IMG OID: 640107263
Product: heavy metal transport/detoxification protein
Protein accession: YP_001038263
Protein GI: 125974353
COG category: [P] Inorganic ion transport and metabolism
              [S] Function unknown
COG ID: [COG2608] Copper chaperone
        [COG2836] Uncharacterized conserved protein
TIGRFAM ID:
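
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the function names are hypothetical and are not part of this record or of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS ending in one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the Gene Information table.
start_bp, end_bp = 2194563, 2196062

gene_length = feature_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length)     # 1500
print(protein_length)  # 499
```

Under these assumptions the computed values agree with the listed gene length (1500 bp) and protein length (499 aa).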


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA ATTTTACGGC AAATAAGCTT TATGTGCAGG GAATGACGTG CACAGGGTGT 
GAGACGAGAA TAGAGAATGT ATTAAGAAAA TTGGACGGCG TAACCGATGT CAAGGCTTCA
TACACAAGCT CGACCGTTTA CGTGGCCTAT GACAAAAGCA AATTAAACCT GGATAAAATA
ATTGAGACCG TGGAAAAGCT TGATTATAAA ATAAAAGGCG TTGAAGATGA AAAAAATGCA
GGCTATGATA AAAGAGAGAA TTTTGCGCCG GAGAATGGGA GTAAAACGGA AACAGGGCAG
CTTTTGGGCA TTATCATTGT ATTTCTGGCT TTCCTGCTCA TTATAAAGAA CGGCCGGGTA
TTTAATTTCA TACCGGAAAT AGACCGGTCG ATGAGTTATG GATTGCTTTT TGTGGCGGGT
CTTTTAACTT CGCTTCACTG TGTTGCCATG TGCGGCGGAA TTAACATGTC CGTGTGCATG
CAATACAAAA GTGCCGGCGG CAAATCCAAA GCGCTGGGAA GTTCAGACGG TAAGACAGGA
AAAGCGGTTG AATTTGAACA GCTTATGCCA AGTTTTTTGT ACAACCTGGG CAGAGTAATT
TCCTATACCG TTGTAGGCGG GGTTGTGGGG GCACTCGGCT CTGTGATTAG TTTTTCAGGT
GCCGCCAAAG GTGTGGTGGC AATTATATCC GGTGTTTTTA TGGTTATCAT GGGATTGAAT
ATGCTCAATA TTTTTCCGGT TTTAAGAAAG ATTACTCCCA GAATGCCCAA AATTTTTGGC
AGGAAGATAA GCAATGCCAA AAACAAAGGG CCGCTGCTTG TGGGACTTCT AAACGGTTTC
ATGCCGTGCG GTCCTCTTCA GGCAATGCAG CTTTATGCCT TGGGCACGGG TAGTTTTATT
GCCGGTGCTA CGTCGATGTT TATGTTTTCC CTGGGGACGG TTCCTCTTAT GTTTGGCCTT
GGGGCAATAA GCTCCATAGC CGGCGGAAAA TTTACGCAAA AGATGATGAG GATAAGTGCC
GTTTTGGTGA TTGTTTTGGG AGTTGTCATG TTTAACAGGG GTCTGAGTCT TTCGGGATAC
AGTTTTCCTA TTTTCTATGC CGACTCCGCG AAGGGTGCAA GTATTGCCAG AATCGAAGGG
GATGTTCAGG TTGTTGAAAC ACAGCTGGAG CCTGGAAGAT ATGCACCGAT AGTCGTTCAA
AAGGGTATAC CGGTAAAGTG GACGATAAAA GCCAACAAAG AAGACTTGAA CGGCTGCAAC
AACGCAATTG TTGCCCGGGA GTTTGGAATA GACAATAGGA AACTGGAAGT TGGAGATAAT
ATAATTGAAT TTACGCCGGC AAGAGAAGGA GAATTTGTTT ATTCTTGTTG GATGGGAATG
ATTCATGGGT ATATAAAAGT TGTAAATGAC ATTAACGAGA TAGATCAGGA TGACATAAAC
TTAAAAAACG AAGGTTTTAA TTTGAATAAA ATACTTCCGA AAGGTTGCTG CGGCATCTAG
 
Protein sequence
MKNNFTANKL YVQGMTCTGC ETRIENVLRK LDGVTDVKAS YTSSTVYVAY DKSKLNLDKI 
IETVEKLDYK IKGVEDEKNA GYDKRENFAP ENGSKTETGQ LLGIIIVFLA FLLIIKNGRV
FNFIPEIDRS MSYGLLFVAG LLTSLHCVAM CGGINMSVCM QYKSAGGKSK ALGSSDGKTG
KAVEFEQLMP SFLYNLGRVI SYTVVGGVVG ALGSVISFSG AAKGVVAIIS GVFMVIMGLN
MLNIFPVLRK ITPRMPKIFG RKISNAKNKG PLLVGLLNGF MPCGPLQAMQ LYALGTGSFI
AGATSMFMFS LGTVPLMFGL GAISSIAGGK FTQKMMRISA VLVIVLGVVM FNRGLSLSGY
SFPIFYADSA KGASIARIEG DVQVVETQLE PGRYAPIVVQ KGIPVKWTIK ANKEDLNGCN
NAIVAREFGI DNRKLEVGDN IIEFTPAREG EFVYSCWMGM IHGYIKVVND INEIDQDDIN
LKNEGFNLNK ILPKGCCGI
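
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch only, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, no special start handling is included. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the input is assumed to contain only A, C, G and T.

```python
# Codon table in the conventional TCAG ordering (amino-acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGAAAAATA ATTTTACGGC AAATAAGCTT TATGTGCAGG GAATGACGTG CACAGGGTGT"
print(translate(first_row))  # MKNNFTANKLYVQGMTCTGC
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content listed under Gene Information to be checked in the same way.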