Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1849 |
Symbol | |
ID | 4809395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2194563 |
End bp | 2196062 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107263 |
Product | heavy metal transport/detoxification protein |
Protein accession | YP_001038263 |
Protein GI | 125974353 |
COG category | [P] Inorganic ion transport and metabolism [S] Function unknown |
COG ID | [COG2608] Copper chaperone [COG2836] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA ATTTTACGGC AAATAAGCTT TATGTGCAGG GAATGACGTG CACAGGGTGT GAGACGAGAA TAGAGAATGT ATTAAGAAAA TTGGACGGCG TAACCGATGT CAAGGCTTCA TACACAAGCT CGACCGTTTA CGTGGCCTAT GACAAAAGCA AATTAAACCT GGATAAAATA ATTGAGACCG TGGAAAAGCT TGATTATAAA ATAAAAGGCG TTGAAGATGA AAAAAATGCA GGCTATGATA AAAGAGAGAA TTTTGCGCCG GAGAATGGGA GTAAAACGGA AACAGGGCAG CTTTTGGGCA TTATCATTGT ATTTCTGGCT TTCCTGCTCA TTATAAAGAA CGGCCGGGTA TTTAATTTCA TACCGGAAAT AGACCGGTCG ATGAGTTATG GATTGCTTTT TGTGGCGGGT CTTTTAACTT CGCTTCACTG TGTTGCCATG TGCGGCGGAA TTAACATGTC CGTGTGCATG CAATACAAAA GTGCCGGCGG CAAATCCAAA GCGCTGGGAA GTTCAGACGG TAAGACAGGA AAAGCGGTTG AATTTGAACA GCTTATGCCA AGTTTTTTGT ACAACCTGGG CAGAGTAATT TCCTATACCG TTGTAGGCGG GGTTGTGGGG GCACTCGGCT CTGTGATTAG TTTTTCAGGT GCCGCCAAAG GTGTGGTGGC AATTATATCC GGTGTTTTTA TGGTTATCAT GGGATTGAAT ATGCTCAATA TTTTTCCGGT TTTAAGAAAG ATTACTCCCA GAATGCCCAA AATTTTTGGC AGGAAGATAA GCAATGCCAA AAACAAAGGG CCGCTGCTTG TGGGACTTCT AAACGGTTTC ATGCCGTGCG GTCCTCTTCA GGCAATGCAG CTTTATGCCT TGGGCACGGG TAGTTTTATT GCCGGTGCTA CGTCGATGTT TATGTTTTCC CTGGGGACGG TTCCTCTTAT GTTTGGCCTT GGGGCAATAA GCTCCATAGC CGGCGGAAAA TTTACGCAAA AGATGATGAG GATAAGTGCC GTTTTGGTGA TTGTTTTGGG AGTTGTCATG TTTAACAGGG GTCTGAGTCT TTCGGGATAC AGTTTTCCTA TTTTCTATGC CGACTCCGCG AAGGGTGCAA GTATTGCCAG AATCGAAGGG GATGTTCAGG TTGTTGAAAC ACAGCTGGAG CCTGGAAGAT ATGCACCGAT AGTCGTTCAA AAGGGTATAC CGGTAAAGTG GACGATAAAA GCCAACAAAG AAGACTTGAA CGGCTGCAAC AACGCAATTG TTGCCCGGGA GTTTGGAATA GACAATAGGA AACTGGAAGT TGGAGATAAT ATAATTGAAT TTACGCCGGC AAGAGAAGGA GAATTTGTTT ATTCTTGTTG GATGGGAATG ATTCATGGGT ATATAAAAGT TGTAAATGAC ATTAACGAGA TAGATCAGGA TGACATAAAC TTAAAAAACG AAGGTTTTAA TTTGAATAAA ATACTTCCGA AAGGTTGCTG CGGCATCTAG
|
Protein sequence | MKNNFTANKL YVQGMTCTGC ETRIENVLRK LDGVTDVKAS YTSSTVYVAY DKSKLNLDKI IETVEKLDYK IKGVEDEKNA GYDKRENFAP ENGSKTETGQ LLGIIIVFLA FLLIIKNGRV FNFIPEIDRS MSYGLLFVAG LLTSLHCVAM CGGINMSVCM QYKSAGGKSK ALGSSDGKTG KAVEFEQLMP SFLYNLGRVI SYTVVGGVVG ALGSVISFSG AAKGVVAIIS GVFMVIMGLN MLNIFPVLRK ITPRMPKIFG RKISNAKNKG PLLVGLLNGF MPCGPLQAMQ LYALGTGSFI AGATSMFMFS LGTVPLMFGL GAISSIAGGK FTQKMMRISA VLVIVLGVVM FNRGLSLSGY SFPIFYADSA KGASIARIEG DVQVVETQLE PGRYAPIVVQ KGIPVKWTIK ANKEDLNGCN NAIVAREFGI DNRKLEVGDN IIEFTPAREG EFVYSCWMGM IHGYIKVVND INEIDQDDIN LKNEGFNLNK ILPKGCCGI
|
| |