Gene Information |
Locus tag | Cthe_0606 |
Symbol | |
ID | 4808208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 741757 |
End bp | 744093 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106020 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_001037034 |
Protein GI | 125973124 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
| |
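The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions agree with the values in this record. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cthe_0606.
length_bp = gene_length(741757, 744093)
print(length_bp)                           # 2337
print(expected_protein_length(length_bp))  # 778
```

Both results match the Gene Length (2337 bp) and Protein Length (778 aa) fields listed in the record.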
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAC CGCTGGTTTG TTTTAGTCTG TCTCTTATGG CCGGAATTTT ATGCACCAAT TTAACCCATT CATACTTGTT TGCTTTTTTG TCCTGTGTGG TAATTGGTGT TATTGCGTTT ATTCTATTAA AGAACAAGGA TAACGCCAAA TTTATAGTTG GCGGAATTGT TCTGTTTTAC TTTATTGGTG CGGTATATTA CTTATACGGC TACAACCGGA ACCTTCATAA ATTTGAAGAG TTTGCCGGGA AAAATGTTGT AATAAGGGGA TATATTGATT CGGCGCCGGA AATTAAAGGG TCAACAATCA GATATGTACT AAAGACGGAG GAAATTCGGC TAAAAGAGGA TTCAAACCAG GAAAAGAAGA TTCGGGGAAA AATTTTACTT TCCGTGCAGA AAAGCGATGA AGTTCCGCTT TTTGAATATG GAAGGGAAAT AAAAATATCG GGTAAAATAA GTATTCCTAA AGGCAGAACC AATCCCGGGG GATTTGATTA CAGGAAGTAT CTCAACCACT CCGGGATTTC CGCCACTGTT TTTGTTGTCG GCAGAAATAT ATACCCGCAG AAAAACGTAA AAGGCAATAT ATTTGTCAAA GCAGGCCTAA GTATAAGAGA AAGGATTGTA AATGTAATAA ACCAGAGCCT TCCGCCTCAG CAGGCGGGAC TACTTAGCGG CATGTTGATA GGCTACAGGG AAGGACTTTC CGAGGAAGTG GAAGAAGCTT TCAGCAATTC CGGGCTGACT CATTTAATGG CGGTCTCAGG AGCAAACGTT GCTTTTATCA TGCTTCCTCT TGTCTTTATA TTTAAAAAAC TTAGGTTTAG GCAAAACATC TACAACATTA TAATCATTGG TATCCTCTTG TTGTTTACCT TTATTACAGG ATTTGAACCG TCAGTCCTGC GTGCGGTAAT AATGGCGATA GTTATCCTCG TGGGGCAGAT TTTAAAAAGG GAGACGGATA TTTTTACCAG CATTGCCTTT GCTGCAATTC TGCTTCTTTT ATTAAATCCC GGAAACCTTT TTAACATAGG GTTTCAATTG TCCTTTGCAG CAACAATTTC ACTGGTTTTG TTCTATACCA ATTTAAAAAA CATGTTAAAT TTCGGCTTTC TTCCGGAATT TATAACCGAT GTGCTGGCGT CTACACTGGC GGCTCAAATA GGAGTATTGC CGATAACGGT GTTTTATTTT AATAAAATAT CTCTTATATC GGTTTTGTCA AACCTCATAG TTGCACCAGT AGTGGAATTT ATTACAATTA TGGGGTCCTT GATGGCTGTT TTGGGACAAA TACATATAAT CTTCTCCGTA TTGATAGGTT ATTGCAACAA CGCTCTTTTA AGTTTTGTGC TCTTTGTCAC AAAAACGACG GCAGAGCTGC CTTATTCGGT TATAACCGTT TCAACGCCTT CTGTTGTTTT AGTGATAATT TATTATATTT TTATATTGTT TTTATTTTGG TACAAGCCTA AATACAAGGT AAAACTAAAC TTAAAGTATT GCGTATTGGC AGGGGCTGTA TCTGTAGCGT TGATAGCGGT TAGCTTCCTC TGGCCTAAAG GAATGGAAGT GGTGTTTTTG GACGTTGGGC AGGGGGATGG TGCTTTTATC AGAACATGCA GCGGCAAGAC TATTTTGATT GACGGTGGTC CGGAAAGTGC TGGAGAAAAC GCTGTTGTAC CGTTTTTATT GGATTATGGT GTGACAGAAA TTGACCTGGT GGTTGTAAGC CATGGACATG ACGACCATTA TAAAGGGCTT TTGCCCGTAC TTGAAAACTT CAAGGTGAGA 
ACTCTTATAA TTCCCGACGT TGATACTGAT GAAGGACTGC TGGATGCAAT TGAAATTGCC CGAAAAAGAA AAATTTCGGT GGAAAAGTGT GAAAAGGACG ATGTAATTAC CCTTGACAAA AAAACGTATA TTGAGGTTTT GCATCCAAGG GAAGGGATTT ATTTCAATGA GTCCGGCATA AACAACAGTT CTTTGGTGTT AAAACTCAAT TTCAAAGATG TGAGCATACT GTTTACGGGA GATATTGAAA AAGAGGCCGA AAGGCTGCTT TGTGAGGATG AGGTAAATCT CGATGCGGAT GTGTTGAAAG TGGCGCACCA TGGCTCTTCT ACATCTTCCA CGGAGGAATT TTTGGACAGT GTTACTCCCG ATGTGGCTGT TATAAGCGTG GGTAAAAACA ATTTCGGGCA TCCTTCCGAA GAAGTTCTTC AGCGTATGGA ATCAAAGGGT ATATATGTCT TAAGAACCGA TATATCCGGG GCCGTAGTAC TGAAAACTTA TGGGGAAAAG ATTAGGATAA GACCAACCGT ACCGTAA
|
Protein sequence | MKRPLVCFSL SLMAGILCTN LTHSYLFAFL SCVVIGVIAF ILLKNKDNAK FIVGGIVLFY FIGAVYYLYG YNRNLHKFEE FAGKNVVIRG YIDSAPEIKG STIRYVLKTE EIRLKEDSNQ EKKIRGKILL SVQKSDEVPL FEYGREIKIS GKISIPKGRT NPGGFDYRKY LNHSGISATV FVVGRNIYPQ KNVKGNIFVK AGLSIRERIV NVINQSLPPQ QAGLLSGMLI GYREGLSEEV EEAFSNSGLT HLMAVSGANV AFIMLPLVFI FKKLRFRQNI YNIIIIGILL LFTFITGFEP SVLRAVIMAI VILVGQILKR ETDIFTSIAF AAILLLLLNP GNLFNIGFQL SFAATISLVL FYTNLKNMLN FGFLPEFITD VLASTLAAQI GVLPITVFYF NKISLISVLS NLIVAPVVEF ITIMGSLMAV LGQIHIIFSV LIGYCNNALL SFVLFVTKTT AELPYSVITV STPSVVLVII YYIFILFLFW YKPKYKVKLN LKYCVLAGAV SVALIAVSFL WPKGMEVVFL DVGQGDGAFI RTCSGKTILI DGGPESAGEN AVVPFLLDYG VTEIDLVVVS HGHDDHYKGL LPVLENFKVR TLIIPDVDTD EGLLDAIEIA RKRKISVEKC EKDDVITLDK KTYIEVLHPR EGIYFNESGI NNSSLVLKLN FKDVSILFTG DIEKEAERLL CEDEVNLDAD VLKVAHHGSS TSSTEEFLDS VTPDVAVISV GKNNFGHPSE EVLQRMESKG IYVLRTDISG AVVLKTYGEK IRIRPTVP
|
| |
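The sequences above are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any downstream use. The following is a minimal sketch, using only the Python standard library, of how the blocked gene sequence could be normalized, translated, and checked for GC content. The helper names (`clean_blocks`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard genetic code, whose amino acid assignments are shared by translation table 11; the alternative start codons that table 11 allows are not handled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; same assignments as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(blocked.split()).upper()


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon."""
    dna = clean_blocks(blocked_dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(blocked_dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_blocks(blocked_dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
prefix = "ATGAAAAGAC CGCTGGTTTG TTTTAGTCTG"
print(translate(prefix))  # MKRPLVCFSL
```

The translated prefix matches the first block of the protein sequence in the record. Applying `clean_blocks` to the full gene sequence and taking its length, or passing it to `gc_percent`, would allow the Gene Length and GC content fields to be checked in the same way.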