Gene Cthe_0606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0606 
Symbol 
ID4808208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp741757 
End bp744093 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content40% 
IMG OID640106020 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_001037034 
Protein GI125973124 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAC CGCTGGTTTG TTTTAGTCTG TCTCTTATGG CCGGAATTTT ATGCACCAAT 
TTAACCCATT CATACTTGTT TGCTTTTTTG TCCTGTGTGG TAATTGGTGT TATTGCGTTT
ATTCTATTAA AGAACAAGGA TAACGCCAAA TTTATAGTTG GCGGAATTGT TCTGTTTTAC
TTTATTGGTG CGGTATATTA CTTATACGGC TACAACCGGA ACCTTCATAA ATTTGAAGAG
TTTGCCGGGA AAAATGTTGT AATAAGGGGA TATATTGATT CGGCGCCGGA AATTAAAGGG
TCAACAATCA GATATGTACT AAAGACGGAG GAAATTCGGC TAAAAGAGGA TTCAAACCAG
GAAAAGAAGA TTCGGGGAAA AATTTTACTT TCCGTGCAGA AAAGCGATGA AGTTCCGCTT
TTTGAATATG GAAGGGAAAT AAAAATATCG GGTAAAATAA GTATTCCTAA AGGCAGAACC
AATCCCGGGG GATTTGATTA CAGGAAGTAT CTCAACCACT CCGGGATTTC CGCCACTGTT
TTTGTTGTCG GCAGAAATAT ATACCCGCAG AAAAACGTAA AAGGCAATAT ATTTGTCAAA
GCAGGCCTAA GTATAAGAGA AAGGATTGTA AATGTAATAA ACCAGAGCCT TCCGCCTCAG
CAGGCGGGAC TACTTAGCGG CATGTTGATA GGCTACAGGG AAGGACTTTC CGAGGAAGTG
GAAGAAGCTT TCAGCAATTC CGGGCTGACT CATTTAATGG CGGTCTCAGG AGCAAACGTT
GCTTTTATCA TGCTTCCTCT TGTCTTTATA TTTAAAAAAC TTAGGTTTAG GCAAAACATC
TACAACATTA TAATCATTGG TATCCTCTTG TTGTTTACCT TTATTACAGG ATTTGAACCG
TCAGTCCTGC GTGCGGTAAT AATGGCGATA GTTATCCTCG TGGGGCAGAT TTTAAAAAGG
GAGACGGATA TTTTTACCAG CATTGCCTTT GCTGCAATTC TGCTTCTTTT ATTAAATCCC
GGAAACCTTT TTAACATAGG GTTTCAATTG TCCTTTGCAG CAACAATTTC ACTGGTTTTG
TTCTATACCA ATTTAAAAAA CATGTTAAAT TTCGGCTTTC TTCCGGAATT TATAACCGAT
GTGCTGGCGT CTACACTGGC GGCTCAAATA GGAGTATTGC CGATAACGGT GTTTTATTTT
AATAAAATAT CTCTTATATC GGTTTTGTCA AACCTCATAG TTGCACCAGT AGTGGAATTT
ATTACAATTA TGGGGTCCTT GATGGCTGTT TTGGGACAAA TACATATAAT CTTCTCCGTA
TTGATAGGTT ATTGCAACAA CGCTCTTTTA AGTTTTGTGC TCTTTGTCAC AAAAACGACG
GCAGAGCTGC CTTATTCGGT TATAACCGTT TCAACGCCTT CTGTTGTTTT AGTGATAATT
TATTATATTT TTATATTGTT TTTATTTTGG TACAAGCCTA AATACAAGGT AAAACTAAAC
TTAAAGTATT GCGTATTGGC AGGGGCTGTA TCTGTAGCGT TGATAGCGGT TAGCTTCCTC
TGGCCTAAAG GAATGGAAGT GGTGTTTTTG GACGTTGGGC AGGGGGATGG TGCTTTTATC
AGAACATGCA GCGGCAAGAC TATTTTGATT GACGGTGGTC CGGAAAGTGC TGGAGAAAAC
GCTGTTGTAC CGTTTTTATT GGATTATGGT GTGACAGAAA TTGACCTGGT GGTTGTAAGC
CATGGACATG ACGACCATTA TAAAGGGCTT TTGCCCGTAC TTGAAAACTT CAAGGTGAGA
ACTCTTATAA TTCCCGACGT TGATACTGAT GAAGGACTGC TGGATGCAAT TGAAATTGCC
CGAAAAAGAA AAATTTCGGT GGAAAAGTGT GAAAAGGACG ATGTAATTAC CCTTGACAAA
AAAACGTATA TTGAGGTTTT GCATCCAAGG GAAGGGATTT ATTTCAATGA GTCCGGCATA
AACAACAGTT CTTTGGTGTT AAAACTCAAT TTCAAAGATG TGAGCATACT GTTTACGGGA
GATATTGAAA AAGAGGCCGA AAGGCTGCTT TGTGAGGATG AGGTAAATCT CGATGCGGAT
GTGTTGAAAG TGGCGCACCA TGGCTCTTCT ACATCTTCCA CGGAGGAATT TTTGGACAGT
GTTACTCCCG ATGTGGCTGT TATAAGCGTG GGTAAAAACA ATTTCGGGCA TCCTTCCGAA
GAAGTTCTTC AGCGTATGGA ATCAAAGGGT ATATATGTCT TAAGAACCGA TATATCCGGG
GCCGTAGTAC TGAAAACTTA TGGGGAAAAG ATTAGGATAA GACCAACCGT ACCGTAA
 
Protein sequence
MKRPLVCFSL SLMAGILCTN LTHSYLFAFL SCVVIGVIAF ILLKNKDNAK FIVGGIVLFY 
FIGAVYYLYG YNRNLHKFEE FAGKNVVIRG YIDSAPEIKG STIRYVLKTE EIRLKEDSNQ
EKKIRGKILL SVQKSDEVPL FEYGREIKIS GKISIPKGRT NPGGFDYRKY LNHSGISATV
FVVGRNIYPQ KNVKGNIFVK AGLSIRERIV NVINQSLPPQ QAGLLSGMLI GYREGLSEEV
EEAFSNSGLT HLMAVSGANV AFIMLPLVFI FKKLRFRQNI YNIIIIGILL LFTFITGFEP
SVLRAVIMAI VILVGQILKR ETDIFTSIAF AAILLLLLNP GNLFNIGFQL SFAATISLVL
FYTNLKNMLN FGFLPEFITD VLASTLAAQI GVLPITVFYF NKISLISVLS NLIVAPVVEF
ITIMGSLMAV LGQIHIIFSV LIGYCNNALL SFVLFVTKTT AELPYSVITV STPSVVLVII
YYIFILFLFW YKPKYKVKLN LKYCVLAGAV SVALIAVSFL WPKGMEVVFL DVGQGDGAFI
RTCSGKTILI DGGPESAGEN AVVPFLLDYG VTEIDLVVVS HGHDDHYKGL LPVLENFKVR
TLIIPDVDTD EGLLDAIEIA RKRKISVEKC EKDDVITLDK KTYIEVLHPR EGIYFNESGI
NNSSLVLKLN FKDVSILFTG DIEKEAERLL CEDEVNLDAD VLKVAHHGSS TSSTEEFLDS
VTPDVAVISV GKNNFGHPSE EVLQRMESKG IYVLRTDISG AVVLKTYGEK IRIRPTVP