Gene Information |
Locus tag | Moth_0575 |
Symbol | |
ID | 3832488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 598246 |
End bp | 600675 |
Gene Length | 2430 bp |
Protein Length | 809 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828516 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_429448 |
Protein GI | 83589439 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
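The coordinates and lengths listed above are mutually consistent. The following minimal Python sketch reproduces the arithmetic. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 598246
end_bp = 600675

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.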
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGTGATC GACCGCTCCT CTGGGTTACC GTAGCCTATA TAGTCGGCCT GTTGCTGGCG CGCTTTCAGG AGGGGGCGGC AGGAGCGGTA GCGCCGGGGG ATACCGCGGT GCTGCTCTCT CCCCTGGTCA TAGCCTTGGC CCTCTGGGCC CTGGCCTACC TCTTCTGGCG GCCGCCGCAA CGGCCCGCCT GGATGTACTT GCTCCTGGCC GGATTTGTGG CCATGGGTTT TTTTATTAGT ACCTGGGACA ACTGCCTTCA CCAGAGTCAA CTTGCGGGCG ATCGCGACAC CTATCTCGAC CTCACCGGGA TGGTGACAGG GGAACCGCAG CTTTATCCCG ACCGGGTCGT TTATACCCTG GCGGCCCGGG AGGTCAGTCA GGGCGCATAT AGTAAAAAGA TAAAAGAAAA CGTCCAGGTA GTTATCTACC GGCGGGATAG AAACAAAAGC CTGCCCCGGT ACCATTACGG CGACGTTCTC CGCGTCCACG GCCAGCTCAC AGCGCCACCG CCGGCCCGCA ACCCCGGCGA GTTTGACTAT CGCGCCTACC TCGCCCGACG CTATATTTAT AACCGCATGT TGCTCAATGA CCCCCGGGCC ATAGTCAAGC TGGAAAAAGA GGCAGGTAAC CCCCTGGTAC GCCTGGCCCT GGCAGCCAAA GGCAGGGTAA AAACGGCTAT TGCAGCCGCC CTGCCACCGC GCCAGGCGGG GATTCTTGCC GCCCTTCTCT TCGGCGACGT CGAGGAATTA ACGGATCAGG ACAGCGACAC CTTTAAAAAC CTGGGGGTTT TCCATTTCTT CGCCGTCAGC GGCTCCAACA CGGCCCTGGT TTTACTTATA GTCATGGCTA TAGCCGGCCT CCTGGGTCTA AGAGCAGAGA CAGCGGTGGC CCTGGGGCTG GCCGGTCTGA TCTTTTATGC CGCTGTCACC GGTTTCACAC CCTCGGTAAA CCGGGCGACC ATTATGGCCG GCCTGGGGCT TATAGCCTTC TGGCGCCGCC AGCGCCGCGA TTTCTACACC GCCCTGGCCC TGGCAGCCCT GTTAATCCTG CTGGTACGCC CCCGTTCCCT TTATGACAGC GGTTTTCAGC TTTCCTTTAC CGCCACGTGG GGCATCGTGT ATCTTTACCC TTTGTTGGAT GACCTTCTGG CCTGGCTTCC CGCCTGGCGG GCCTACCTGG TGATCCCCCT GGCGGCGCAG GTAGCTACCC TGCCCCTGGT AGCCTATTAC TTTAGTTTTG TTTCCCTTCT CAGCCTGCCA GCCAACCTGA TCACCGCCGG CCTGGTGGGA GCCATTGTCA CCCTGGGCCT GGCGGCCTCC GCCCTGGCCC TCGTGAGCCT GCCCCTGGCA GGGACCGTGT TTAACGCCCT GGGCCCCCTC GTCAACCTAA TGCTAGCCTT CCTCGCCGGC CTGGCCGGCT TGCCGGGGAT GACCCTGCCC CTGGCCACAC CCTCGCCCCT GGGGGTGGCC GGTTATTACC TGGTCCTGAT CATCCTGAGG GAACTCTGGC TGCGCCGCCG GGAACCGCGC TGGTTGGCCT TGTGGCGGTG GCATCTTCGC GAGCTTGGAG TGGTGGCGGC CCTGACCCTG GCAACTTTGC TGGTCTACCT TTACCACCCG GGGCAGCCGG GAGAACTAGG GGTGACCTTT ATCGACGTCG GCCAGGGGGA CGCCATTTAC CTGGCCACCC CCGGCGGCCG GCACATCCTG GTCGACGGCG GCGGTCGGCC CTTCGAGCAG GGCGATTTTG ACGTCGGGGA GAGGGTGGTG GTGCCCTTCC TCCACCGCCA GGGGGTGCGG 
CGGTTGGATG CAGTCGTCAG CACCCACCCG GATACTGATC ATCTGGGCGG CCTCATGGCC GTGGTACGGC AGATGCCTGT CTCCCTGGTC GTCGTCCCCC CCTTGCGGGG CAGTATGGTT AACGAGTACC GTCCCTTCCT GGCGGAACTC CAGGCCCGGG GTATCCCCTG GCAGGAGGCG GGACGGGGGG CCACCCTGGC CCTGGACCCG GTAGTGGACC TCCAGGTCCT GCATCCCGGC CGGGAGATCA GCGGCAGCAA CTCCGACAGC AACAACAATT CCCTGGTCAT GAAGGTGGTT TACCGGGACT TCAGCCTTCT TCTGGGCGCC GATATTGAGG CTGAAGCCAT GGCCGATCTG GAGAGGGCCG GGATGAATGT CCGCAGTACC GTCTTTAAAG TACCCCACCA TGGCAGCCGC TTCGGCCTGG AGCCGTCCTT TTTGCAGCAG GTGTCCCCCC AAGCGGTGGT TTTCTCCGTG GGGGAGAGAA ATAACTTCGG CCACCCGGCG CCGGAGGTAG TGGGCTACTG GCAGGGGCGG GGCGTCCCCA TTTATCGCAC CGACCGGCAG GGGGCAATTA GCATCAGGAG CGACGGGGAT AGCTGGCAGG TAAAGACTGT TCTCCCTTAA
Protein sequence | MRDRPLLWVT VAYIVGLLLA RFQEGAAGAV APGDTAVLLS PLVIALALWA LAYLFWRPPQ RPAWMYLLLA GFVAMGFFIS TWDNCLHQSQ LAGDRDTYLD LTGMVTGEPQ LYPDRVVYTL AAREVSQGAY SKKIKENVQV VIYRRDRNKS LPRYHYGDVL RVHGQLTAPP PARNPGEFDY RAYLARRYIY NRMLLNDPRA IVKLEKEAGN PLVRLALAAK GRVKTAIAAA LPPRQAGILA ALLFGDVEEL TDQDSDTFKN LGVFHFFAVS GSNTALVLLI VMAIAGLLGL RAETAVALGL AGLIFYAAVT GFTPSVNRAT IMAGLGLIAF WRRQRRDFYT ALALAALLIL LVRPRSLYDS GFQLSFTATW GIVYLYPLLD DLLAWLPAWR AYLVIPLAAQ VATLPLVAYY FSFVSLLSLP ANLITAGLVG AIVTLGLAAS ALALVSLPLA GTVFNALGPL VNLMLAFLAG LAGLPGMTLP LATPSPLGVA GYYLVLIILR ELWLRRREPR WLALWRWHLR ELGVVAALTL ATLLVYLYHP GQPGELGVTF IDVGQGDAIY LATPGGRHIL VDGGGRPFEQ GDFDVGERVV VPFLHRQGVR RLDAVVSTHP DTDHLGGLMA VVRQMPVSLV VVPPLRGSMV NEYRPFLAEL QARGIPWQEA GRGATLALDP VVDLQVLHPG REISGSNSDS NNNSLVMKVV YRDFSLLLGA DIEAEAMADL ERAGMNVRST VFKVPHHGSR FGLEPSFLQQ VSPQAVVFSV GERNNFGHPA PEVVGYWQGR GVPIYRTDRQ GAISIRSDGD SWQVKTVLP
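Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be removed before the sequences are used elsewhere. The sketch below shows one way to do that in Python and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first three blocks of the gene sequence, not the full entry.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an ungapped nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence, used as a short sample.
sample = "ATGCGTGATC GACCGCTCCT CTGGGTTACC"
seq = clean_sequence(sample)
print(len(seq), gc_fraction(seq))
```

Applying the same two helpers to the full gene sequence should give a value comparable to the GC content field, provided the sequence is copied without loss.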