Gene Moth_0575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0575 
Symbol 
ID3832488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp598246 
End bp600675 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content63% 
IMG OID637828516 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_429448 
Protein GI83589439 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGATC GACCGCTCCT CTGGGTTACC GTAGCCTATA TAGTCGGCCT GTTGCTGGCG 
CGCTTTCAGG AGGGGGCGGC AGGAGCGGTA GCGCCGGGGG ATACCGCGGT GCTGCTCTCT
CCCCTGGTCA TAGCCTTGGC CCTCTGGGCC CTGGCCTACC TCTTCTGGCG GCCGCCGCAA
CGGCCCGCCT GGATGTACTT GCTCCTGGCC GGATTTGTGG CCATGGGTTT TTTTATTAGT
ACCTGGGACA ACTGCCTTCA CCAGAGTCAA CTTGCGGGCG ATCGCGACAC CTATCTCGAC
CTCACCGGGA TGGTGACAGG GGAACCGCAG CTTTATCCCG ACCGGGTCGT TTATACCCTG
GCGGCCCGGG AGGTCAGTCA GGGCGCATAT AGTAAAAAGA TAAAAGAAAA CGTCCAGGTA
GTTATCTACC GGCGGGATAG AAACAAAAGC CTGCCCCGGT ACCATTACGG CGACGTTCTC
CGCGTCCACG GCCAGCTCAC AGCGCCACCG CCGGCCCGCA ACCCCGGCGA GTTTGACTAT
CGCGCCTACC TCGCCCGACG CTATATTTAT AACCGCATGT TGCTCAATGA CCCCCGGGCC
ATAGTCAAGC TGGAAAAAGA GGCAGGTAAC CCCCTGGTAC GCCTGGCCCT GGCAGCCAAA
GGCAGGGTAA AAACGGCTAT TGCAGCCGCC CTGCCACCGC GCCAGGCGGG GATTCTTGCC
GCCCTTCTCT TCGGCGACGT CGAGGAATTA ACGGATCAGG ACAGCGACAC CTTTAAAAAC
CTGGGGGTTT TCCATTTCTT CGCCGTCAGC GGCTCCAACA CGGCCCTGGT TTTACTTATA
GTCATGGCTA TAGCCGGCCT CCTGGGTCTA AGAGCAGAGA CAGCGGTGGC CCTGGGGCTG
GCCGGTCTGA TCTTTTATGC CGCTGTCACC GGTTTCACAC CCTCGGTAAA CCGGGCGACC
ATTATGGCCG GCCTGGGGCT TATAGCCTTC TGGCGCCGCC AGCGCCGCGA TTTCTACACC
GCCCTGGCCC TGGCAGCCCT GTTAATCCTG CTGGTACGCC CCCGTTCCCT TTATGACAGC
GGTTTTCAGC TTTCCTTTAC CGCCACGTGG GGCATCGTGT ATCTTTACCC TTTGTTGGAT
GACCTTCTGG CCTGGCTTCC CGCCTGGCGG GCCTACCTGG TGATCCCCCT GGCGGCGCAG
GTAGCTACCC TGCCCCTGGT AGCCTATTAC TTTAGTTTTG TTTCCCTTCT CAGCCTGCCA
GCCAACCTGA TCACCGCCGG CCTGGTGGGA GCCATTGTCA CCCTGGGCCT GGCGGCCTCC
GCCCTGGCCC TCGTGAGCCT GCCCCTGGCA GGGACCGTGT TTAACGCCCT GGGCCCCCTC
GTCAACCTAA TGCTAGCCTT CCTCGCCGGC CTGGCCGGCT TGCCGGGGAT GACCCTGCCC
CTGGCCACAC CCTCGCCCCT GGGGGTGGCC GGTTATTACC TGGTCCTGAT CATCCTGAGG
GAACTCTGGC TGCGCCGCCG GGAACCGCGC TGGTTGGCCT TGTGGCGGTG GCATCTTCGC
GAGCTTGGAG TGGTGGCGGC CCTGACCCTG GCAACTTTGC TGGTCTACCT TTACCACCCG
GGGCAGCCGG GAGAACTAGG GGTGACCTTT ATCGACGTCG GCCAGGGGGA CGCCATTTAC
CTGGCCACCC CCGGCGGCCG GCACATCCTG GTCGACGGCG GCGGTCGGCC CTTCGAGCAG
GGCGATTTTG ACGTCGGGGA GAGGGTGGTG GTGCCCTTCC TCCACCGCCA GGGGGTGCGG
CGGTTGGATG CAGTCGTCAG CACCCACCCG GATACTGATC ATCTGGGCGG CCTCATGGCC
GTGGTACGGC AGATGCCTGT CTCCCTGGTC GTCGTCCCCC CCTTGCGGGG CAGTATGGTT
AACGAGTACC GTCCCTTCCT GGCGGAACTC CAGGCCCGGG GTATCCCCTG GCAGGAGGCG
GGACGGGGGG CCACCCTGGC CCTGGACCCG GTAGTGGACC TCCAGGTCCT GCATCCCGGC
CGGGAGATCA GCGGCAGCAA CTCCGACAGC AACAACAATT CCCTGGTCAT GAAGGTGGTT
TACCGGGACT TCAGCCTTCT TCTGGGCGCC GATATTGAGG CTGAAGCCAT GGCCGATCTG
GAGAGGGCCG GGATGAATGT CCGCAGTACC GTCTTTAAAG TACCCCACCA TGGCAGCCGC
TTCGGCCTGG AGCCGTCCTT TTTGCAGCAG GTGTCCCCCC AAGCGGTGGT TTTCTCCGTG
GGGGAGAGAA ATAACTTCGG CCACCCGGCG CCGGAGGTAG TGGGCTACTG GCAGGGGCGG
GGCGTCCCCA TTTATCGCAC CGACCGGCAG GGGGCAATTA GCATCAGGAG CGACGGGGAT
AGCTGGCAGG TAAAGACTGT TCTCCCTTAA
 
Protein sequence
MRDRPLLWVT VAYIVGLLLA RFQEGAAGAV APGDTAVLLS PLVIALALWA LAYLFWRPPQ 
RPAWMYLLLA GFVAMGFFIS TWDNCLHQSQ LAGDRDTYLD LTGMVTGEPQ LYPDRVVYTL
AAREVSQGAY SKKIKENVQV VIYRRDRNKS LPRYHYGDVL RVHGQLTAPP PARNPGEFDY
RAYLARRYIY NRMLLNDPRA IVKLEKEAGN PLVRLALAAK GRVKTAIAAA LPPRQAGILA
ALLFGDVEEL TDQDSDTFKN LGVFHFFAVS GSNTALVLLI VMAIAGLLGL RAETAVALGL
AGLIFYAAVT GFTPSVNRAT IMAGLGLIAF WRRQRRDFYT ALALAALLIL LVRPRSLYDS
GFQLSFTATW GIVYLYPLLD DLLAWLPAWR AYLVIPLAAQ VATLPLVAYY FSFVSLLSLP
ANLITAGLVG AIVTLGLAAS ALALVSLPLA GTVFNALGPL VNLMLAFLAG LAGLPGMTLP
LATPSPLGVA GYYLVLIILR ELWLRRREPR WLALWRWHLR ELGVVAALTL ATLLVYLYHP
GQPGELGVTF IDVGQGDAIY LATPGGRHIL VDGGGRPFEQ GDFDVGERVV VPFLHRQGVR
RLDAVVSTHP DTDHLGGLMA VVRQMPVSLV VVPPLRGSMV NEYRPFLAEL QARGIPWQEA
GRGATLALDP VVDLQVLHPG REISGSNSDS NNNSLVMKVV YRDFSLLLGA DIEAEAMADL
ERAGMNVRST VFKVPHHGSR FGLEPSFLQQ VSPQAVVFSV GERNNFGHPA PEVVGYWQGR
GVPIYRTDRQ GAISIRSDGD SWQVKTVLP