Gene Mboo_1210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1210 
Symbol 
ID5410382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1229102 
End bp1230070 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content56% 
IMG OID640868437 
Productputative RNA methylase 
Protein accessionYP_001404371 
Protein GI154150753 
COG category[L] Replication, recombination and repair 
COG ID[COG1041] Predicted DNA modification methylase 
TIGRFAM ID[TIGR01177] conserved hypothetical protein TIGR01177 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00394731 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCTCC TTTTTGAGCT CTCGGGGGAA AATCCTACCC TGCCGTGCGC CGAGCTGGAA 
TGTGTCGGTC GTGTCCTGGA GGCCCGGCCG CAGGTTGCCG TGGCTGAATG CGCACACCCA
AACAACACCC TGCGCCTGGC AATGACCCAT GGGGTACTCG AATACCTTGG CGAATGTGAC
CCGGATCCTG CCTCATTCCG GAAACTTCTT TTGGATCTCA GTATTGTTAC GGATCGACCG
TTCGCCGGCC GTGCAAGGCT GGTCCATGAG GGCTGCCAGT TGAAGAATTC CTGCTCCCAG
CGGGAATTCG AACGCCTGAT CGGCACCATG ATCAACGGAC CGGTCCAGCT GGTTAATCCT
GTTGAGGAAT ACCGTGCGAT CCTCTCGCAG GACCGGTGCT ATTTTGGAAG GGTGTTGTAC
CGGATTGATC GCGGGGCATA TGATGCCCGG AACCCCGGTA AGAGGGAATT TTTCCACCCC
GGTGTCATGA TGCCCCGTAT GGCACGGACA CTGGTCAACC TCTCGCTTTG CGGTTCCGGG
GCAATCCTGC TTGATCCGTT CTGCGGGACC GGAGGTATCC TGATTGAGGC AGAGATACTC
TCTATGAATG CAATTGGCAG TGACTTTGAC CCCATGATGA TCCGGGGAAG TGCCGGTAAT
GTGAGCTCAA GTACCCTCCT TCTTGCTGAT ACCACCAGCC TGCCGGTGCG TGATCGTTCG
GTGGATGCGG TTGTGACGGA TTTCCCGTAC GGCCAGTCCG TCTGCATCAA AAAAGCGGAT
ACCATGGAGC GCCTGTACTA TGATGCGCTC GGTGAGATCA ATCGTATTTT AAAACCGGGT
GCCCGGGCGG TTGTTGTGAC ACACCGCGAT ATTTCCTGCA TTGCCGTGCA GCATATGGCT
GTGCTCCAGC ATCATACCCA GCGGGTACAC AAAAGTCTTA CCCGGCATAT CCTTGTGCTG
GGGAAATGA
 
Protein sequence
MKLLFELSGE NPTLPCAELE CVGRVLEARP QVAVAECAHP NNTLRLAMTH GVLEYLGECD 
PDPASFRKLL LDLSIVTDRP FAGRARLVHE GCQLKNSCSQ REFERLIGTM INGPVQLVNP
VEEYRAILSQ DRCYFGRVLY RIDRGAYDAR NPGKREFFHP GVMMPRMART LVNLSLCGSG
AILLDPFCGT GGILIEAEIL SMNAIGSDFD PMMIRGSAGN VSSSTLLLAD TTSLPVRDRS
VDAVVTDFPY GQSVCIKKAD TMERLYYDAL GEINRILKPG ARAVVVTHRD ISCIAVQHMA
VLQHHTQRVH KSLTRHILVL GK