Gene Mboo_1032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1032 
Symbol 
ID5412247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1014213 
End bp1015586 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content42% 
IMG OID640868258 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001404193 
Protein GI154150575 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.41908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.322454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCTGA CAGTTCCCGT TGCTGAAATT ATCAATAGTT CAACAAATCC TCTACTTGCG 
ATTGATCCCT CTTGGGAGCG CGTGCCTCTT GGAAAAATCG CAAAAGTACT GAATGGTTTT
GCATTTAAAT CAGAATTGTT TAACGATAAA AAAGGTACGC CTCTTATCCG CATTCGGGAT
ATCGGAAATA ACAAAACAGA GTGTTATTAT GACGGTGTAT TCGATGAAGC ATATGTCATA
CATCCGGGGG ATTTGCTCGT AGGGATGGAT GGGGATTTCA ATTGTTCTAC ATGGCGAGGT
CCAAAAGCCT TACTCAATCA ACGGGTTTGT AAAATTGAAG TTAATATTGA ACAATACAAC
AGAAAATTTT TAGAATATGT TTTACCGGGA TATCTGAAAG CTATCAATGA AAATACCTCT
TCGCAAACAG TGAAACATCT ATCGTCACGA TCAATCTCTG AAATACTTCT TCCAAATCCT
CCACTAACCG AACAGCAGCG CATCGTCGCC CGTGTCGAAG CCCTCCTGTC GCACGTCAAC
GCCGCCCGCG AACGGCTGAG CCGGGTGCCG TTGATCATGA AAAAGTTCCG GCAGGCAGTG
CTCGCGGCGG CGTGTAGTGG AGGGTTGACG GAGGGGTGGA GAAAGGAGAA TCCGGATATT
GAAGAAGCAA ATAAATTAGT CAAACGTCTA GAATCTATAA GAAAGCAATT TAAAATCCGC
GAAATTTCTT CAATAGATAA TTTAGAATTA TCTGACCTGC CAGATTCTTG GACTTGGATT
CGTTTAGCTA ATATTGCTAT CGTAATGGAT CCTGATCATA AAATGCCAAA AAGTTCAGAC
GGTGGAATAA TCTTTATTTC TCCAAAAGAC TTCAAAGAAA ATTATCAAAT TGATATGACA
AAAACAAAAC GGATATCCGA TGAAGAGTTT TTAAGATTAT CTAAAAAATT CGTCCCTAGA
CCGTTGGATA TTTTATATTC AAGAATTGGC GCAGATTTGG GGAAAGCAAG AAAAGCACCC
CAAGATATCA AATTTCATAT ATCATATAGT TTGGCGGTAA TCCGACAACT GGGTGAAATG
GAAAATTCTG ATTATTTGTT TTGGTTATTA AATTCAATGT TTATTAGGAA TCAGGCATTC
GAGAATGTGC GAAGCATCGG CGTTCCTGAT TTGGGATTAA GGGATATTGA TAATTTTATA
ATCCCCCTCC CACCCCTTGC CGAGCAGTAC GAGATCGTCC GGCGTGTCGG TTTACTGTTT
GAGCGTGCGG ATGCCATTGA TCGCGAGGTT GAAGCGGCGA CCCGGCGGTG CGAGCGGTTG
ACGCAGGCGG TACTGGGGAA GGCGTTCAGA GGAGAATTAA CGAGGAATTT ATGA
 
Protein sequence
MSLTVPVAEI INSSTNPLLA IDPSWERVPL GKIAKVLNGF AFKSELFNDK KGTPLIRIRD 
IGNNKTECYY DGVFDEAYVI HPGDLLVGMD GDFNCSTWRG PKALLNQRVC KIEVNIEQYN
RKFLEYVLPG YLKAINENTS SQTVKHLSSR SISEILLPNP PLTEQQRIVA RVEALLSHVN
AARERLSRVP LIMKKFRQAV LAAACSGGLT EGWRKENPDI EEANKLVKRL ESIRKQFKIR
EISSIDNLEL SDLPDSWTWI RLANIAIVMD PDHKMPKSSD GGIIFISPKD FKENYQIDMT
KTKRISDEEF LRLSKKFVPR PLDILYSRIG ADLGKARKAP QDIKFHISYS LAVIRQLGEM
ENSDYLFWLL NSMFIRNQAF ENVRSIGVPD LGLRDIDNFI IPLPPLAEQY EIVRRVGLLF
ERADAIDREV EAATRRCERL TQAVLGKAFR GELTRNL