Gene Mboo_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1148 
Symbol 
ID5411595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1150862 
End bp1152253 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content60% 
IMG OID640868374 
Productnitrogenase 
Protein accessionYP_001404309 
Protein GI154150691 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.134067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.680511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGTG AAAATGCTGC CCCGGTACCG GGCCGGGCAA AACAGGTCAA TGAGAACCAG 
TGTAACATGT GCATGCCCAT CGGCGGTGTC GTGGCATTCA AGGGGATCAA GAATGCGATG
GTGCTCGTCC ATGGCTCGCA GGGGTGCAGC ACGTACATGC GCCTGGCCAA TGTCGAGCAC
TTCAATGAGC CATTTGACAT CGCGTCCTCG TCCTTGAACG AGAAACAGAC GATCCACGGA
GGCGAGGCAA ACCTCAAAAA AGCCATGGAC AATGTCATCA GGGTGTACCA GCCCAACGTC
CTTGGAATCT TAACGACCTG CCTTGCCGAG ACCATGGGCG AAGACCTCGA CCGGATCGTC
GCCTCCTACA TTGAAGAAAA AAAGATCGCC GGGATCGATA TCATCCCGGT CCCGACGCCC
AGCTACAGCG GTACGCATGC CGAGGGGTTC TGGGCGGCCG CCCGGGGGAT TGTTGCCTAC
TATGCACGTC CCGCATCATC CCACCGGCGC ATCAACGTGA TAATCCCGAA CATCAGCCCG
GCCGACATCC GCGAGATCAA ACGTATCCTC GGACTCATGG GCCTTGAGTA CACGCTGCTC
CCGGACTTCT CCATGACGCT TGACCGCCCG TTTGGCGGGA AATACCAGAA GATCCCGACC
GGGGGAACCC CGGCAGAACG GATAGCGGAG ATGCCGGGAG CCCCGGTAAC AATCCAGTTT
GGGACAACCT GCCCTGACAG CCTCTCACCC GGCCGGTATC TCGAACAGGA GTACAATGTC
CCGCTCGTGA ACCTCCCCCT CCCGATCGGC CTTGGGAATG TGGATCTCTT TGTTGAGACC
CTTGCGAACG TCAGCGGGAA TCCCGTTCCC GAAGAACTCG CGCTCGAACG GGGCTGGCTC
ATTGACGGCA TGGCGGATTC CCACAAGTTC AATGCCGACG GGAGGCCGGT GGTCTATGGC
GAACCGGAGC TCGTGTACGC GTTTTCGATG GCCTGCGCAG AGAACGGGGC CATGCCGGTG
GTCATCTCGA CCGGCTCAAA AAACAGCCTC CTTAAAGAAC GGCTCGCACC GCTCGTCCTC
AATGCGGACG AACCTCCGGT CGTTCTTGAG GAAGCAGACT TTGCGGCCGT TGCCGCTGCG
GCGCTCGGGG CTGAGGCAAA TATAGCGATT GGGCACTCCG GGGGAAAACT GCTTACCGAG
CGGTACGGCA TTCCCCTTGT CCGGGCCGGT TACCCGATCC ATGACCGGAT CGGCGGCCAG
AGGATCCTGT CGGCCGGCTA CCGGGGAACG CTCGCGTTTC TCGACCGGTT CACCAACACG
CTGTTAGAAC ACAAGTACGC CACGTACCGC CAGAAGATCA AAGAAGAACT CTGCACAGCA
GAAGGAGTGT AA
 
Protein sequence
MPGENAAPVP GRAKQVNENQ CNMCMPIGGV VAFKGIKNAM VLVHGSQGCS TYMRLANVEH 
FNEPFDIASS SLNEKQTIHG GEANLKKAMD NVIRVYQPNV LGILTTCLAE TMGEDLDRIV
ASYIEEKKIA GIDIIPVPTP SYSGTHAEGF WAAARGIVAY YARPASSHRR INVIIPNISP
ADIREIKRIL GLMGLEYTLL PDFSMTLDRP FGGKYQKIPT GGTPAERIAE MPGAPVTIQF
GTTCPDSLSP GRYLEQEYNV PLVNLPLPIG LGNVDLFVET LANVSGNPVP EELALERGWL
IDGMADSHKF NADGRPVVYG EPELVYAFSM ACAENGAMPV VISTGSKNSL LKERLAPLVL
NADEPPVVLE EADFAAVAAA ALGAEANIAI GHSGGKLLTE RYGIPLVRAG YPIHDRIGGQ
RILSAGYRGT LAFLDRFTNT LLEHKYATYR QKIKEELCTA EGV