Gene Mboo_1998 details


Gene Information

Locus tag: Mboo_1998
Symbol:
ID: 5410422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2064186
End bp: 2065724
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 52%
IMG OID: 640869240
Product: nitrogenase
Protein accession: YP_001405155
Protein GI: 154151537
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
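
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database itself: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical. Under those assumptions the values reproduce the listed 1539 bp and 512 aa.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2064186
END_BP = 2065724


def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    print(length, protein_length_aa(length))
```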


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATCAA AAAATGACGT CCACATCTGC GATAAAAAAC TAAATCTTGA AGAGAAGACC 
TGCCCCAACC GGGAGGATCG GGCCAACGGT ATCAACGTGT ACTATGGCCC GGTTACCCAG
TTAATTAAAG ATGCAAAAGC CGGCAATCTC AAATTGAGCG AGCGAAAATT CCAGCAATCA
GCAGGGTGTG CCCTCAATTT TTACCTTGCC ATCCGTATCA GCACAATACG GGATGCCGTA
CTCATTTTCC ATGCGCCGGT AGGTTGCTCG GCGTCAGCTC TTGGGTACAG GGAACTCTTC
CGCAATATAC CCGCAGCAGA TGGCAGGCCT CCCTTTGACC TCCACTGGCT GACAACGGGA
ATTACCGAGA AGGACATTGT CTACGGTGCC GGTGACAAAC TCAAAAAAGC CATACGAGTG
GCCGAAGAGC GGTATAAGCC CCGGGCAATC TTTGTGCTCA CTTCCTGTGC ATCCGGTATT
ATTGGAGAAG ACATTGAAGG CGCTGTCAGC GCTATGCAAC CTCACATCAA GGCAAAGATT
GTACCGGTCC ACTGCGAAGG CATCCGGTCG AGACTCGTAC AAACCGGTTA CGATGCATTC
TGGCACGGCG TATTAAAGTA CCTGGTCAAA AAGCCTGAAA AGAAACAGGA TGACTTAATC
AATATCGCAA GCATGCTCTC CTACACCTGG CAGGATCGCA GGGAGATGAC CGGCATTTTA
AAAAGAATGG GACTGCGGCC GAATTTTGTC CCGGAGTTTG CAAGCGTGCA GCAACTCGAA
GATCTCTCCG AGGCGGCGGT AACAGCTCCC CTCTGTGCAT CCTATACGGA CTATATCTCC
CGGGGCCTGG AGCAGGAATA CGGCGTACCC TATTTCATGT ACCCGTCTCC GGTAGGTTTT
CAGAATACGG ACGAATGGCT CCGTAAGATC GCGGAGTACA CCGGAAAAGA ACGGGAGGCC
GAGACGGTCA TCGAAGAGGA ACACAGGAAA TGGGGTCCCC GCATGGATGC TATCCGAAAA
GAACTCCAAA ACTTCAAGGG CAACGGCAAG AAGATTGAGA TGCTTGGTGC CCTTGGCCAG
GGCCGGCTCT TATCCCAACT GCCGTACTTC GATGAACTGG GCATTAAGTC ATCAGCAGCC
CTGTCCCAGG ATTATGACAA CCTGATCCTG GAAGACCTTG AAAAAGTCGT TGACCAGGTC
GGGGATTTCA ACATCCTGGT CAATACCTTC CAGGCTGCAG AACAGGCTCA TATCACAAGA
ATGCTCAACC CGGATATTAC GCTGACCTGC CCGTTCCAGG GAAGTGCCTA CAAGCGGAAC
AAAGGCGCTA CCCGTACCCA TGCAGTGCGG TGCGATAATC TCAAGTGGAG CCAGCAGACC
GCATATGCGG GAGCCGTGGC CTATGGCAGC TTCCTGCTCC AGGGGATGAA AAGTATCTCT
TGGCAAAAAA CTATGCTCGA AAAGACCCAG TACGGGTACA AGGACTGGTA CTTCAGGCAG
CCCAACCCGC TTTACTACCT CGAAAAGGAC GCTGTCTAA
 
Protein sequence
MQSKNDVHIC DKKLNLEEKT CPNREDRANG INVYYGPVTQ LIKDAKAGNL KLSERKFQQS 
AGCALNFYLA IRISTIRDAV LIFHAPVGCS ASALGYRELF RNIPAADGRP PFDLHWLTTG
ITEKDIVYGA GDKLKKAIRV AEERYKPRAI FVLTSCASGI IGEDIEGAVS AMQPHIKAKI
VPVHCEGIRS RLVQTGYDAF WHGVLKYLVK KPEKKQDDLI NIASMLSYTW QDRREMTGIL
KRMGLRPNFV PEFASVQQLE DLSEAAVTAP LCASYTDYIS RGLEQEYGVP YFMYPSPVGF
QNTDEWLRKI AEYTGKEREA ETVIEEEHRK WGPRMDAIRK ELQNFKGNGK KIEMLGALGQ
GRLLSQLPYF DELGIKSSAA LSQDYDNLIL EDLEKVVDQV GDFNILVNTF QAAEQAHITR
MLNPDITLTC PFQGSAYKRN KGATRTHAVR CDNLKWSQQT AYAGAVAYGS FLLQGMKSIS
WQKTMLEKTQ YGYKDWYFRQ PNPLYYLEKD AV
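
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The sketch below, written under stated assumptions, strips the whitespace, computes a GC fraction, and translates a CDS codon by codon. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 also permits alternative start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here).

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid assignments of the standard code (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence listed above.
    print(translate("ATGCAATCAA AAAATGACGT C"))
```

Applied to the first 21 bases of the gene sequence, `translate` returns `MQSKNDV`, which matches the start of the listed protein sequence. Pasting the full gene sequence into `gc_fraction` gives a value that can be compared with the GC content field.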