Gene Mboo_1997 details


Gene Information

Locus tag: Mboo_1997
Symbol:
ID: 5410421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2062812
End bp: 2064170
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 54%
IMG OID: 640869239
Product: nitrogenase
Protein accession: YP_001405154
Protein GI: 154151536
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
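
The coordinates and lengths listed above are mutually consistent, and a short check makes the relationship explicit. The sketch below assumes 1-based, inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2062812, 2064170)
print(length)                  # 1359
print(protein_length(length))  # 452
```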


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAA CGGGCATTGA AAATGAATCA TCAGGCGACG ATATCCATGA TATCCAGGAG 
GCACCCCGGT ATTCGTGTTC GCTTGCCGGT GCCTATGCAT CGACGCTGGG TGTGTATGGA
GCGGTACCTG TCCTGCATTC CGGAGGGGGC TGCGGGGTCG CCCAGCTCTT CGGCCAGTTC
TACACCTCGG GCGAGAGCGC ACCGGGTGTG CAGGGCGGCA CCGGCACACC CTGTACTTCC
CTGGTGGAGC AGCATGTTAT CTTCGGTGGA GAGGACCGGC TGCGAAAACT CATCAGATCT
ACAACTGAAC TGATGGATGG CAGCCTGTTC GTGGTAATAA CCGGCTGCGT ACCGTCACTT
ATCGGGGACG ATGTAGAATC CGTGGTCAGG GAATTCAAAG AGGAGACGAA GAACCGGATA
CCCATCATTT ACGTGAACGC GCCGGGATTT TCCGGGAATA CGTACAAGGG GTACGAATTA
TTCTTAAATG CCGTGATCGA CCAGTACCTG GTGCCCATAA AAAAGAAGAA GCGCACCATC
AACATTCTCG GGGTCGTACC ATTCCAGCAC GTGTTCTGGA AAGGGGATCT GGAGATACTC
AAAAACACCT TTGCCAAGAT CGGTGTTGAG GTAAACACGA TCTTTACCGA GTTTGACGGC
CCGGAAAAAT TAAAACGGAT CCCGGCTGCC GAGTTAAACC TCGTTCTTAA CCCCTGGCTT
GGCCACAGCA TAGCAGAAAG GCTTGAGGAG AAGTTTGGGA CGCCGTACGT CACCTTCCCG
AGCGTACCTG TTGGTCCCCA GCAGACAACG GCCCTGCTTG GCATCGTGGC AGAAAAACTT
GGCATCCCGG AGAAAAAAGT AAAGGAGGTT GTCGCAGCTG AAGAGCGCAG GGCATACCGT
AACGTGGAGT ATTTCGGAGA TGCCCTGATT ATCGGGATGC CACACTCGTA CACCGCAGTC
GTCGGCGACA GCGCTACGGC AATCGGCATC ACGAAATATA TCGCAAACGA GGTTGGGTAC
CTGCCGGATA TTACCATTAT CACGGATAAT CCTCCGGAAG AAAAGCGCCC GGAGATCCTG
CGCGAGCTTT ATGAGCACAT CGACTGCGTG GTGAAACCCG AGATCTTCTT TGAGTCGGAC
ACGCACAAGA TCCGGCAGCT GTTAAAGGGC AGGAGTTTTT TAGTCCTGCT CGCAAGCTCG
CTTGAAAAGT ACATTACCGA TGAGTTTAAC GATGCGCTGC ACCTGAGTGT CTCGTTCCCG
ATAAACGACC GGATGATTAT CGACCACTCG TATGCCGGGT ACCGTGGCGG CATCGTGCTC
ATGGAAGACA TTATTACAAA ATTCACCGGG CCGCTCTGA
 
Protein sequence
MAETGIENES SGDDIHDIQE APRYSCSLAG AYASTLGVYG AVPVLHSGGG CGVAQLFGQF 
YTSGESAPGV QGGTGTPCTS LVEQHVIFGG EDRLRKLIRS TTELMDGSLF VVITGCVPSL
IGDDVESVVR EFKEETKNRI PIIYVNAPGF SGNTYKGYEL FLNAVIDQYL VPIKKKKRTI
NILGVVPFQH VFWKGDLEIL KNTFAKIGVE VNTIFTEFDG PEKLKRIPAA ELNLVLNPWL
GHSIAERLEE KFGTPYVTFP SVPVGPQQTT ALLGIVAEKL GIPEKKVKEV VAAEERRAYR
NVEYFGDALI IGMPHSYTAV VGDSATAIGI TKYIANEVGY LPDITIITDN PPEEKRPEIL
RELYEHIDCV VKPEIFFESD THKIRQLLKG RSFLVLLASS LEKYITDEFN DALHLSVSFP
INDRMIIDHS YAGYRGGIVL MEDIITKFTG PL
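
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. A minimal sketch, using hypothetical helper names, that normalizes such a block and computes its length and GC fraction:

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGGCGGAAA CGGGCATTGA AAATGAATCA TCAGGCGACG ATATCCATGA TATCCAGGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq) * 100))
```

Applied to the full gene sequence, the cleaned length should equal the listed gene length of 1359 bp, and the resulting GC percentage can be compared with the listed GC content.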