Gene Mboo_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2009 
Symbol 
ID5411890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2078472 
End bp2080022 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content55% 
IMG OID640869251 
Productnitrogenase 
Protein accessionYP_001405166 
Protein GI154151548 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0918882 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.238149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA ACCAGGCACC TCATCTTGTC AATGAACAGA TCAATCTTGC TGATGCAACC 
TGCCCGAACC GCGAGGAGCG GGCACACGGG ATCAATGTGT ACTATGGAAA AGCATCTGAA
CTCCTGCGTG ATGCCAGGAG TGGAAACCTC AAACAGGTTG ACCGCAAATT CCAGCAGACC
TCAGGCTGCA CACTGAACTT CTACCTGACA GTCCGGGTCA ACACCATCCG GGATGCTGCC
ATTGTTTACC ATGCGCCGGT CGGTTGCTCT TGCCCGTCTC TCGGGTACCG TGAGCTGTTC
CAGCACATCC CGACAAGCAT GGGCATGCCC GAGAATTACG ATCTTCACTG GCTGACCACC
AGCATGAACG AGAAGGATAT GGTCTATGGG TCTACGGACA AGCTCAAGGC TGCAATTCTT
GAGGCACAGC GCCGTTACGA TCCCAAGGCG ATCTTCGTCC TGACTTCCTG TGCATCGGGA
ATCATCGGTG AGGATATAGA AGGGGCAGTT AACGAGGTCC AGCCAAAGGT ACGTGGACGG
ATCGTTCCCA TCCATTGCGA GGGTATACGA TCGCGGCTGG TGCAGACCGG GTACGATGCG
TTCTGGCATG CCGTCCTGAA GTACCTTGTG AAAAAACCGC AGAAGAAACA GAAAGACCTG
GTCAATGTTG CAAGCATGCT TTCGTATACC TGGCAGGACA GGCTTGAGAT CAAGAGACTC
CTTGGGAAGA TGGGGCTGCG GGTAAACTAT GTGCCGGAGT TCGCGTCCGT GGAACAGTTC
GAGCAGCTCT CGGAAGCCGC GGTGACGGCA CCATTGTGCC CTACCTACAC CGATTACCTC
TCCCGGGGAC TCGAACAGGA ATACGGTGTG CCGTTCTTCA TGTACCCGTC ACCGATGGGA
TTTTCCGGTA CCGACGGCTG GCTTCGGGAG ATAGGAAAAT ACACGGGTAA GGAGAAAGAG
GCCGAAGTGG TCATTGCAGA AGAGCACAGG AAATGGGACC CGAAACTCGC GGCGATCCAG
GAAGAGTTCC TTCATATCAA GCCAAACGGG GAGAAAGTGG AAGTGCTCGG CGCACTCGGC
CAGGGGCGGC TGCTTGCACA GGTGCCGTAC TTCGATGAAC TCGGGGTCAA ATCATCAGCC
GCGATGTGCC AGGACTATGA TAACCTCATC ATTGACGAGT TGGAAAAAGT GATCGCGCAG
GTCGGAGACT TCGATATCCT GGTCAACACG TTCCAGGCTG CCGAGCAGAC CCATATCAAC
CGGATTCTCG ACCCGGACAT GACGCTTACG TGCCCGTTCC AAGGAAGCGC CTACAAGCGC
CTGAAAGGCG TTACCCGTGT ACACGCACTC CGGGGTGACC CGAACCTCTG GGCCCAGCAG
AGCGCCTATG CCGGTGCTGT TGCGTACGGG AATTTCCTGC TCCAGGCATT CAAGAGCAGA
TCCCTCCAAC AGACCATGAA AGAGAAGACT GCGGACAACT ACAAGGCCTG GTACTTTGAA
CAGGACAATC CGCTCTACTT CAGGGACAAC GACGAACCGG TGGTCTCGTG A
 
Protein sequence
MTKNQAPHLV NEQINLADAT CPNREERAHG INVYYGKASE LLRDARSGNL KQVDRKFQQT 
SGCTLNFYLT VRVNTIRDAA IVYHAPVGCS CPSLGYRELF QHIPTSMGMP ENYDLHWLTT
SMNEKDMVYG STDKLKAAIL EAQRRYDPKA IFVLTSCASG IIGEDIEGAV NEVQPKVRGR
IVPIHCEGIR SRLVQTGYDA FWHAVLKYLV KKPQKKQKDL VNVASMLSYT WQDRLEIKRL
LGKMGLRVNY VPEFASVEQF EQLSEAAVTA PLCPTYTDYL SRGLEQEYGV PFFMYPSPMG
FSGTDGWLRE IGKYTGKEKE AEVVIAEEHR KWDPKLAAIQ EEFLHIKPNG EKVEVLGALG
QGRLLAQVPY FDELGVKSSA AMCQDYDNLI IDELEKVIAQ VGDFDILVNT FQAAEQTHIN
RILDPDMTLT CPFQGSAYKR LKGVTRVHAL RGDPNLWAQQ SAYAGAVAYG NFLLQAFKSR
SLQQTMKEKT ADNYKAWYFE QDNPLYFRDN DEPVVS