Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_2009 |
Symbol | |
ID | 5411890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 2078472 |
End bp | 2080022 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640869251 |
Product | nitrogenase |
Protein accession | YP_001405166 |
Protein GI | 154151548 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0918882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.238149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAAAA ACCAGGCACC TCATCTTGTC AATGAACAGA TCAATCTTGC TGATGCAACC TGCCCGAACC GCGAGGAGCG GGCACACGGG ATCAATGTGT ACTATGGAAA AGCATCTGAA CTCCTGCGTG ATGCCAGGAG TGGAAACCTC AAACAGGTTG ACCGCAAATT CCAGCAGACC TCAGGCTGCA CACTGAACTT CTACCTGACA GTCCGGGTCA ACACCATCCG GGATGCTGCC ATTGTTTACC ATGCGCCGGT CGGTTGCTCT TGCCCGTCTC TCGGGTACCG TGAGCTGTTC CAGCACATCC CGACAAGCAT GGGCATGCCC GAGAATTACG ATCTTCACTG GCTGACCACC AGCATGAACG AGAAGGATAT GGTCTATGGG TCTACGGACA AGCTCAAGGC TGCAATTCTT GAGGCACAGC GCCGTTACGA TCCCAAGGCG ATCTTCGTCC TGACTTCCTG TGCATCGGGA ATCATCGGTG AGGATATAGA AGGGGCAGTT AACGAGGTCC AGCCAAAGGT ACGTGGACGG ATCGTTCCCA TCCATTGCGA GGGTATACGA TCGCGGCTGG TGCAGACCGG GTACGATGCG TTCTGGCATG CCGTCCTGAA GTACCTTGTG AAAAAACCGC AGAAGAAACA GAAAGACCTG GTCAATGTTG CAAGCATGCT TTCGTATACC TGGCAGGACA GGCTTGAGAT CAAGAGACTC CTTGGGAAGA TGGGGCTGCG GGTAAACTAT GTGCCGGAGT TCGCGTCCGT GGAACAGTTC GAGCAGCTCT CGGAAGCCGC GGTGACGGCA CCATTGTGCC CTACCTACAC CGATTACCTC TCCCGGGGAC TCGAACAGGA ATACGGTGTG CCGTTCTTCA TGTACCCGTC ACCGATGGGA TTTTCCGGTA CCGACGGCTG GCTTCGGGAG ATAGGAAAAT ACACGGGTAA GGAGAAAGAG GCCGAAGTGG TCATTGCAGA AGAGCACAGG AAATGGGACC CGAAACTCGC GGCGATCCAG GAAGAGTTCC TTCATATCAA GCCAAACGGG GAGAAAGTGG AAGTGCTCGG CGCACTCGGC CAGGGGCGGC TGCTTGCACA GGTGCCGTAC TTCGATGAAC TCGGGGTCAA ATCATCAGCC GCGATGTGCC AGGACTATGA TAACCTCATC ATTGACGAGT TGGAAAAAGT GATCGCGCAG GTCGGAGACT TCGATATCCT GGTCAACACG TTCCAGGCTG CCGAGCAGAC CCATATCAAC CGGATTCTCG ACCCGGACAT GACGCTTACG TGCCCGTTCC AAGGAAGCGC CTACAAGCGC CTGAAAGGCG TTACCCGTGT ACACGCACTC CGGGGTGACC CGAACCTCTG GGCCCAGCAG AGCGCCTATG CCGGTGCTGT TGCGTACGGG AATTTCCTGC TCCAGGCATT CAAGAGCAGA TCCCTCCAAC AGACCATGAA AGAGAAGACT GCGGACAACT ACAAGGCCTG GTACTTTGAA CAGGACAATC CGCTCTACTT CAGGGACAAC GACGAACCGG TGGTCTCGTG A
|
Protein sequence | MTKNQAPHLV NEQINLADAT CPNREERAHG INVYYGKASE LLRDARSGNL KQVDRKFQQT SGCTLNFYLT VRVNTIRDAA IVYHAPVGCS CPSLGYRELF QHIPTSMGMP ENYDLHWLTT SMNEKDMVYG STDKLKAAIL EAQRRYDPKA IFVLTSCASG IIGEDIEGAV NEVQPKVRGR IVPIHCEGIR SRLVQTGYDA FWHAVLKYLV KKPQKKQKDL VNVASMLSYT WQDRLEIKRL LGKMGLRVNY VPEFASVEQF EQLSEAAVTA PLCPTYTDYL SRGLEQEYGV PFFMYPSPMG FSGTDGWLRE IGKYTGKEKE AEVVIAEEHR KWDPKLAAIQ EEFLHIKPNG EKVEVLGALG QGRLLAQVPY FDELGVKSSA AMCQDYDNLI IDELEKVIAQ VGDFDILVNT FQAAEQTHIN RILDPDMTLT CPFQGSAYKR LKGVTRVHAL RGDPNLWAQQ SAYAGAVAYG NFLLQAFKSR SLQQTMKEKT ADNYKAWYFE QDNPLYFRDN DEPVVS
|
| |