Gene Information |
Locus tag | Mboo_1148 |
Symbol | |
ID | 5411595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1150862 |
End bp | 1152253 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640868374 |
Product | nitrogenase |
Protein accession | YP_001404309 |
Protein GI | 154150691 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
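The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1150862
END_BP = 1152253
GENE_LENGTH_BP = 1392
PROTEIN_LENGTH_AA = 463


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


def record_is_consistent() -> bool:
    """True when the coordinates, gene length and protein length agree."""
    return (
        span_length(START_BP, END_BP) == GENE_LENGTH_BP
        and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    )
```

Under these assumptions the record is internally consistent: 1152253 - 1150862 + 1 = 1392 bp, and 1392 / 3 - 1 = 463 aa.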
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.134067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.680511 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCGGTG AAAATGCTGC CCCGGTACCG GGCCGGGCAA AACAGGTCAA TGAGAACCAG TGTAACATGT GCATGCCCAT CGGCGGTGTC GTGGCATTCA AGGGGATCAA GAATGCGATG GTGCTCGTCC ATGGCTCGCA GGGGTGCAGC ACGTACATGC GCCTGGCCAA TGTCGAGCAC TTCAATGAGC CATTTGACAT CGCGTCCTCG TCCTTGAACG AGAAACAGAC GATCCACGGA GGCGAGGCAA ACCTCAAAAA AGCCATGGAC AATGTCATCA GGGTGTACCA GCCCAACGTC CTTGGAATCT TAACGACCTG CCTTGCCGAG ACCATGGGCG AAGACCTCGA CCGGATCGTC GCCTCCTACA TTGAAGAAAA AAAGATCGCC GGGATCGATA TCATCCCGGT CCCGACGCCC AGCTACAGCG GTACGCATGC CGAGGGGTTC TGGGCGGCCG CCCGGGGGAT TGTTGCCTAC TATGCACGTC CCGCATCATC CCACCGGCGC ATCAACGTGA TAATCCCGAA CATCAGCCCG GCCGACATCC GCGAGATCAA ACGTATCCTC GGACTCATGG GCCTTGAGTA CACGCTGCTC CCGGACTTCT CCATGACGCT TGACCGCCCG TTTGGCGGGA AATACCAGAA GATCCCGACC GGGGGAACCC CGGCAGAACG GATAGCGGAG ATGCCGGGAG CCCCGGTAAC AATCCAGTTT GGGACAACCT GCCCTGACAG CCTCTCACCC GGCCGGTATC TCGAACAGGA GTACAATGTC CCGCTCGTGA ACCTCCCCCT CCCGATCGGC CTTGGGAATG TGGATCTCTT TGTTGAGACC CTTGCGAACG TCAGCGGGAA TCCCGTTCCC GAAGAACTCG CGCTCGAACG GGGCTGGCTC ATTGACGGCA TGGCGGATTC CCACAAGTTC AATGCCGACG GGAGGCCGGT GGTCTATGGC GAACCGGAGC TCGTGTACGC GTTTTCGATG GCCTGCGCAG AGAACGGGGC CATGCCGGTG GTCATCTCGA CCGGCTCAAA AAACAGCCTC CTTAAAGAAC GGCTCGCACC GCTCGTCCTC AATGCGGACG AACCTCCGGT CGTTCTTGAG GAAGCAGACT TTGCGGCCGT TGCCGCTGCG GCGCTCGGGG CTGAGGCAAA TATAGCGATT GGGCACTCCG GGGGAAAACT GCTTACCGAG CGGTACGGCA TTCCCCTTGT CCGGGCCGGT TACCCGATCC ATGACCGGAT CGGCGGCCAG AGGATCCTGT CGGCCGGCTA CCGGGGAACG CTCGCGTTTC TCGACCGGTT CACCAACACG CTGTTAGAAC ACAAGTACGC CACGTACCGC CAGAAGATCA AAGAAGAACT CTGCACAGCA GAAGGAGTGT AA
Protein sequence | MPGENAAPVP GRAKQVNENQ CNMCMPIGGV VAFKGIKNAM VLVHGSQGCS TYMRLANVEH FNEPFDIASS SLNEKQTIHG GEANLKKAMD NVIRVYQPNV LGILTTCLAE TMGEDLDRIV ASYIEEKKIA GIDIIPVPTP SYSGTHAEGF WAAARGIVAY YARPASSHRR INVIIPNISP ADIREIKRIL GLMGLEYTLL PDFSMTLDRP FGGKYQKIPT GGTPAERIAE MPGAPVTIQF GTTCPDSLSP GRYLEQEYNV PLVNLPLPIG LGNVDLFVET LANVSGNPVP EELALERGWL IDGMADSHKF NADGRPVVYG EPELVYAFSM ACAENGAMPV VISTGSKNSL LKERLAPLVL NADEPPVVLE EADFAAVAAA ALGAEANIAI GHSGGKLLTE RYGIPLVRAG YPIHDRIGGQ RILSAGYRGT LAFLDRFTNT LLEHKYATYR QKIKEELCTA EGV