Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0555 |
Symbol | |
ID | 7271971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 547235 |
End bp | 548605 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643569202 |
Product | Nitrogenase |
Protein accession | YP_002465651 |
Protein GI | 219851219 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.376218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAA CACGGGTACG ACAGGTGAAC GAGAACCAGT GCCAGATGTG TATGCCCCTC GGGGGCGTCG TCGCCTTCAA GGGGATCGAA GGGGCGATGG TGCTGGTGCA CGGTTCCCAG GGATGCAGCA CCTACATGCG GCTCGCAAAT GTCGAACATT ACAACGAACC GATCGATGTG GCGTCATCAG CTCTCAACGA GAAACAGACC ATCTACGGCG GGGAGAAGAA CCTGAAGAAG GCACTGGACA ACGTGATCAG GGTTTATGAG CCGAAGGTGC TCGGGATCGT CACCTCCTGC CTCGCAGAGA CGATGGGAGA GGACCTCACG CGGATGATCG AGTCCTACAC CAGGGAACGG AGTACCGAGG GGATAGACAT CATCCCGGTG GCCACACCGA GTTATGCGGG GAGCCACACC GAGGGATTCT GGGCGGCGAC AAGAGACCTC ATCGCCTACT TTGCCAGACC GACCGAACCG CACCAGCGGA TCAATGTGAT CATCCCCCAT ATCAGCCCGG CGGATATTCG TGAGATCAAG CGGATCTTCG ATCTGATGGG GCTTGAGTAC ATGCTGATCC CCGACTACTC CATGACCCTG GACCGCCCCT TTGGGGGACG GTACCAGAAG ATCCCGCCAG GCGGCACCAG CACCGCCGAC ATCGCAGCGA TGCCCGGGGC ACGGGCTACC GTCCAGTTCG GGCTGACCTG CCCGGACGAC CTGTCGCCGG GGCTGTACCT GCAGAAGCAG TTCGGCGTCC CGCTGATCAC CCTGCCGTTA CCGATTGGCC TCCAGAACAC CGACCGGCTG ATGGAGACCC TGCAGAGACT GAGCGGCCGG CCGCTGCCCG AAACCCTGGC CCTGGAGCGG GGATGGCTCC TCGATGGGAT GGCGGACTCC CACAAGTACA ATGCAGAAGG ACGCCCGGTC ATCTATGGTG AGCCTGAACT GGTCAACGCC TGTGTCAGCC TTTGCCTGGA GAACGGAGCC ATTCCAGCAG TCATCGCCAG CGGAACCAGG AACAGCCGGC TGGAGGAGGT GCTCACACCC CAACTGGCAG ATGCTGATGA AGCGCCGGTG CTCCTTGAGG AGGCCGACAT CGCCGCCATT TCAGAGGCAG CCTGCACGAC GAAGGCAAAT ATCGCCATCG GCCATTCAGG GGGACGGTCC CTGACCGAAC GACAGGGGAT CCCCATCGTC AGGGTGGGAT TTCCCATACA TGACCGGGTT GGAGGTCAGC GGCTTCTCTC CGCTGGATAT GCGGGGACAC TGGCATTCCT CGACCGGTTC ACCAACACGC TGCTGGAGGC AAAGTACAGT TCCTATCGGC AGCAGCGAAA AGACGAGATG ATCACCAGAG GAGGTATCTG A
|
Protein sequence | MSETRVRQVN ENQCQMCMPL GGVVAFKGIE GAMVLVHGSQ GCSTYMRLAN VEHYNEPIDV ASSALNEKQT IYGGEKNLKK ALDNVIRVYE PKVLGIVTSC LAETMGEDLT RMIESYTRER STEGIDIIPV ATPSYAGSHT EGFWAATRDL IAYFARPTEP HQRINVIIPH ISPADIREIK RIFDLMGLEY MLIPDYSMTL DRPFGGRYQK IPPGGTSTAD IAAMPGARAT VQFGLTCPDD LSPGLYLQKQ FGVPLITLPL PIGLQNTDRL METLQRLSGR PLPETLALER GWLLDGMADS HKYNAEGRPV IYGEPELVNA CVSLCLENGA IPAVIASGTR NSRLEEVLTP QLADADEAPV LLEEADIAAI SEAACTTKAN IAIGHSGGRS LTERQGIPIV RVGFPIHDRV GGQRLLSAGY AGTLAFLDRF TNTLLEAKYS SYRQQRKDEM ITRGGI
|
| |