Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0233 |
Symbol | nifE |
ID | 3102543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 231399 |
End bp | 233081 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637169455 |
Product | nitrogenase iron-molybdenum cofactor biosynthesis protein nifE |
Protein accession | YP_112768 |
Protein GI | 53802570 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0640534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGCT TATCGAGCAA GATTCAGGAC GTCTTCAACG AGCCGGGTTG CGGCAAGAAC CAGGGCAAGT CCGAGAAGGA ACGCAAGAAG GGCTGCACCA AGCAGTTACA GCCCGGCGGG GCCGCCGGCG GCTGCGCCTT CGACGGCGCC AAGATCGCCC TGCAGCCGAT CACCGACGTG GCCCATCTGG TCCACGGCCC CATCGCTTGC GAAGGCAATT CCTGGGACAA CCGTGGCTCC AAGTCCTCCG GCTCCAACCT CTGGCGCACC GGTTTCACCA CCGACATCAA TGAGACCGAC GTGGTGTTCG GCGGCGAGAA GCGGCTGTAC AAGTCCATCA AGGAGATCGT CGAGAAATAC GACCCGCCCG CGGTCTTCGT CTACCAGACC TGCGTGCCGG CCATGATCGG CGACGACATC GAAGCCGTGT GCAAAGCGGC CTCGAAGAAA TTCGGCAAGC CGGTGATCCC GGTCAACTCC CCCGGTTTCG TCGGTCCGAA AAACCTGGGC AACAAGCTGG CGGGCGAAGC CATCCTCGAC CATGTCATCG GTACCGAGGA GCCTTCCTAC ACCACGCCTT ACGACATCAA CATCATCGGC GAATACAACC TGTCCGGCGA GCTATGGCAG GTGAAGCCGC TGCTGGACGA GCTCGGCATC CGCATCCTGT GCTGCATCTC CGGCGATGCC AAATACCGCG ACGTGGCCTG TTCGCACCGG GCCCGGGCGG CGATGATGGT GTGTTCCAAG TCCATGATCA ACATCGCCCG CAAGATGGAG GAGCGCTATG GCATCCCCTT CTTCGAGGGT TCTTTCTACG GCATCGGCGA CACCAGCGAC GCCTTGCGCG AGATTGCGCG GCTGCTGATC GAGCGTGGCG CGCCGGCGGA GCTGATGGAG CGCACCGAAG CCCTGATAGC CCGTGAGGAA GCCCGGGCCT GGGCCGCGAT CGAACCGTAC AAGAAGCGCC TGACCGGCAA GAAGGTGCTG CTCATCACCG GCGGCGTGAA ATCCTGGTCC GTGGTGGCCG CGTTGCAGGA AAGCGGCATG GAGGTGGTCG GCACCAGCGT GAAGAAATCG ACCAAGGAAG ACAAGGAGCG CATCAAGGAG ATCATGGGCG AGGACGCCCA CATGATCGAC GACATGACGC CGCGCGAAAT GTACAAGATG CTCAAGGAGG CCAAGGCCGA CATCATGCTG TCCGGCGGGC GTTCCCAGTT CGTCGCGCTC AAGGCCAAGA TGCCCTGGCT GGACATCAAC CAGGAGCGCC ATCACGCCTA CATGGGCTAT GTGGGCATGG TGGAACTGGT CAAGGAAATC GACAAGGCGC TGTTCAATCC AGTCTGGGAG CAGGTGAGGA AGCGCGCACC CTGGGAGGAA ACCACCTGGG AAGAGCGGGC CGACGCGGCT CTCGCCGCCG AAGCCGCCGC GTTGGCCGCC GATCCGGAGC TGGCCAGGGC GCAGCGCCGC GCCGCCCGCA TCTGCAAGTG CAAGGCGGTG GACCGCGGTG CCATCGAGGA CGCCATTCTC GCGTACGGGC TGGAAAGCGT CGAAGCGGTG ACCGAGCGTA CCCACGCGGG CAGCGGCTGC ACCGGTTGCA CCGGAACGAT CGCCGGCATC CTCGACGGCA TCGAAGACTG GCGGCCGGCT CCCTCGGCTG AACCGGCAAA ACGGGCAGCC TGA
|
Protein sequence | MSSLSSKIQD VFNEPGCGKN QGKSEKERKK GCTKQLQPGG AAGGCAFDGA KIALQPITDV AHLVHGPIAC EGNSWDNRGS KSSGSNLWRT GFTTDINETD VVFGGEKRLY KSIKEIVEKY DPPAVFVYQT CVPAMIGDDI EAVCKAASKK FGKPVIPVNS PGFVGPKNLG NKLAGEAILD HVIGTEEPSY TTPYDINIIG EYNLSGELWQ VKPLLDELGI RILCCISGDA KYRDVACSHR ARAAMMVCSK SMINIARKME ERYGIPFFEG SFYGIGDTSD ALREIARLLI ERGAPAELME RTEALIAREE ARAWAAIEPY KKRLTGKKVL LITGGVKSWS VVAALQESGM EVVGTSVKKS TKEDKERIKE IMGEDAHMID DMTPREMYKM LKEAKADIML SGGRSQFVAL KAKMPWLDIN QERHHAYMGY VGMVELVKEI DKALFNPVWE QVRKRAPWEE TTWEERADAA LAAEAAALAA DPELARAQRR AARICKCKAV DRGAIEDAIL AYGLESVEAV TERTHAGSGC TGCTGTIAGI LDGIEDWRPA PSAEPAKRAA
|
| |