Gene MCA0233 details

Gene Information

Locus tag: MCA0233
Symbol: nifE
ID: 3102543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 231399
End bp: 233081
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 64%
IMG OID: 637169455
Product: nitrogenase iron-molybdenum cofactor biosynthesis protein nifE
Protein accession: YP_112768
Protein GI: 53802570
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE
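
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; neither convention is stated in the record. The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 231399
END_BP = 233081
GENE_LENGTH_BP = 1683
PROTEIN_LENGTH_AA = 560


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 233081 - 231399 + 1 gives 1683 bp, and 1683 / 3 - 1 gives 560 aa, matching the listed values.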


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0640534
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGCT TATCGAGCAA GATTCAGGAC GTCTTCAACG AGCCGGGTTG CGGCAAGAAC 
CAGGGCAAGT CCGAGAAGGA ACGCAAGAAG GGCTGCACCA AGCAGTTACA GCCCGGCGGG
GCCGCCGGCG GCTGCGCCTT CGACGGCGCC AAGATCGCCC TGCAGCCGAT CACCGACGTG
GCCCATCTGG TCCACGGCCC CATCGCTTGC GAAGGCAATT CCTGGGACAA CCGTGGCTCC
AAGTCCTCCG GCTCCAACCT CTGGCGCACC GGTTTCACCA CCGACATCAA TGAGACCGAC
GTGGTGTTCG GCGGCGAGAA GCGGCTGTAC AAGTCCATCA AGGAGATCGT CGAGAAATAC
GACCCGCCCG CGGTCTTCGT CTACCAGACC TGCGTGCCGG CCATGATCGG CGACGACATC
GAAGCCGTGT GCAAAGCGGC CTCGAAGAAA TTCGGCAAGC CGGTGATCCC GGTCAACTCC
CCCGGTTTCG TCGGTCCGAA AAACCTGGGC AACAAGCTGG CGGGCGAAGC CATCCTCGAC
CATGTCATCG GTACCGAGGA GCCTTCCTAC ACCACGCCTT ACGACATCAA CATCATCGGC
GAATACAACC TGTCCGGCGA GCTATGGCAG GTGAAGCCGC TGCTGGACGA GCTCGGCATC
CGCATCCTGT GCTGCATCTC CGGCGATGCC AAATACCGCG ACGTGGCCTG TTCGCACCGG
GCCCGGGCGG CGATGATGGT GTGTTCCAAG TCCATGATCA ACATCGCCCG CAAGATGGAG
GAGCGCTATG GCATCCCCTT CTTCGAGGGT TCTTTCTACG GCATCGGCGA CACCAGCGAC
GCCTTGCGCG AGATTGCGCG GCTGCTGATC GAGCGTGGCG CGCCGGCGGA GCTGATGGAG
CGCACCGAAG CCCTGATAGC CCGTGAGGAA GCCCGGGCCT GGGCCGCGAT CGAACCGTAC
AAGAAGCGCC TGACCGGCAA GAAGGTGCTG CTCATCACCG GCGGCGTGAA ATCCTGGTCC
GTGGTGGCCG CGTTGCAGGA AAGCGGCATG GAGGTGGTCG GCACCAGCGT GAAGAAATCG
ACCAAGGAAG ACAAGGAGCG CATCAAGGAG ATCATGGGCG AGGACGCCCA CATGATCGAC
GACATGACGC CGCGCGAAAT GTACAAGATG CTCAAGGAGG CCAAGGCCGA CATCATGCTG
TCCGGCGGGC GTTCCCAGTT CGTCGCGCTC AAGGCCAAGA TGCCCTGGCT GGACATCAAC
CAGGAGCGCC ATCACGCCTA CATGGGCTAT GTGGGCATGG TGGAACTGGT CAAGGAAATC
GACAAGGCGC TGTTCAATCC AGTCTGGGAG CAGGTGAGGA AGCGCGCACC CTGGGAGGAA
ACCACCTGGG AAGAGCGGGC CGACGCGGCT CTCGCCGCCG AAGCCGCCGC GTTGGCCGCC
GATCCGGAGC TGGCCAGGGC GCAGCGCCGC GCCGCCCGCA TCTGCAAGTG CAAGGCGGTG
GACCGCGGTG CCATCGAGGA CGCCATTCTC GCGTACGGGC TGGAAAGCGT CGAAGCGGTG
ACCGAGCGTA CCCACGCGGG CAGCGGCTGC ACCGGTTGCA CCGGAACGAT CGCCGGCATC
CTCGACGGCA TCGAAGACTG GCGGCCGGCT CCCTCGGCTG AACCGGCAAA ACGGGCAGCC
TGA
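
The gene sequence above is laid out in blocks of 10 bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch using only the Python standard library. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is just the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGAGCAGCT TATCGAGCAA GATTCAGGAC GTCTTCAACG AGCCGGGTTG CGGCAAGAAC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Pasting the full sequence in place of `first_line` would give a gene-wide value to compare against the GC content listed in the record. The value for the first 60 bases alone is not expected to match it.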
 
Protein sequence
MSSLSSKIQD VFNEPGCGKN QGKSEKERKK GCTKQLQPGG AAGGCAFDGA KIALQPITDV 
AHLVHGPIAC EGNSWDNRGS KSSGSNLWRT GFTTDINETD VVFGGEKRLY KSIKEIVEKY
DPPAVFVYQT CVPAMIGDDI EAVCKAASKK FGKPVIPVNS PGFVGPKNLG NKLAGEAILD
HVIGTEEPSY TTPYDINIIG EYNLSGELWQ VKPLLDELGI RILCCISGDA KYRDVACSHR
ARAAMMVCSK SMINIARKME ERYGIPFFEG SFYGIGDTSD ALREIARLLI ERGAPAELME
RTEALIAREE ARAWAAIEPY KKRLTGKKVL LITGGVKSWS VVAALQESGM EVVGTSVKKS
TKEDKERIKE IMGEDAHMID DMTPREMYKM LKEAKADIML SGGRSQFVAL KAKMPWLDIN
QERHHAYMGY VGMVELVKEI DKALFNPVWE QVRKRAPWEE TTWEERADAA LAAEAAALAA
DPELARAQRR AARICKCKAV DRGAIEDAIL AYGLESVEAV TERTHAGSGC TGCTGTIAGI
LDGIEDWRPA PSAEPAKRAA
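
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an assumption-laden illustration, not the database's own method. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons), which is adequate here because this gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard-code amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a (possibly block-formatted) DNA sequence up to the first stop."""
    seq = "".join(text.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGAGCAGCT TATCGAGCAA GATTCAGGAC GTCTTCAACG AGCCGGGTTG CGGCAAGAAC"
    )
    print(translate(first_line))
```

The first 60 bases translate to the first 20 residues of the listed protein (MSSLSSKIQD VFNEPGCGKN). A gene that begins with an alternative start codon such as GTG or TTG would need its first residue set to M separately, which this sketch does not do.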