Gene MCA0204 details


Gene Information

Locus tag: MCA0204
Symbol:
ID: 3103905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 209618
End bp: 211186
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 62%
IMG OID: 637169427
Product: nitrogenase cofactor biosynthesis protein NifB
Protein accession: YP_112740
Protein GI: 53802628
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR01290] nitrogenase cofactor biosynthesis protein NifB
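
The start, end, gene length and protein length listed above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon (the record does not state either convention explicitly).

```python
# Values copied from the record above.
START_BP = 209618
END_BP = 211186


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1569
print(protein_length(length_bp))  # 522
```

Under these assumptions the computed values agree with the Gene Length (1569 bp) and Protein Length (522 aa) fields.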


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.037885
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAACAG TGCAGAACCA TACGGGGGCG GAGTCCGCTG GCCTAAACGC CGCGGTGAAT 
GTCGACGAGA TCATGCAGAA GGTGGCCGAG CACAAAGGCT GCGGCACTTC CGGCGGTTCC
GGAAAAGCCA GCTGCGGTTC CGGCGCGGGC GCAAACGACC TGCCGCCGGA GATTTGGGAG
AAGGTGAAGA ACCACCCCTG CTACAGCGAG GAAGCCCATC ACCACTACGC TCGCATGCAC
GTGGCGGTGG CACCGGCCTG CAACATCCAG TGCAACTACT GTAACCGCAA GTACGACTGC
GCCAACGAAA GCCGGCCGGG GGTGGTGAGC GAGAAGCTCA CTCCGGAACA GGCGGCGAAG
AAGGTACTGG CGGTGGCCTC CACCATTCCG CAGATGACGG TGCTCGGTAT CGCCGGTCCC
GGCGATCCGC TGGCCAACCC GGAAAAGACC TTCAAGACCT TCGAACTGGT CGCCAAGCAT
GCGCCCGACA TCAAGCTCTG CGTCTCCACC AACGGTCTGG CGCTGCCCGA TCACGTGGAG
CGGCTGTCCC AGTACAACAT CGATCACGTG ACCATCACCA TCAACATGAT CGATCCGGAG
GTGGGCGCCA AGATCTATCC GTGGATCTAC TACAAGAAAA AACGCTACAC CGGCGTCGAG
GCCGCCAAGA TCCTCAGCGA TCGCCAGTTG CAGGGCCTGG AGATGCTGAC CGAGCGCGGC
ATCCTGTCGA AGATCAACTC GGTGATGATC CCCGGCATCA ATGACGAGCA TCTGGTCGAG
GTCAACAAGG CGGTGAAGTC GCGGGGGGCG TTTCTGCACA ACATCATGCC GCTGATCTCG
GCGCCGGAGC ACGGCACTGT ATTCGGTCTG ACCGGTCAGC GCGGCCCGAC GGCGCAGGAG
CTCAAGGCCT TGCAGGATAA GTGCGAAGGC GAAATGAACA TGATGCGCCA TTGCCGTCAG
TGCCGCGCCG ATGCGGTGGG CCTGTTGGGC GAGGACCGTA GCGCGGAGTT CACCACCGAC
AAGATCATGG CGATGGAGGT CAATTACGAT CTCGACGCCC GCAAGGCCTA TCAGGAAGCC
GTGGAAAAGG AACGCCAAGC GGTGGTGGCA GCCAAGCAGG AAGAACTGCA AACCCTGGCC
GGTGCGCATT CCGACATCAA GATGCTGATC GCCGTGGCGA CCAAGGGCGG CGGCAAGGTC
AACGAACACT TCGGCCATGC CAGCGAATTC CAGATCTATG AGCTGTCCAC TGCGGGCGCC
AAGTTCGTCG GACATCGTCG TGTGGATCTG TACTGCCAGG GCGGTTACGG CGAGGAAGAT
GCACTGGGCA CGGTGATCCG GGCCATCAAC GACTGCCACG CAGTGTTCGT GGCCAAGATC
GGCGGCTGTC CGAAGAGCGA CCTCAAGGCG GCGGGTATCG ATCCCGTGGA CCAGTATGCC
GGTCAGTTCA TCGAACAGTC GGCGATCGCC TACTTCAAGG AGTATCTCGA CAAGGTCGCG
TCCGGCGAAA TCGAGCACGT GGCCAAGGGT GATGCGGTGA TCCGCCAGGG CGCTTTGATC
GCGGCTTGA
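
A GC fraction like the one reported in the GC content field can be computed directly from a sequence laid out in 10-base blocks as above. This is a minimal sketch, not the database's own method: `gc_fraction` is a hypothetical helper, and only the first row of the gene sequence is used as sample input, so the printed value describes that row rather than the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above (60 bases), used as sample input.
first_row = (
    "ATGCAAACAG TGCAGAACCA TACGGGGGCG "
    "GAGTCCGCTG GCCTAAACGC CGCGGTGAAT"
)
print(f"{gc_fraction(first_row):.1%}")  # 60.0%
```

Passing the full 1569 bp sequence to the same function would give the gene-wide value to compare against the GC content field.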
 
Protein sequence
MQTVQNHTGA ESAGLNAAVN VDEIMQKVAE HKGCGTSGGS GKASCGSGAG ANDLPPEIWE 
KVKNHPCYSE EAHHHYARMH VAVAPACNIQ CNYCNRKYDC ANESRPGVVS EKLTPEQAAK
KVLAVASTIP QMTVLGIAGP GDPLANPEKT FKTFELVAKH APDIKLCVST NGLALPDHVE
RLSQYNIDHV TITINMIDPE VGAKIYPWIY YKKKRYTGVE AAKILSDRQL QGLEMLTERG
ILSKINSVMI PGINDEHLVE VNKAVKSRGA FLHNIMPLIS APEHGTVFGL TGQRGPTAQE
LKALQDKCEG EMNMMRHCRQ CRADAVGLLG EDRSAEFTTD KIMAMEVNYD LDARKAYQEA
VEKERQAVVA AKQEELQTLA GAHSDIKMLI AVATKGGGKV NEHFGHASEF QIYELSTAGA
KFVGHRRVDL YCQGGYGEED ALGTVIRAIN DCHAVFVAKI GGCPKSDLKA AGIDPVDQYA
GQFIEQSAIA YFKEYLDKVA SGEIEHVAKG DAVIRQGALI AA
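
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: `translate` and `CODON_TABLE` are hypothetical names, the codon-to-residue assignments are those of the standard genetic code (which translation table 11 shares for internal codons), and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above.
first_row = (
    "ATGCAAACAG TGCAGAACCA TACGGGGGCG "
    "GAGTCCGCTG GCCTAAACGC CGCGGTGAAT"
)
print(translate(first_row))    # MQTVQNHTGAESAGLNAAVN
print(translate("GCGGCTTGA"))  # AA
```

The two outputs match the first 20 residues and the final two residues of the protein sequence listed above.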