Gene Msil_3646 details

Gene Information

Locus tag: Msil_3646
Symbol:
ID: 7092919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4004474
End bp: 4006033
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 61%
IMG OID: 643466934
Product: nitrogenase cofactor biosynthesis protein NifB
Protein accession: YP_002363893
Protein GI: 217979746
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR01290] nitrogenase cofactor biosynthesis protein NifB
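
The gene and protein lengths in this record are consistent with the start and end coordinates. The Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Protein length for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Coordinates taken from the record above.
length = gene_length_bp(4004474, 4006033)
print(length, protein_length_aa(length))  # prints: 1560 519
```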


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.304877
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGAAG CCTATCAAAA CATTGAACAT GACGCCGGCG ACCCCGAGGC GCTCGACGAT 
CTGATGCAGA AGATCGCCGA CCACAAGGGA TGTGGAACGT CCGGCGGCAG CGGCAAATCG
AGCTGCGGCA CAGGAGCCGG GGCCAATGAC CTCGCCCCCG AAATCTGGGA AAAAGTCAAA
AACCATCCCT GCTACAGCGA AGAAGCCCAC CATCACTACG CGCGCATGCA TGTCGCCGTC
GCGCCGGCCT GTAATATTCA ATGCAACTAT TGCAACCGCA AATATGATTG CGCCAATGAA
TCGCGCCCCG GCGTCGTCAG CGAGAAGCTG ACGCCGGAAC AGGCGGCCAA GAAAGTGCTG
GCCGTCGCCT CGACGATCCC GCAGATGACG GTGCTCGGCA TCGCCGGCCC CGGCGATCCG
CTCGCCAATC CGGAAAAGAC GTTCAAGACC TTCGAACTGA TTTCCCGGAC CGCGCCCGAC
ATCAAGCTCT GCCTGTCGAC CAACGGCCTC GCCCTGCCCG ATCACGTCGA CACCATCGCC
AAATTCAACG TCGATCATGT GACCATCACG ATCAACATGA CCGACCCGGA AATCGGCGCC
AAGATCTACC CGTGGATCTT CTGGAAACAC AAGCGCATCA CCGGCTATGA GGCGGCCAAG
ATCCTGACCG ATCGCCAGCT GCAGGGCCTC GAAATGCTGA CGGAGCGGGG CATCCTCTGC
AAGATCAACT CGGTGATGAT CCCCGGCGTG AACGACAAGC ATCTCGTCGA AGTCAACCGG
GCGGTCAAAT CCCGCGGCGC ATTCCTGCAC AACATCATGC CGCTGATCTC CGCCCCGGAG
CACGGCACGG TATTCGGCCT CACCGGCCAG CGCGGCCCCT CGGCGCAGGA GCTGAAGGCG
CTGCAGGACA GCTGCGAAGG CGAAATGAAC ATGATGCGGC ATTGCCGCCA GTGCCGCGCC
GACGCCGTCG GACTGCTTGG CGAAGATCGC AGCGCCGAGT TCACCACCGA CAAGATCATG
GAGATGGAGG TCAACTACGA CCTTGAATCG CGCAAGGCCT ATCAGAGCAA GGTCGAGGAA
GAGCGCATCG CCAAGGTCGA AGCCAAGAAC GCCGAGCTCG AAACTCTGGC TGGCGCTTCC
AGCGACATCC AGATCCTGAT CGCCGTCGCG ACCAAGGGCA GCGGCCGCGT CAACGAGCAC
TTTGGCCACG CCAAAGAGTT CCAGGTCTAC GAACTCAGCA CCAAGGGCGC GAAATTCGTC
GGCCACCGCC GCGTCGATCT CTATTGCCAG GGCGGTTACG GCGAGGAAGA CGCGCTCGAG
ACGGTCATTC GCGCCATCAA TGATTGCGCG GCTGTCTTCG TCGCCAAGAT CGGCGGCTGC
CCGAAAGATT CGCTGCGTGC GGCCGGCATC GATCCGGTCG ATCAATACGC CTTCGAATTC
ATCGAGCAAT CGGCGATCGC CTACTTCAAG GATTACCTCG CCAAGGTGAA CAGCGGCGAG
ATCGCGCATG TCATCCGCGG CGACGCCGAC ATCCGCCAAG GCGCTTACGT GCCGGTCTGA
 
Protein sequence
MEEAYQNIEH DAGDPEALDD LMQKIADHKG CGTSGGSGKS SCGTGAGAND LAPEIWEKVK 
NHPCYSEEAH HHYARMHVAV APACNIQCNY CNRKYDCANE SRPGVVSEKL TPEQAAKKVL
AVASTIPQMT VLGIAGPGDP LANPEKTFKT FELISRTAPD IKLCLSTNGL ALPDHVDTIA
KFNVDHVTIT INMTDPEIGA KIYPWIFWKH KRITGYEAAK ILTDRQLQGL EMLTERGILC
KINSVMIPGV NDKHLVEVNR AVKSRGAFLH NIMPLISAPE HGTVFGLTGQ RGPSAQELKA
LQDSCEGEMN MMRHCRQCRA DAVGLLGEDR SAEFTTDKIM EMEVNYDLES RKAYQSKVEE
ERIAKVEAKN AELETLAGAS SDIQILIAVA TKGSGRVNEH FGHAKEFQVY ELSTKGAKFV
GHRRVDLYCQ GGYGEEDALE TVIRAINDCA AVFVAKIGGC PKDSLRAAGI DPVDQYAFEF
IEQSAIAYFK DYLAKVNSGE IAHVIRGDAD IRQGAYVPV
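
Both sequences above are printed in space-separated blocks of 10 characters. The Python sketch below shows one possible way to join such blocks and translate the gene sequence. It assumes the standard amino acid assignments, which translation table 11 shares with the standard genetic code (the two tables differ only in permitted start codons). The helper names `join_blocks` and `translate` are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record (60 nucleotides).
first_line = "ATGGAAGAAG CCTATCAAAA CATTGAACAT GACGCCGGCG ACCCCGAGGC GCTCGACGAT"
print(translate(join_blocks(first_line)))  # prints: MEEAYQNIEHDAGDPEALDD
```

The printed result matches the first 20 residues of the protein sequence listed in this record.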