Gene M446_3602 details


Gene Information

Locus tag: M446_3602
Symbol:
ID: 6132912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4016320
End bp: 4017816
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 68%
IMG OID: 641643769
Product: nitrogenase cofactor biosynthesis protein NifB
Protein accession: YP_001770417
Protein GI: 170741762
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR01290] nitrogenase cofactor biosynthesis protein NifB
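
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The record does not state either assumption explicitly, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4016320, 4017816)
print(length, protein_length_aa(length))  # prints "1497 498"
```

Under these assumptions the computed values agree with the Gene Length (1497 bp) and Protein Length (498 aa) fields.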


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.95628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.038494
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTCCC CGGTGTTCTC CCTCGAGGCC CTCGTCTCGG CACCGATCTC GGTCGACGCG 
ATGCGGGCGA AACTGCAGGA GGGCGGCTGC TCGTCCACGT CCTGCGGCTC CTCGGCCAAG
CCGGACGAGA TGCCCGAGGA CGTCTGGGAC AAGATCAAGG ATCATCCCTG CTACTCGGAA
GAGGCGCATC ACTACTTCGC CCGCATGCAC GTCGCGGTGG CGCCGGCCTG CAACATCCAG
TGCAATTACT GCAATCGCAA GTACGATTGC GCCAACGAGA GCCGGCCCGG CGTGGTGTCG
GAGAAGCTCA CGCCGGACCA GGCCCTGCGC AAGGTGGTGA CGGTGGCCAA CGAGGTGCCG
CAGCTGTCGG TGCTCGGCAT CGCCGGCCCC GGCGACAGCT GCTACGATTG GAAGAAGACC
AAGGCCACCT TCGACGCGGT GTCCCGGGAG ATCCCCGACA TCAAGCTCTG CCTCTCCACC
AACGGGCTCG CCCTGCCCGA CCACGTCGAC GAGATCGCCG AGATGAACGT CGATCACGTC
ACGGTGACCA TCAACATGGT CGATCCGGAG ATCGGGGCGA AGATCTATCC CTGGATCTTC
CACCAGCACA AGCGCTACAC GGGCGTCGAG GCCGCCCGCA TCCTGCACGA GCGGCAGATG
CTGAGCCTGG AGATGCTGCG CGACCGCGGC ATCCTGGTGA AGGTGAACTC GGTCATGATC
CCCGGGATCA ACGACGAGCA CCTGCTCGAC GTGAACGCGT GGGTGAAGGA GCGGGGCGCC
TTCCTGCACA ACGTGATGCC GCTGATCTCG GCGCCCGAGC ACGGCACGCA TTTCGGCCTG
ACCGGCCAGC GCGGCCCGAC CGCCATGGAG CTGAAGGCCC TGCAGGACAA GCTGGAGGGC
GGCGCCAAGC TGATGCGCCA CTGCCGCCAG TGCCGGGCCG ACGCGGTCGG GCTCCTCGGC
GAGGATCGCG GCCAGGAATT CACCCTCGAC CAGCTGCCCG CGCAGATCCG CTACGATCCG
AGCAAGCGCG CCGCCTACCG GGAGGTCGTG GCGCACGAGC GCGGCGACCA CCAGGTCAAC
AAGCAGGCCG TGGTGGCGCA GCTCAGGGCG CTCGGGGCGG AGCAGACGCT CCTCGTCGCG
GTCGCCACCA AGGGCGGCGG CCGGATCAAC GAGCATTTCG GCCACGCCAA GGAGTTCCAG
GTCTACGAGG CGGGCCCGGG CGGCGTGACC TTCGTCGGCC ACCGCAAGGT CGACTCCTAC
TGCCTGGGCG GATTCGGCGA GGACGCGACG CTCGACGGCG TAATCGCCGC GCTGGAGGGG
GTCGAGGTCG TCCTCTGCGC CAAGATCGGC GATTGCCCGA AGGAGAGCCT GGAGGCCGCC
GGCATCCGCG CCACCGACCG CTACGCCCTC GACTACATCG AGGCGGCGAT CGGCGCGGTC
TACGCGGAGC AGTTCGGCCG CCGCGTCACC GACGCGCCGC TGATGCTCTC GGCCTGA
 
Protein sequence
MSSPVFSLEA LVSAPISVDA MRAKLQEGGC SSTSCGSSAK PDEMPEDVWD KIKDHPCYSE 
EAHHYFARMH VAVAPACNIQ CNYCNRKYDC ANESRPGVVS EKLTPDQALR KVVTVANEVP
QLSVLGIAGP GDSCYDWKKT KATFDAVSRE IPDIKLCLST NGLALPDHVD EIAEMNVDHV
TVTINMVDPE IGAKIYPWIF HQHKRYTGVE AARILHERQM LSLEMLRDRG ILVKVNSVMI
PGINDEHLLD VNAWVKERGA FLHNVMPLIS APEHGTHFGL TGQRGPTAME LKALQDKLEG
GAKLMRHCRQ CRADAVGLLG EDRGQEFTLD QLPAQIRYDP SKRAAYREVV AHERGDHQVN
KQAVVAQLRA LGAEQTLLVA VATKGGGRIN EHFGHAKEFQ VYEAGPGGVT FVGHRKVDSY
CLGGFGEDAT LDGVIAALEG VEVVLCAKIG DCPKESLEAA GIRATDRYAL DYIEAAIGAV
YAEQFGRRVT DAPLMLSA
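
The sequences above are printed in space-separated blocks of 10 with 60 characters per row, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, as printed in the record.
first_row = "ATGTCCTCCC CGGTGTTCTC CCTCGAGGCC CTCGTCTCGG CACCGATCTC GGTCGACGCG"
dna = clean_sequence(first_row)
print(len(dna))  # prints 60
```

Passing the complete gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field in the record.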