Gene Msil_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2041 
Symbol 
ID7094239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2214156 
End bp2215682 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content64% 
IMG OID643465365 
Productlight-independent protochlorophyllide reductase subunit B 
Protein accessionYP_002362343 
Protein GI217978196 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01278] light-independent protochlorophyllide reductase, B subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.133297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTGA CGCTCTGGAC CTATGAAGGT CCTCCTCATG TCGGCGCCAT GCGCGTCGCC 
GCCGCGATGA AGGATGTGCA TTACGTGCTG CACGCGCCGC AGGGCGACAC CTATGCCGAT
CTGCTGTTCA CCATGATCGA GCGGCGCGAG AGCCGCCCGC CGGTGACCTA CACCACATTC
GAGGCGCGCG ACCTTGGCGG CGATACCGCC GCGCTGTTCC AGAGAACGGC GCAAGAGGCC
TATGAGCGTT TCAAGCCGAA GGCGCTGCTT GTCGGCTCCT CATGCACGGC CGAACTCATC
CAGGATGATC CGGGCGGGCT CGCCCGCGGG CTCGGCCTGC CGGTTCCGGT CATCCCGCTT
GAGTTTTCCG CCTATCAGCG CAAGGAAAAC TTCGGCGCTT CGCTCACCTT CTACAATCTC
GTGCGCGCCT TCGCCAAGGC GCTCGCCACG CCGCGCGCGA CGCGCTCGGC AGCAAGGCCA
AGCTGCAATC TGCTCGGCGC AACCGCGCTC GGATTCAGGC ATCGCGACGA TATCCGTGAA
GTGACCTTGC TGCTCGACCG GCTCGGCGTC GGGGTCAATA TCTGCGCGCC GCTTGGCGCT
TCGCCCGACG ATCTCGCGCG CCTGCCGGAG GCCGATTTCA ACATCGTGCT TTATCCCGAG
GTTGCGCTCG AGGCGGCTGA ATGGCTGAAG CGGACGCATC GCCAGCCTCT TGTCAAGACG
CAGCCGATCG GAGTTGGCGC GACCCACGCC TTTATCGAGG AGGTGGCGCG CCTTGCTGGT
CTCGATCCGC GCCCGCTTCT GGGACCAGCC GAGTCGCGCC TCGAATGGTA TTCGCGCTCT
GTCGATTCGA CTTATCTGAC CGGCAAGCGC GTGTTTATTT TCGGAGACGC CACCCATGCT
ATCGCGGCGG CGCGCGTCGC CTCGCGGGAG CTTGGATTTA CGGTCGCAGG GCTGGGTTCC
TACAGCCGCG AATTTGCCCG CGAGATGCGC GCGGCGGCGG CGCTCTATTC GCTCAACGCG
CTGATCACCG ACGACTATCT CGAAGTCGAG GCAGAGATCG AGAGACTGCA GCCGGAGCTG
GTGCTCGGCA CGCAGATGGA GCGCCATATC GCCAAGCGTC TTGGCATCCC CTGCGCGGTG
ATCTCCGCGC CGGTGCATGT GCAGGATTTT CCCGCGCGCC ATTCGCCGCA AATGGTCTTT
GAGGGCGCCA ATGTGATCTT CGACAGCTGG GTCCATCCTT TGATGATGGG CCTTGAGGAA
CATCTGTTGA CGATGTTTCG CGGAGACGAG GAATTTCACG ATGAGGCTGC GCCATCCCAT
CTCGGCGTTG CGGTTGCGAC TGCCCAGGCG ACTCACACGC CCGCAATTTT GGCGTGGGAC
GCGGGCGCCG AACGCGAGCT GAAATCCATT CCCTTTTTTG TGCGCGGAAA GGCCCGGCGC
AACACCGAGC GCTACGCGCA GGAGCGCGGT CTCCCGCTCA TCACCATCGA GACGCTTTAT
GATGCAAAAG CGCACTACAG CCGCTGA
 
Protein sequence
MQLTLWTYEG PPHVGAMRVA AAMKDVHYVL HAPQGDTYAD LLFTMIERRE SRPPVTYTTF 
EARDLGGDTA ALFQRTAQEA YERFKPKALL VGSSCTAELI QDDPGGLARG LGLPVPVIPL
EFSAYQRKEN FGASLTFYNL VRAFAKALAT PRATRSAARP SCNLLGATAL GFRHRDDIRE
VTLLLDRLGV GVNICAPLGA SPDDLARLPE ADFNIVLYPE VALEAAEWLK RTHRQPLVKT
QPIGVGATHA FIEEVARLAG LDPRPLLGPA ESRLEWYSRS VDSTYLTGKR VFIFGDATHA
IAAARVASRE LGFTVAGLGS YSREFAREMR AAAALYSLNA LITDDYLEVE AEIERLQPEL
VLGTQMERHI AKRLGIPCAV ISAPVHVQDF PARHSPQMVF EGANVIFDSW VHPLMMGLEE
HLLTMFRGDE EFHDEAAPSH LGVAVATAQA THTPAILAWD AGAERELKSI PFFVRGKARR
NTERYAQERG LPLITIETLY DAKAHYSR