Gene Msil_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2054 
Symbol 
ID7094252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2228760 
End bp2230217 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content64% 
IMG OID643465378 
Productchlorophyllide reductase subunit Z 
Protein accessionYP_002362356 
Protein GI217978209 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01278] light-independent protochlorophyllide reductase, B subunit
[TIGR02014] chlorophyllide reductase subunit Z 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0126422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGTTC TGGATCATGA TCGCGCCGGC GGCTATTGGG GCGCCGTCTA TGTGTTCAGC 
GCCATCAAGG GGCTGCAAGT CATCATCGAC GGTCCCGTCG GCTGCGAAAA CCTGCCGGTT
ACGTCGGTTT TGCATTACAC TGACGCCCTT CCGCCGCATG AATTACCGAT CGTCGTCACG
GGCCTCAGCG AGGATGAGCT CGGTCAGACC GGGACGGAAG GCGCCATGAA GCGCGCCCAT
CGCACGCTTG ATCCGGGTCT GCCCAGCGTC GTCGTCACCG GCTCCATCGC CGAGATGATC
GGCGGCGGCG TCACGCCGGA AGGCGCCAAC ATCCAACGCT TTCTGCCGCG CACGATCGAT
GAAGATCAGT GGCAATGCGC CGACCGCGCC ATGAACTGGC TGTGGACGGA GTATGGTCCG
AAGAAAGTCC CGCAACTAAA ACCGCGCAAA CCCGACCAAA AGCCTCGCGT CAACATCATC
GGGCCGTGCT ATGGCGTGTT CAACAGCCCG AGCGATCTTG CCGAAATCCG CCGGCTGGTC
GACGGCATAG GCGCCGAAAT CAACATGATC TTCCCCCTCG GCAGCCATCT CGCCGATGTG
GCGAGGCTAG CCGACGCCGA CGCCAATATA TGCCTCTACC GCGAATTCGG ACGGCTGCTC
TGCGAGGGGC TTGAGCGGCC CTATCTGCAG GCTCCGATCG GGCTGCATTC CACGACCAGC
TTTCTACGCA CGCTCGGCGG ACTGCTTGAG CTCGACCCTG AACCTTTCAT TGACCGCGAG
AAGCACACGA CGATCAAGCC GCTGTGGGAT TTGTGGCGCT CGGTCACGCA GGATTTCTTC
GCGACCGCGA GCTTCGCAAT CGTCGCCAAT GAAACCTATG CGCGGGGCGT TCGTCATTTT
CTTGAAGAGG AAATGGGACT GCCCTGCGCC TTCTCGATGT GCCGCAGGGC CGGCGTAAAG
CCCGACAACG CGGCGGTGCG CGAGGCCATC GGCAAGAAGG CGCCGCTGAT CGTATTCGGC
AGCTTCAATG AACGCATGTA TCTCGCCGAG ACCGGCGCCC GCGCGATCTA CATCCCGGCG
TCGTTCCCCG GCGCGATCAT CCGTCGCCAT ACCGGCACGC CTTTCATGGG CTATGGCGGG
GCGACCTACA TCGTCCAGGA GGTATGCAAC GCGCTGTTCG ACGCGCTGTT CAATATCATC
CCGCTCGCCG CCGACATGGA CCGAGTCGAG GCGACGCCGG CCCGCCTTGG ACTCGCGGCG
TCGACGCCCT GGGACGAGGC GGCGCATCGC CTTCTCGAAC AATATGTCGA AGCCGAGCCG
GTTCTCGTGC GCATCTCGGC GGCAAAGCGC CTGCGCGACC GCGCCGAACA GGAGGCCCGC
TCGGCCGGAG AAGCAAGCGT AACGGCCGAG CGCGTTAGCC GCGCGCGCGA TCAGATCGCG
CAAGGGAGGG CGGCATGA
 
Protein sequence
MLVLDHDRAG GYWGAVYVFS AIKGLQVIID GPVGCENLPV TSVLHYTDAL PPHELPIVVT 
GLSEDELGQT GTEGAMKRAH RTLDPGLPSV VVTGSIAEMI GGGVTPEGAN IQRFLPRTID
EDQWQCADRA MNWLWTEYGP KKVPQLKPRK PDQKPRVNII GPCYGVFNSP SDLAEIRRLV
DGIGAEINMI FPLGSHLADV ARLADADANI CLYREFGRLL CEGLERPYLQ APIGLHSTTS
FLRTLGGLLE LDPEPFIDRE KHTTIKPLWD LWRSVTQDFF ATASFAIVAN ETYARGVRHF
LEEEMGLPCA FSMCRRAGVK PDNAAVREAI GKKAPLIVFG SFNERMYLAE TGARAIYIPA
SFPGAIIRRH TGTPFMGYGG ATYIVQEVCN ALFDALFNII PLAADMDRVE ATPARLGLAA
STPWDEAAHR LLEQYVEAEP VLVRISAAKR LRDRAEQEAR SAGEASVTAE RVSRARDQIA
QGRAA