Gene Msil_2051 details

Gene Information

Locus tag: Msil_2051
Symbol:
ID: 7094249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2225177
End bp: 2226124
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 68%
IMG OID: 643465375
Product: chlorophyll synthesis pathway, BchC
Protein accession: YP_002362353
Protein GI: 217978206
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
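
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the gene ends with a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2225177
end_bp = 2226124
gene_length_bp = 948
protein_length_aa = 315

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which is not part of the protein.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span == gene_length_bp)              # coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions all three checks hold for this record: 2226124 - 2225177 + 1 = 948 bp, and 948 / 3 - 1 = 315 aa.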


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0614897
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACTC TCGCGGTCAT TCTCGAAGAG CCGGAACATC TCGTCCTCGG GCGGCTCGAT 
ATCGCCGAGC CTGGCGAGGA GGATGTCGTC GTCGACATTG AATGGAGCGG GATCAGCACC
GGCACCGAAC GGCTGCTCTA CACCGGCCGC ATGCCTGAAT TTCCCGGCAT GGGCTACCCT
CTCGTGCCCG GATATGAATC CGTCGGGCGC GTCGTCGCGG CGGGCCCTCG CTCGGGCGCC
ACGGCCGGAG CCCGCGTCTT CGTGCCGGGC GCGCGCTGCT TCGGGTCTGT GCGCGGCCTG
TTCGGCGGCG CAGCCGCGCG GGTGGTTCTC CCGGGCAAGC GCGCGACGCC GATCGGAGAG
GCGCTCGGCG AGCGCGGCGT GCTGCTCGCT CTGGCGGCGA CCGCCTATCA CGCCACGGCG
TCTGGCGACG GCGCCGAACA GCCGGACCTC ATCATCGGAC ATGGCGCGCT GGGGCGTATC
ATGGCTCGTC TTGCGCTCGC CGCGGGCGCC ATGCCGCCGC CGACCGTGTA CGAAACCAAC
CCTGCCCGGC GCGACGGAGC GTGCGGTTAC AGCGTGCTCG ATCCGGCCGA TGACGATCGT
CGCGACTATC AATGCATCTG CGACGTTAGC GGAGATCCCG CGATTCTGGA CAGCCTGATC
GCGAGGCTCG CCCCCGGCGG CGAGATCATT CTCGCGGGCT TTTATGAGGC TCCGCTATCA
TTCGCCTTTC CGCCCGCCTT CATGCGGGAG GCGCGCATCC GGGTCGCCGC GCAATGGCTG
CCGGCCGATC TTTGCGCGGT CCGCTCTCTG GCTGAATCCG GCGCGCTCGA TCTTGGCGGC
CTCATCACCC ATCGCCGTGC CCCCGACAAT GCGGGTGAAG CCTACCGGAC GGCTTTCGGC
GATCCCTCCT GCCTCAAAAT GGTCCTGGAC TGGAGACAAC ATTCATGA
 
Protein sequence
MDTLAVILEE PEHLVLGRLD IAEPGEEDVV VDIEWSGIST GTERLLYTGR MPEFPGMGYP 
LVPGYESVGR VVAAGPRSGA TAGARVFVPG ARCFGSVRGL FGGAAARVVL PGKRATPIGE
ALGERGVLLA LAATAYHATA SGDGAEQPDL IIGHGALGRI MARLALAAGA MPPPTVYETN
PARRDGACGY SVLDPADDDR RDYQCICDVS GDPAILDSLI ARLAPGGEII LAGFYEAPLS
FAFPPAFMRE ARIRVAAQWL PADLCAVRSL AESGALDLGG LITHRRAPDN AGEAYRTAFG
DPSCLKMVLD WRQHS
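
The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is an illustrative example, not the database's own tooling. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Assumption: this mapping is adequate for translation table 11 internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGGATACTC TCGCGGTCAT TCTCGAAGAG CCGGAACATC TCGTCCTCGG GCGGCTCGAT"
last_line = "GATCCCTCCT GCCTCAAAAT GGTCCTGGAC TGGAGACAAC ATTCATGA"

print(translate(first_line))  # MDTLAVILEEPEHLVLGRLD
print(translate(last_line))   # DPSCLKMVLDWRQHS
```

The two outputs match the start and end of the protein sequence listed above. The last line is in frame because the 15 preceding lines hold 900 bases, a multiple of 3. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.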