Gene Information |
Locus tag | Msil_2051 |
Symbol | |
ID | 7094249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2225177 |
End bp | 2226124 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643465375 |
Product | chlorophyll synthesis pathway, BchC |
Protein accession | YP_002362353 |
Protein GI | 217978206 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
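The coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is an untranslated stop codon. The record does not state either assumption; they are simply the ones under which the listed numbers agree. Variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2225177
end_bp = 2226124

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 948 0 315
```

Under these assumptions the result matches the table: 948 bp, a whole number of codons, and 315 aa.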
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0614897 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACTC TCGCGGTCAT TCTCGAAGAG CCGGAACATC TCGTCCTCGG GCGGCTCGAT ATCGCCGAGC CTGGCGAGGA GGATGTCGTC GTCGACATTG AATGGAGCGG GATCAGCACC GGCACCGAAC GGCTGCTCTA CACCGGCCGC ATGCCTGAAT TTCCCGGCAT GGGCTACCCT CTCGTGCCCG GATATGAATC CGTCGGGCGC GTCGTCGCGG CGGGCCCTCG CTCGGGCGCC ACGGCCGGAG CCCGCGTCTT CGTGCCGGGC GCGCGCTGCT TCGGGTCTGT GCGCGGCCTG TTCGGCGGCG CAGCCGCGCG GGTGGTTCTC CCGGGCAAGC GCGCGACGCC GATCGGAGAG GCGCTCGGCG AGCGCGGCGT GCTGCTCGCT CTGGCGGCGA CCGCCTATCA CGCCACGGCG TCTGGCGACG GCGCCGAACA GCCGGACCTC ATCATCGGAC ATGGCGCGCT GGGGCGTATC ATGGCTCGTC TTGCGCTCGC CGCGGGCGCC ATGCCGCCGC CGACCGTGTA CGAAACCAAC CCTGCCCGGC GCGACGGAGC GTGCGGTTAC AGCGTGCTCG ATCCGGCCGA TGACGATCGT CGCGACTATC AATGCATCTG CGACGTTAGC GGAGATCCCG CGATTCTGGA CAGCCTGATC GCGAGGCTCG CCCCCGGCGG CGAGATCATT CTCGCGGGCT TTTATGAGGC TCCGCTATCA TTCGCCTTTC CGCCCGCCTT CATGCGGGAG GCGCGCATCC GGGTCGCCGC GCAATGGCTG CCGGCCGATC TTTGCGCGGT CCGCTCTCTG GCTGAATCCG GCGCGCTCGA TCTTGGCGGC CTCATCACCC ATCGCCGTGC CCCCGACAAT GCGGGTGAAG CCTACCGGAC GGCTTTCGGC GATCCCTCCT GCCTCAAAAT GGTCCTGGAC TGGAGACAAC ATTCATGA
|
Protein sequence | MDTLAVILEE PEHLVLGRLD IAEPGEEDVV VDIEWSGIST GTERLLYTGR MPEFPGMGYP LVPGYESVGR VVAAGPRSGA TAGARVFVPG ARCFGSVRGL FGGAAARVVL PGKRATPIGE ALGERGVLLA LAATAYHATA SGDGAEQPDL IIGHGALGRI MARLALAAGA MPPPTVYETN PARRDGACGY SVLDPADDDR RDYQCICDVS GDPAILDSLI ARLAPGGEII LAGFYEAPLS FAFPPAFMRE ARIRVAAQWL PADLCAVRSL AESGALDLGG LITHRRAPDN AGEAYRTAFG DPSCLKMVLD WRQHS
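The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a string could be cleaned and its GC content computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the example runs only on the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (the 10-character grouping) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
fragment = clean_sequence("ATGGATACTC TCGCGGTCAT")
print(len(fragment), gc_fraction(fragment))  # 20 0.5
```

Applying the same two helpers to the full gene sequence gives a length and GC fraction that can be compared with the Gene Length and GC content rows of the table.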