Gene M446_3696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3696 
Symbol 
ID6135034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4122660 
End bp4123598 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content78% 
IMG OID641643867 
Productchlorophyll synthesis pathway, BchC 
Protein accessionYP_001770511 
Protein GI170741856 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.503769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00324331 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACGCGC TGGCCGTGAT CCTCGACGGG CCCGAACGCC TCGGCCTCGA CCGGCTCGCC 
CTCGACTCGC CCGGCCCGGC CGACGCCGTG GTCGAAGTGG CGTGGAGCGG CATCAGCACC
GGCACCGAGC GCCTGCTCTG GTCGGGCCGG ATGCCGGACT TCCCCGGCAT GGGCTATCCC
CTGGTCCCCG GCTACGAGTC GGTCGGGCGC GTGGTCGAGG CCGGCCCCGA ATCCGGCCGC
CGCCCCGGCG AGACCGTCTT CGTGCCCGGC GCCCGCTGCT TCGGCCCGGT GCGCGGCCTG
TTCGGCGGCG CGGCCTCGCA CCTCGTGGTC GAGGGCGCGC GCCTGTTGCC CATCGACGCG
GCGCTCGGCG AGCGCGGCAT CCTGCTGGCC CTCGCCGCGA CCGCCTACCA CGCCCTGGCG
GGGGCGGAGC GCGGCGAGCG CCACCTCGTC GTCGGGCACG GCGTGCTCGG GCGCCTCCTG
GCGCGCCTCG CCCGGCTCAG CGGTCAGGAC CCGGTGGTCT GGGAGCGCGA TCCGGCCCGC
CGCGGCGGCG AGCACGGCTA CCCGGTGCTC GACCCGGCCG CCGACGAGAC CGGGCGCTAC
GCCCGCATCA CCGACGCGAG CGGCGATGCC GGCCTCCTCG ACGGGCTGAT CGCCCGCCTC
GCGCCCGGGG GCGAGGTGGT GCTGGCGGGC TTCTACGAGG CGCCGCTCTC CTTCGCCTTC
CCGCCGGCCT TCCTGCGCGA GGCGCGCCTC CGGGTCTCGG CGCAGTGGCG GCCCGGCGAC
CTCGACGCGG TCGCGGGCCT CGTGCGCGCG GGCGCGCTCG GCCTCGACGG CCTGATCAGC
CACCGCCGGC CCGCCGCGCA GGCCGCGAGC GCCTACCGCA CGGCCCTCAC CGATCCCGCC
TGCCTCAAGA TGGTCCTCGA CTGGAGAGCC CGCTCATGA
 
Protein sequence
MHALAVILDG PERLGLDRLA LDSPGPADAV VEVAWSGIST GTERLLWSGR MPDFPGMGYP 
LVPGYESVGR VVEAGPESGR RPGETVFVPG ARCFGPVRGL FGGAASHLVV EGARLLPIDA
ALGERGILLA LAATAYHALA GAERGERHLV VGHGVLGRLL ARLARLSGQD PVVWERDPAR
RGGEHGYPVL DPAADETGRY ARITDASGDA GLLDGLIARL APGGEVVLAG FYEAPLSFAF
PPAFLREARL RVSAQWRPGD LDAVAGLVRA GALGLDGLIS HRRPAAQAAS AYRTALTDPA
CLKMVLDWRA RS