Gene M446_1056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1056 
Symbol 
ID6131467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1173474 
End bp1174472 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content74% 
IMG OID641641349 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001768021 
Protein GI170739366 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.843879 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACACA AGCGGGTGCG CGACCAAGTC ATCGTGATCA CGGGTGCGTC GAGCGGCATC 
GGCCTCGCCA CGGCGCGCAT GGCCGCGCGG CGGGGCGCCC GGGTGGTGCT CGCCGCGAGG
AGCGGGGACG TGCTGGAGGA GCTCGCGGCC GAACTCGGCG GAGCGGGCGG GCGGGCCCTC
GCGGTCGCCT GCGACGTCGG GCGCCAGGAG GACGTGGAGG CCCTCGCCGA CAGGGCGGTC
GCGACGTTCG GCGGCTTCGA CACCTGGGTC AACGTCGCCG GCCTGACCAC CTACGGCGCG
TTGCGGGACA TTCCCCTGGA GGATCACGAG CGCCTGGTCC GGACGAATTT CTGGGGCACC
GTGCACGGGT CGATGGCGGC GGTGGCGCAC CTGCGCCGGG GCGGCGGCGC GCTGATCAAC
GTCGCCAGCA TCGCCTCCGA CCTCGCCTTC CCGTTCCAGG GCCTCTACGC GGCCTCCAAG
CACGCGGTGA AGGGCTTCAC CGACACGCTG CGCATGGAGC TGATCGCGGA GGGCGCGCCG
GTGTCGGTCA CGCTGATCAA GCCCGCCTCC ATCGACACGC CGCTGCCGCA ACGGGCGCGC
AACACCATGG ACAGGGAGCC GATGCTGCCG CCCCCGGTCT ACAGGCCCGA GGAGGTGGCC
CACGCGATCC TGCACGCGGC CGTCCATGCG CCGCGCGACA TCTTCGTCGG CGGCGCCGGC
AAGCTCTTCG TGATGGGCAA GGAATTCGCG CCGGGGCTCT ACGACCAACT CGCGCCCGCC
ATCATCGCGC TCCAGAAGCG GGGGTCGCCG CCGCGCCACC CGGAGGGCGC CCTGTTCCGC
CCGCGGGAGG CCGGCCGGGT CCGCGGCGAC CAGCCCGGAT ACGTGCAGCG GACCAGCGCG
TACACGAGGG CCAGCCTGCA CCCGCTCGCC ACGGCCGCGG CCGGCCTGGG CCTCGCGGTG
GCCTCCGCCG CCTGGGCGAT GGGGAGCAGG CGCCGCTGA
 
Protein sequence
MRHKRVRDQV IVITGASSGI GLATARMAAR RGARVVLAAR SGDVLEELAA ELGGAGGRAL 
AVACDVGRQE DVEALADRAV ATFGGFDTWV NVAGLTTYGA LRDIPLEDHE RLVRTNFWGT
VHGSMAAVAH LRRGGGALIN VASIASDLAF PFQGLYAASK HAVKGFTDTL RMELIAEGAP
VSVTLIKPAS IDTPLPQRAR NTMDREPMLP PPVYRPEEVA HAILHAAVHA PRDIFVGGAG
KLFVMGKEFA PGLYDQLAPA IIALQKRGSP PRHPEGALFR PREAGRVRGD QPGYVQRTSA
YTRASLHPLA TAAAGLGLAV ASAAWAMGSR RR