Gene M446_2083 details


Gene Information

Locus tag: M446_2083
Symbol:
ID: 6134824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2329122
End bp: 2330120
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 78%
IMG OID: 641642312
Product: short chain dehydrogenase
Protein accession: YP_001768980
Protein GI: 170740325
COG category: [R] General function prediction only
COG ID: [COG0300] Short-chain dehydrogenases of various substrate specificities
TIGRFAM ID:
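
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2329122, 2330120)
print(length)                     # 999
print(protein_length_aa(length))  # 332
```

Both values agree with the Gene Length and Protein Length fields of the record.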


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.244735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0182364
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCCGC GCCGTCCCAG CGCCGTCGTG ACGGGCGGCA CCGCCGGGGT CGGGCGGGCG 
GTGGCGCTCG CCTTCGCCCG CCGGGGCTAC GACGTCGCCG TGCTGGCCCG CGGCCGGCGC
GGGATCGACG GCACCCTGGC GGAGCTGCGC CGGGCGGGCG CGCGGGCGCT CGGCTTCCAG
GCCGACGTGG CGGATGCGGG CGCGGTGCAG CGGGCCGCCG ACGCGGTCGC GGAGGCCTGG
GCGGGGATCG ACGTCTGGGT CAACAACGCG ATGGTGACCG CCTACGCGCC GGTGCGGCGG
CTGAGCCCGG ACGAGTTCCG GCAGGTCACG GCCGTGACCT ATCTGGGCCA GGTGCACGGC
ACGCTGGCGG CCCTGCGGCA CATGGCACCG GCCGACCGCG GCACGATCGT CTGCATCGGC
TCGGCGCTCG CCTACCGGTC GATCCCGCTC CAGGCGCCCT ACTGCGCCGC CAAGGCGGCG
GTGCGCGGCT TCGTCGATTC CCTGCGCTGC GAGATCCTGC ACGACGGCAG CCGGGTGCGG
CTCACCATGG TGCAGCTGCC GGCGGTCAAC ACGCCGCAAT TCGACTGGGC CCGCTCGGTC
CTGCCGCGCC GGCTCCAGCC GGTGCCGCCG ATCTACCAGC CCGAGGCGAT CGCCCGGCAC
GTCGTGCGGG CGGCGGAGGA GGCGCCGCGC GAGCTCTGGA TCGGTCCCCC GGCCTGGCAG
GCGATCCTCG GCACCCTGGT GGCGCCCGGC CTGCTCGACC GCTACCTCGC CACGGCCGCC
TACGAGGGCG AGATGACGCC CGAGCCGGCG GACCCGCACC GGCCGGACAA CCTGTTCGGG
CCGGTCGACA CGGATCCCGG GGCGCATGGC CGCTTCGACG GGCGGGCGCG GGCGAGCGTG
GTCGCGGCCG CCCCGAGCAC GCTGAAGGCC GGGCTGGCCC TCGGGCTCGG GCTGCTCGCC
GGCGGGGCGC TGCTCGCCGC GCGGCGGCCG AGGCGGTGA
 
Protein sequence
MSPRRPSAVV TGGTAGVGRA VALAFARRGY DVAVLARGRR GIDGTLAELR RAGARALGFQ 
ADVADAGAVQ RAADAVAEAW AGIDVWVNNA MVTAYAPVRR LSPDEFRQVT AVTYLGQVHG
TLAALRHMAP ADRGTIVCIG SALAYRSIPL QAPYCAAKAA VRGFVDSLRC EILHDGSRVR
LTMVQLPAVN TPQFDWARSV LPRRLQPVPP IYQPEAIARH VVRAAEEAPR ELWIGPPAWQ
AILGTLVAPG LLDRYLATAA YEGEMTPEPA DPHRPDNLFG PVDTDPGAHG RFDGRARASV
VAAAPSTLKA GLALGLGLLA GGALLAARRP RR
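
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a rough sketch rather than the method used to build the record: it hard-codes the amino acid assignments shared by NCBI translation tables 1 and 11, ignores the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here), and stops at the first stop codon. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the standard TCAG ordering; the amino acid
# assignments are the same for NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as laid out in the record.
print(translate("ATGAGCCCGC GCCGTCCCAG CGCCGTCGTG"))  # MSPRRPSAVV
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field of the record.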