Gene M446_2225 details


Gene Information

Locus tag: M446_2225
Symbol:
ID: 6129173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2480232
End bp: 2481137
Gene Length: 906 bp
Protein Length: 301 aa
Translation table: 11
GC content: 76%
IMG OID: 641642452
Product: HAD family hydrolase
Protein accession: YP_001769120
Protein GI: 170740465
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459
            [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
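
The reported lengths agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the 906 bp include the stop codon. Both readings are assumptions, not statements from the record. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2480232, 2481137)
print(length)                     # 906
print(protein_length_aa(length))  # 301
```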


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.202904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.234791
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACCC GCACCCCCAG GCCCGCCCCG CGCGAGGTCC CGGTCATCGA CGGCATCGCG 
GAGCTCGCCT CCGGCTTCGA CGTGATCCTC TGCGACGTCT GGGGCGTGCT GCACGACGGC
CTGCGGGCCC ACCGCTCCGC GAGCGAGGCG CTGTCCCGGT TCCGGGCGCT GCCGGGTGAG
CGCCCGCGCC GGGTCGTGCT GGTCTCGAAC GCGCCCCGTC CCGGCGAGGC CGTGAGGGCG
CAGCTCGACG GGTTCGGCGT CCCGCGCGAG GCCTATGACG GGATCGTCAC CTCCGGCGAC
CTGACGCGCG CCCTGATCGA GGCGCGGCCG GGGGCGCCGC TCTACCATCT CGGGCCCGAG
CGCGACCTGC CGATCTTCGA GGGGCTGTCG GTGCGCCGCG CCCCGCCCGA GGAGGCCGCG
CAGGTGGTCT GTACCGGGCT GTTCGACGAC GAGGTCGAGA CGGCCGAGGA TTACCGCCCG
GTCCTGGCGG GCCTCAGCGC CCGCGGCCTG CCGATGATCT GCGCCAATCC CGACCTCGTC
GTGGAGCGGG GGGCGCGGCT CATCCCCTGC GCGGGGGCGC TGGCGGGCCT CTACGAGGCG
CTCGGCGGGG AGGTGATCTA TGCCGGCAAG CCGCACCGGC CGGTCTACGA GGCCGCGCTG
GCGAAGGCCG CGGCCGTGGA CGGCGCGGCG CCGGCGGCCC CGGAGCGCGT CCTGGCGGTC
GGCGACGCGA TCCGCACCGA CATCGCCGGG GCGAGCGGGT TCGGCATCGC CTCGGTGCTG
GTGGCGCGCG GCATCCACGC GGAGGAGCTC GGCTGCCACG CCGGCGAGCC GGTCGGCGAG
ATCGCCCATT GGCTGGAGGG GCAACCCGTC CACCCGGACG CGGTGATCGA CCTGCTGCGC
TGGTGA
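
The GC content figure can be recomputed from the gene sequence once the spaces and line breaks that group the bases in tens are removed. The sketch below is illustrative only: the function names are hypothetical, and it uses just the first row of the sequence for brevity, so its result need not equal the 76% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for layout; upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied with its original grouping.
first_row = "ATGGCGACCC GCACCCCCAG GCCCGCCCCG CGCGAGGTCC CGGTCATCGA CGGCATCGCG"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_content(seq):.0%}")   # GC percentage of this row only
```

Pasting the whole gene sequence into `first_row` gives the value for the full gene.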
 
Protein sequence
MATRTPRPAP REVPVIDGIA ELASGFDVIL CDVWGVLHDG LRAHRSASEA LSRFRALPGE 
RPRRVVLVSN APRPGEAVRA QLDGFGVPRE AYDGIVTSGD LTRALIEARP GAPLYHLGPE
RDLPIFEGLS VRRAPPEEAA QVVCTGLFDD EVETAEDYRP VLAGLSARGL PMICANPDLV
VERGARLIPC AGALAGLYEA LGGEVIYAGK PHRPVYEAAL AKAAAVDGAA PAAPERVLAV
GDAIRTDIAG ASGFGIASVL VARGIHAEEL GCHAGEPVGE IAHWLEGQPV HPDAVIDLLR
W
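
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in the start codons it permits. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last rows of the gene sequence are translated.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumes the sequence starts in frame. Alternative start codons are not
    special-cased, which does not matter here because the gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = "ATGGCGACCC GCACCCCCAG GCCCGCCCCG CGCGAGGTCC CGGTCATCGA CGGCATCGCG"
print(translate(first_row))  # MATRTPRPAPREVPVIDGIA
print(translate("TGGTGA"))   # W (last residue, followed by the TGA stop codon)
```

The first call reproduces the first 20 residues of the protein sequence, and the second reproduces its final residue.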