Gene M446_1654 details

Gene Information

Locus tag: M446_1654
Symbol:
ID: 6129065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 1849789
End bp: 1850991
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 76%
IMG OID: 641641912
Product: imidazolonepropionase
Protein accession: YP_001768581
Protein GI: 170739926
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
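
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions, not statements from the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1849789
end_bp = 1850991
gene_length_bp = 1203
protein_length_aa = 400

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches "Gene Length"
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches "Protein Length"
```

Under these assumptions all three checks print True for the values listed here.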


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0408106
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0708418
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTGCG ACCGCCTGTG GCGCAACGCC CGGCTCGCCA CCCTGGCCGA GGGGGCGCCG 
GGGCTCGGCC TCGTGGAGGA CGGGCTGATC GCCGCCCGCG ACGGGCGCAT CCTCTACGCG
GGGCCGGCGC GGGCGGCGCC CGCCTTCGCG GCCCGCGAGA CGGTCGATTG CGAGGGGCGC
TGGATCACCC CGGGCCTGAT CGACTGCCAC ACCCACCTCG TCCACGGCGG CGACCGGGCG
GCGGAGTTCG AGGCGCGGCT CGCCGGCGCC AGCTACGAGG AGATCGCGCG GGCGGGCGGC
GGCATCGTCT CGACCGTGCG CGCCACCCGG GCGGCGAGCG AGGACGCGCT CGTCGGGAGC
GCGCTGCGGC GCCTCGACGC GCTGATCGCC GAGGGCGTGA CCGCGGTCGA GGTGAAGTCC
GGCTACGGCC TCTCCGTCGC CTCCGAGCGC GCGAGCCTGC GGGCGGCCCG CCGCCTCGGG
GAGAGCCGCG ACGTCACCGT GACCACGACC TTCCTGGGTG CCCACGCGCT GCCGCCGGAG
GAGCCCGACA AGGACCGCTA CATCGCGCAT GTCTGCACCG AGATGCTGCC CGCCCTGGCG
CGGGAGGGGC TGGCCGACGC GGTCGACGCC TTCTGCGAGG GGATCGCCTT CTCGCCCGCC
CAGACCGCGC GGGTCTTCGA GGCGGCGCGG GCGGCGGGCC TGCCGGTGAA GCTGCACGCC
GACCAGCTCT CCGATCTCGG CGGGGCGGCG CTGGCGGCGC GGTTCGGCGC CCTCTCGGCC
GACCACCTGG AATACGCGGA CGAGGCCGGC GCCGCCGCCC TGGCCCGGGC CGGCACCGTG
GCGGTGCTGC TGCCGGGGGC CTTCTACTTC ATCCGGGAGA CGCGGCGGCC GCCCGTCGAC
CTGTTCCGCC GCCACGGCAC GCGGATGGCG CTCGCCACCG ACTGCAATCC CGGCACCTCC
CCCCTCACCT CCCTGCTCCT CGTGCTCAAC ATGGGCGCGA CGCTGTTCCG GCTCACCGTC
GAGGAATGCC TCGCGGGCGT GACCCGGGAG GCGGCCCGCG CCCTCGGGCG CCTGCACGAG
ATCGGCACGC TGGAGGCGGG CAAGTGGTGC GACCTCGCGG TCTGGGACGT CGAGCGCCCG
GCCGAACTCG TCTACCGCAT GGGATTCAAC CCGCTGCACG CCCGCATCCG GAGGGGCCGA
TGA
 
Protein sequence
MLCDRLWRNA RLATLAEGAP GLGLVEDGLI AARDGRILYA GPARAAPAFA ARETVDCEGR 
WITPGLIDCH THLVHGGDRA AEFEARLAGA SYEEIARAGG GIVSTVRATR AASEDALVGS
ALRRLDALIA EGVTAVEVKS GYGLSVASER ASLRAARRLG ESRDVTVTTT FLGAHALPPE
EPDKDRYIAH VCTEMLPALA REGLADAVDA FCEGIAFSPA QTARVFEAAR AAGLPVKLHA
DQLSDLGGAA LAARFGALSA DHLEYADEAG AAALARAGTV AVLLPGAFYF IRETRRPPVD
LFRRHGTRMA LATDCNPGTS PLTSLLLVLN MGATLFRLTV EECLAGVTRE AARALGRLHE
IGTLEAGKWC DLAVWDVERP AELVYRMGFN PLHARIRRGR
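
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction that could be compared with the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any tool associated with this record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCTGTGCG ACCGCCTGTG GCGCAACGCC CGGCTCGCCA CCCTGGCCGA GGGGGCGCCG"
seq = clean_sequence(first_line)

print(len(seq))           # number of bases after joining the blocks
print(gc_fraction(seq))   # GC fraction of this first line only
```

To check the whole gene, the same two functions could be applied to the full "Gene sequence" text pasted in as a multi-line string.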