Gene Mext_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3752 
Symbol 
ID5832961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4158421 
End bp4159746 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content72% 
IMG OID641369542 
Productallantoate amidohydrolase 
Protein accessionYP_001641197 
Protein GI163853154 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.23358 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCC TTCCCGAGAC ATCGATGACA TCCATGCCCG ACATGATGAC CTTCGCCCCC 
GTGCGGATCG ATCCCGCCCG CCTCCAGGCG ATGATGGAGG CGGTCTCCGC CTTTGGCGCC
GGGCCGGACG GCGCCCTGAC CCGCCTGACC CTGTCGCCGG AGGACGGGCA GGCGCGCGAC
TGGCTCGCCG CGTGGTTTTC CGCGCACGGC TTCACCCCGC GGGTCGATGC GATCGGCAAC
CAGTTCGGCT GTCTGGAACT GGCCGGACCC GGCGCGCCCA CGGTGATGGT CGGCTCGCAT
CTCGACAGCC AGCCCAATGG CGGGCGCTTC GACGGCACGC TCGGCGTGCT CGCCGCCTGC
GAGGCCATCC TGTCCGTGCG CGCGGCGCTC GAAGTGGCGG GCAGGAGGTC GGCCTGCAAC
TTCACGGTCG CCAACTGGAC CAACGAGGAG GGCGCCCGCT TTCAGCCGAG CCTGCTCGGC
AGCAGCGTCT TCACCGGTGC GGCCGGGCTC GATTGGGCGC TGGCCCGCAG CGACGGCGAC
GGCGTCACTG TCGGCGAGGC CCTGTCGCGG ATCGGCTATG CCGGGAGCGA CGCCGTGGCG
GTGCCGGACG CCTTCATCGA GCTGCATATC GAGGGCGGGC CGATCCTGGA GCGCGAGGGC
CTGCGCTTCG GCGCCTTCAC CCGCTACTGG GGCGCCACCA AGTACCGCCT CGCCTTCCTC
GGGCGCCAAG CCCATACCGG CCCGACGCCG ATGGCCGAGC GGCGCGACGC ACTTCTCGGC
GCCGCCTACC TGATCGCCGA CCTCAAGGCG ATGACGGCCG ATTACGGCCT CGACCTGCAC
ACCTCCGTCG GCCGGCTCGA AGTGCGGCCG AACTCGCCCA ATACCGTGCC GAGCGAAGCG
GTTCTGTTCA TCGAGCTGCG CTCCGGCTCG CCCGCGATCC TCGAGGAGGC CGAACTCCGG
CTGAAGGCGG CTATCGATCT GGCCGCCGCG CGTGCGGAGG TGGGTCACGA GGTACGCGCC
ATCGACCGGC GCGCCGCCGG CCCGATGGCG CCGGGCCTCG TGCGGCTCGC CGAGCGCGCA
GGTACGGCCA ACGGCACGAC GACCCGCCAC CTCGACACGA TCGGCGGCCA CGACGCTGTC
AGCCTCAGCG CCGTCTGCCC CTCGGTGGTG CTGGCCGTGC CCTGCCGCGG CGGCGTGATG
CACCACCCGA CCGAGTTCAC GAGCCCCGAG GATCAGGCCT TCGGCACGCA GGTGCTGGCC
GACATGCTGA TGATCCTCGC CACCGAGGGC ATGGCCGCCC TCGAGACCGC GGGAGGGGAC
CGGTGA
 
Protein sequence
MPPLPETSMT SMPDMMTFAP VRIDPARLQA MMEAVSAFGA GPDGALTRLT LSPEDGQARD 
WLAAWFSAHG FTPRVDAIGN QFGCLELAGP GAPTVMVGSH LDSQPNGGRF DGTLGVLAAC
EAILSVRAAL EVAGRRSACN FTVANWTNEE GARFQPSLLG SSVFTGAAGL DWALARSDGD
GVTVGEALSR IGYAGSDAVA VPDAFIELHI EGGPILEREG LRFGAFTRYW GATKYRLAFL
GRQAHTGPTP MAERRDALLG AAYLIADLKA MTADYGLDLH TSVGRLEVRP NSPNTVPSEA
VLFIELRSGS PAILEEAELR LKAAIDLAAA RAEVGHEVRA IDRRAAGPMA PGLVRLAERA
GTANGTTTRH LDTIGGHDAV SLSAVCPSVV LAVPCRGGVM HHPTEFTSPE DQAFGTQVLA
DMLMILATEG MAALETAGGD R