Gene Mchl_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4047 
Symbol 
ID7118052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4262788 
End bp4264089 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content73% 
IMG OID643526766 
Productallantoate amidohydrolase 
Protein accessionYP_002422775 
Protein GI218531959 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.152103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.439433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCA TGCCCGACAT GATGGCCTTC GCCCCCGTGC GGATCGACCC CGCCCGCCTC 
CAGGCGATGA TGGAGGCTGT CTCCGCCTTC GGCGCCGGGC CGGAGGGCGC CCTGACCCGC
CTGACCCTGT CGCCGGAGGA CGGGCAGGCG CGCGACTGGC TCGCCGCGTG GTTTTCCGCG
CACGGCTTCA CCCCGCGGGT CGATGCGATC GGCAATCAGT TCGGCTGTCT GGAACTGGCT
GGCCCCAGCG CGCCCACGGT GATGGTCGGC TCGCATCTCG ACAGCCAGCC CAATGGCGGG
CGCTTCGACG GCACGCTCGG CGTGCTCGCC GCCTGCGAGG CGATTCTGTC CGTGCGCGCG
GCGCTCGAAG CGGCGGGCAG GATGTCGGCC TGCAACTTCA CGGTCGCCAA CTGGACCAAC
GAGGAGGGCG CCCGCTTTCA GCCGAGCCTG CTCGGCAGCA GCGTCTTCAC CGGTGCGGCC
GGGCTCGACT GGGCGCTGGC CCGCAGCGAC GGCGACGGCG TCACCGTCGG CGAGGCCCTG
TCGCGGATCG GCTATGCCGG GAGCGACGCC GTGCCGGTGC CGGACGCCTT CATCGAGCTG
CATATCGAAG GCGGGCCGAT CCTGGAGCGC GAGGGCCTGC GCTTCGGCGC CTTCACCCGC
TACTGGGGCG CCACCAAGTA CCGCCTCGCC TTCCTCGGAC GCCAGGCCCA TACCGGCCCG
ACGCCGATGG CCGAGCGGCG CGACGCGCTT CTCGGCGCCG CCTACCTGAT CGCCGACCTC
AAGGCCATGA CGGCCGATTA CGGCCTCGAC CTGCACACCT CCGTCGGCCG GCTCGAAGTG
CGGCCGAACT CGCCCAACAC CGTGCCGAGC GAGGCGGTTC TGTTCATCGA GCTGCGCTCC
GGCTCGCCCG CGATCTTGGA GGAGGCCGAA CTCCGGCTGA AGGCGGCCAT CGATCTGGCT
GCCGCGCGCG CGGAGGTGGG TCACGAGGTG CGCGCCATCG ACCGGCGCGC CGCCGGCCCG
ATGGCGCCAG GCCTCGTGCG GCTTGCCGAG CGCGCGGGCA CGGCCAACGG CACGACGACG
CGCCACCTCG ACACGATCGG CGGCCACGAC GCCGTCAGCC TCAGCGCGGT CTGCCCCTCG
GTGGTGCTGG CCGTGCCCTG CCGCGGCGGC GTGATGCACC ACCCGACCGA GTTCACGAGC
CCGGAGGATC AGGCCTTCGG TACCCAGGTG CTGGCCGACA TGCTGATGAC CCTCGCCACC
GAGGGCATGG CCGCCCTCGA GACCGCGGGA GGGGACCGGT GA
 
Protein sequence
MTSMPDMMAF APVRIDPARL QAMMEAVSAF GAGPEGALTR LTLSPEDGQA RDWLAAWFSA 
HGFTPRVDAI GNQFGCLELA GPSAPTVMVG SHLDSQPNGG RFDGTLGVLA ACEAILSVRA
ALEAAGRMSA CNFTVANWTN EEGARFQPSL LGSSVFTGAA GLDWALARSD GDGVTVGEAL
SRIGYAGSDA VPVPDAFIEL HIEGGPILER EGLRFGAFTR YWGATKYRLA FLGRQAHTGP
TPMAERRDAL LGAAYLIADL KAMTADYGLD LHTSVGRLEV RPNSPNTVPS EAVLFIELRS
GSPAILEEAE LRLKAAIDLA AARAEVGHEV RAIDRRAAGP MAPGLVRLAE RAGTANGTTT
RHLDTIGGHD AVSLSAVCPS VVLAVPCRGG VMHHPTEFTS PEDQAFGTQV LADMLMTLAT
EGMAALETAG GDR