Gene M446_2533 details

Gene Information

Locus tag: M446_2533
Symbol:
ID: 6134689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2808341
End bp: 2809651
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 73%
IMG OID: 641642745
Product: fumarylacetoacetase
Protein accession: YP_001769410
Protein GI: 170740755
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
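
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 2808341
END_BP = 2809651


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1311 436
```

Under these assumptions the result agrees with the listed Gene Length (1311 bp) and Protein Length (436 aa).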


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACA TCGACGCCAC CCACGCCGCG ACCTTGCGCT CCTGGGTGCC CGGCGCGAAC 
GGCCACTCGG ACTTTCCGAT CCAGAACCTG CCGCTCGGGG TGTTCTCGCC GGGTGACGGG
ACGCCGCGGG CGGGCGTCGC GATCGGCGGG CGCATCCTCG ACTTGCCGGC GCTGCTTGCC
GCGAACCTCC TCTCGGGCGA GGCCGCGCTC GCCGCCGAGG CCGCGGGCGG GACGACGCTC
AACAGGCTCC TTGCCTTGGG GGCGGGGCCG CGCCGGGCGC TGCGGGCGCG CCTGTCAGCC
CTGTTCGCGG AGGGCTCGCC GGATCGCGAC CGGGTCGCGC CCCTGCTGCA TGAGGCCTCG
TCCTGCCGGC TCCACCTGCC GGCCGCGATC GGCGACTACA CCGACTTCTA CGTCGGTATC
CACCACGCGG AGAATATCGG CCGGCAATTC CGGCCGGACA ACCCGCTCCT GCCGAACTAC
AAGCACGTGC CGATCGGCTA CCACGGCCGC GCCTCCTCGA TCCGGCCCTC GGGCACCCCG
GTGCGGCGCC CGCGCGGGCA GTCAAAGCCG CCGGAGGCGG GCGACCCGGT CTTCGGCCCC
TCGCGGCGAC TCGACTACGA ACTCGAACTC GGGGTGTGGA TCGGCCCCGG CAATACTCTC
GGCGAGCCGA TCGCGATCGG CGACGCGCAC GCGCACATCG CGGGCGTGTG CCTCCTCAAT
GATTGGTCGG CGCGGGACAT CCAGGCGTGG GAGTACCAGC CGCTGGGACC GTTCCTCGCC
AAGAACTTCG CCACGACGAT CTCGCCCTGG ATCGTCACGG CGGAGGCGCT CGCACCCTTC
CGGATCGCGC AGAGCCCCCG GCCGGAGGGC GATCCGCGGC CGCTGCCCTA CCTAACCGAC
GAGGTCGACC AGCGAAGGGG CGCCTTCGAC CTCCGGCTCG AGGTGCTGCT GCTGACGCCC
GGCCTGCGAG CGGCGGGCCT CGGCCCCCAC CGGATTTCGG CCTCGAACAC GCGGCACATG
TACTGGACCG TGGCGCAGAT GGTGGCCCAC CACACCGGCG GCGGCTGCAA CCTGCAGCCG
GGCGACCTGC TCGGGACGGG CACGATCTCC GGCCCGGACC GCGACGCCTG CGGCAGCCTC
CTCGAAGCGA CCCTCGGTGG CCGGGAGCCG CTCCGGCTCG CGTCGGGCGA GGAGCGCCGG
TTTCTGGAGG ACGGCGACGA GGTGATCCTG CGGGCACGCG GCGTCCGCGA CACCTTCGCG
CCGATCGGCT TCGGCGAGTG CCGGGCGGAG CTCCTTGGAG CGGCGCCCTG A
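
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. As a rough sketch, the GC percentage of a sequence in this layout could be computed as follows. The helper names are hypothetical, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed above.
first_line = clean_sequence(
    "ATGGCCGACA TCGACGCCAC CCACGCCGCG ACCTTGCGCT CCTGGGTGCC CGGCGCGAAC"
)
print(len(first_line), round(100 * gc_fraction(first_line)))
```

Passing the full 1311 bp sequence to the same two functions gives a value that can be compared with the GC content field of the record.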
 
Protein sequence
MADIDATHAA TLRSWVPGAN GHSDFPIQNL PLGVFSPGDG TPRAGVAIGG RILDLPALLA 
ANLLSGEAAL AAEAAGGTTL NRLLALGAGP RRALRARLSA LFAEGSPDRD RVAPLLHEAS
SCRLHLPAAI GDYTDFYVGI HHAENIGRQF RPDNPLLPNY KHVPIGYHGR ASSIRPSGTP
VRRPRGQSKP PEAGDPVFGP SRRLDYELEL GVWIGPGNTL GEPIAIGDAH AHIAGVCLLN
DWSARDIQAW EYQPLGPFLA KNFATTISPW IVTAEALAPF RIAQSPRPEG DPRPLPYLTD
EVDQRRGAFD LRLEVLLLTP GLRAAGLGPH RISASNTRHM YWTVAQMVAH HTGGGCNLQP
GDLLGTGTIS GPDRDACGSL LEATLGGREP LRLASGEERR FLEDGDEVIL RARGVRDTFA
PIGFGECRAE LLGAAP
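
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard amino acid assignments, which translation table 11 shares with the standard genetic code (the two tables differ in their permitted start codons). The `translate` helper is illustrative only; it stops at the first stop codon and raises `KeyError` on any codon containing characters other than A, C, G, or T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein.
gene_start = "ATGGCCGACA TCGACGCCAC CCACGCCGCG ACCTTGCGCT CCTGGGTGCC CGGCGCGAAC"
protein_start = "MADIDATHAA TLRSWVPGAN"
print(translate(gene_start) == protein_start.replace(" ", ""))  # True
```

The same function can be applied to the full gene sequence and the result compared with the 436-residue protein sequence listed above.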