Gene Achl_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3549 
Symbol 
ID7295030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3935477 
End bp3936472 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content68% 
IMG OID643591955 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_002489594 
Protein GI220914285 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAAGA TCGCCAGATG GAACCACGAC GGCGGGACGC AATCCGGCTT TGTCAGCGGC 
GGTGCCTGCC ACGCGTTGCC TGCGGGCCAG GACGTGCAAA CCCTGCTGGA CGCCGGCCTC
GAGGAGACGC TGGCCATTGC CCGGCGGACC ATCGGTTCCG GCGCGGCGGT TCCGCTGGCG
GACGTGCAGC TGCTCGCCCC GCTGGCGCCG GCCACCATCC GCGACTTCGT GGCGTTCGAG
GAACACGTTG AGGGCGTCCG GAAGAGCATC GACGGCGTCG CCGGCGTGGT GCCCGAATGG
TACGAGGCGC CCACGTTCTA CTTCACCAAC CCGCACACCG TGACCGGCAC GGGCGAGCTG
ATTGGGATCC CAGCCGGGTG CGTGGACCTG GACTTCGAGA CCGAGGTGGC AGCCGTCGTC
GGGCGCGTTC CCGGCAGCGA CGGCCGGAAC CTGGACACGG AGGCGGCGCA CCGGCACATC
TTCGGCTACA CCGTCCTCAA CGACTGGTCC GCCCGGGACC TGCAGCGGCG CGAAATGAAG
GTCAGCCTGG GACCGTGCAA AGGCAAGGAT TTCTCCAACA CCCTGGGCCC CTGGATCGTC
ACCGCGGACG AGTTTGAGGA CCGGCACGAC GCGGAGGGGT TCCTGCCCAT CTCCATGTCC
GTGGAGGTCA ACGGCGTACA GATCGGCCAG GACCTGCTCT CCAACATGGG CTGGCCGTTC
GCCGAACTCG TGGCCTACGC GTCGCAGGAT TCGGTAATCC GGCCGGGCGA TGTACTGGGA
TCCGGCACGT GTGGCAGCGG CTGCCTCGCC GAACTCTGGG GCCGAAACGG CGCCCAGACT
CCCCCGCCGC TGGCAACCGG CGACGTGGTG CGCATGACCG TTGAAGGCAT CGGAACCATC
GAAAACACCG TGGGCGATCG CCGCGAAGCC CTGACTCGGG TCCCCGCCCG GACTCGCCCC
CGGAACCGGG TTGCCGCAGT GCTTCCGGCT ACCTGA
 
Protein sequence
MVKIARWNHD GGTQSGFVSG GACHALPAGQ DVQTLLDAGL EETLAIARRT IGSGAAVPLA 
DVQLLAPLAP ATIRDFVAFE EHVEGVRKSI DGVAGVVPEW YEAPTFYFTN PHTVTGTGEL
IGIPAGCVDL DFETEVAAVV GRVPGSDGRN LDTEAAHRHI FGYTVLNDWS ARDLQRREMK
VSLGPCKGKD FSNTLGPWIV TADEFEDRHD AEGFLPISMS VEVNGVQIGQ DLLSNMGWPF
AELVAYASQD SVIRPGDVLG SGTCGSGCLA ELWGRNGAQT PPPLATGDVV RMTVEGIGTI
ENTVGDRREA LTRVPARTRP RNRVAAVLPA T