Gene Achl_3053 details

Gene Information

Locus tag: Achl_3053
Symbol:
ID: 7294533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 3394903
End bp: 3397011
Gene Length: 2109 bp
Protein Length: 702 aa
Translation table: 11
GC content: 68%
IMG OID: 643591463
Product: bifunctional aldehyde dehydrogenase/enoyl-CoA hydratase
Protein accession: YP_002489103
Protein GI: 220913794
COG category: [C] Energy production and conversion; [I] Lipid transport and metabolism
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases; [COG2030] Acyl dehydratase
TIGRFAM ID: [TIGR02278] phenylacetic acid degradation protein paaN
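
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the record states the gene is not spliced). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(3394903, 3397011)
print(length)                     # 2109
print(protein_length_aa(length))  # 702
```

Under these assumptions the computed values agree with the listed gene length (2109 bp) and protein length (702 aa).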


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCA CTGCCACAGC GCCCCACGCC ACGCTCGATA CCGTGGAGAC GGTGCCCAGC 
TTCATTATGG ACTCCTGGTG GACGCCCGAA ACCGGCTCCG CCGCTGCCAC CCCGGTCCGC
GACGCCAGCA CGGGCGAGCT CCTGGCGAAG GTGAGCACCG AGGGACTTGA CCTGGCGGCG
GCCGTCGAAT TCGGCCGGAC CACCGGCCAG GCCGAGCTGG GCAAGCTGAC CTTCCACCAG
CGCGCCCTCA AGCTCAAGGA GCTGGCGCAG TACCTGAACG CCCGGCGTGA ACACTTCTAT
GGCTTCTCCT ACCAGACCGG CGCAACCAAG ATCGATTCGA TGGTGGACAT CGACGGCGGC
ATCGGCGTGC TCTTCACCTT CGGGTCCAAA GGCCGGCGCG AACTGCCCAA CTCGCAGGTG
GTGGTGGACG GGCCCATGGA GGTGCTGTCC AAGGACGGGT CCTTCGCCGG TGAACACATC
TACACCCGCA TTCCCGGCGT CGCAGTGCAG ATCAACGCCT TCAACTTCCC GGTGTGGGGC
ATGCTGGAGA AGTTTGCCCC CGCGTTCATC GCCGGTGTGC CCACCATCGT CAAGCCCGCC
ACCCCCACCG GCTACGTTGC CGCTGCCGTG GTGAAGGCCA TTGTGGAGTC CAACATCTTG
CCCAAGGGAT CGCTGCAGCT GGTCTCCGGT TCGGTCCGCG GGCTGCTGGA CGTGCTGGAC
TACCGGGACC TGGTGTCCTT CACCGGATCC GCCGCCACCG CCAAGTCCCT GAAGGCGCAC
CCGAACGTGG TGGAAGGGGG CGTGAGGTTC ACCTCCGAAA CCGACTCCCT CAACGCCGCC
ATCCTCGGCC CCGACGCGGT GGAAGGCACT CCGGAATTCG AGGCCTTCGT CAAGTCCGTG
GTCACCGAGA TGACCGTCAA AGCGGGCCAG AAGTGCACCG CCATCCGCCG CGCCATCGTC
CCGCAGGAGC TGGTGCCCGC GGTGTCCGCC GCCATCGGCA AGCGCATCCA GGAGCGCGTG
GTGGTGGGCG ATCCGCGCGC CGAAGGCGTC ACCATGGGCG CGCTGGCGTC CGTGGAACAG
CTGGAGGACG TCCGGGCGGC CGTGCAGTCC ATGCTCGACG CCGGCGGTGA GCTTGCGTAC
GGAACCCTCG ATTCGCCGTC GGTCACCTCC GCCGACGGCA CCACCGGCGT GGTGGACGCC
GGCGCGTTCA TGGCCCCCGT GGTGCTCAAC TGGGGCAACC CTGAAGCCGA GGCGCTGCAC
TCGCTGGAAG CCTTCGGCCC GGTTTCCTCC GTGGTGGGTT ACGCGGACCT TGCCGACGCC
GTCCGGCTTG CCGCCCGCGG TGGCGGTTCG CTGGTGGCCT CAGTGTGCAC CAACGATCCC
GACGTGGCGC GTGAACTGGT GACCGGCATT GCCGCGCACC ACGGCCGCGT CCTTATGCTG
AACAGGGAAG ACGCCCGCAC CTCAACCGGC CATGGCTCAC CCGTGCCGCA CCTGGTCCAC
GGCGGCCCGG GCCGCGCCGG CGGCGGCGAG GAACTGGGCG GCATCCGCTC CGTCCTGCAC
CACATGCAGC GCACCGCCAT CCAGGGCTCG CCCAACATGC TTACTGCCGT CACCGGCGTC
TGGCATACCG GGGCGGACCG CAACTTCACG GTGGACACCG AGGGGACGCA CCCGTTCCGC
AAGCACCTGG AGACCCTGCG CATCGGTGAC GCCGTCCGCT CGGACCTGCG GCAGGTCACC
CTGGAGGACA TCACGGCGTT CGCCAACACC ACCGGGGACA CCTTCTACGC CCACACCAAC
CAGGAAGCTG CCGAGGCCAA CCCGTTCTTC CCGGGCATCG TGGCGCACGG CTACCTCCTG
CTGGCCTGGG GTGCGGGGCT GTTCGTGGAG CCCGCGCCGG GCCCTGTCCT GGCCAACTAC
GGCCTCGAGA ACCTGCGCTT CATCACGCCT GTTGCCGCCG GCGACTCCAT CCGGGTGACC
CTCACTGCCA AGAAGATCAC CCCGCGTGAG ACCGACGAAT ACGGCGAGGT GGCCTGGGAC
GCCGTCCTCA CCAACCAAGA CGACGAGATC GTGGCCACCT ACGACGTCCT CACCCTCGTC
GAAAAGTAA
 
Protein sequence
MTTTATAPHA TLDTVETVPS FIMDSWWTPE TGSAAATPVR DASTGELLAK VSTEGLDLAA 
AVEFGRTTGQ AELGKLTFHQ RALKLKELAQ YLNARREHFY GFSYQTGATK IDSMVDIDGG
IGVLFTFGSK GRRELPNSQV VVDGPMEVLS KDGSFAGEHI YTRIPGVAVQ INAFNFPVWG
MLEKFAPAFI AGVPTIVKPA TPTGYVAAAV VKAIVESNIL PKGSLQLVSG SVRGLLDVLD
YRDLVSFTGS AATAKSLKAH PNVVEGGVRF TSETDSLNAA ILGPDAVEGT PEFEAFVKSV
VTEMTVKAGQ KCTAIRRAIV PQELVPAVSA AIGKRIQERV VVGDPRAEGV TMGALASVEQ
LEDVRAAVQS MLDAGGELAY GTLDSPSVTS ADGTTGVVDA GAFMAPVVLN WGNPEAEALH
SLEAFGPVSS VVGYADLADA VRLAARGGGS LVASVCTNDP DVARELVTGI AAHHGRVLML
NREDARTSTG HGSPVPHLVH GGPGRAGGGE ELGGIRSVLH HMQRTAIQGS PNMLTAVTGV
WHTGADRNFT VDTEGTHPFR KHLETLRIGD AVRSDLRQVT LEDITAFANT TGDTFYAHTN
QEAAEANPFF PGIVAHGYLL LAWGAGLFVE PAPGPVLANY GLENLRFITP VAAGDSIRVT
LTAKKITPRE TDEYGEVAWD AVLTNQDDEI VATYDVLTLV EK
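
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first line of the gene sequence is used here, so the result will not necessarily match the 68% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCACCA CTGCCACAGC GCCCCACGCC ACGCTCGATA CCGTGGAGAC GGTGCCCAGC"
seq = clean_sequence(first_line)

print(len(seq))                # 60
print(seq.startswith("ATG"))   # True
print(f"{gc_fraction(seq):.1%}")
```

Pasting the whole gene sequence into `clean_sequence` in the same way should give a string of 2109 bases if the listed gene length is correct.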