Gene Achl_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_0139 
Symbol 
ID7291565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp151164 
End bp152519 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content62% 
IMG OID643588538 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002486231 
Protein GI220910922 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones97 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAT TTGCCGTTAT TGATCCGGCT ACAGGAACCA TCCACGCCGA GTATCCCGCA 
GCCACTGACG CAGAGGTGGA AGCCGGACTC GCTGCAGCGC AGAACACCTA CCATGAATGG
TCCCGCACCA CTACGGTGGC CGAACGCGCG GGCATGGCCA GGCGCCTTGC CGAGCTGTTC
GTTGAACGCA AGGACAAGCT GGCCGCCATC ATCAACCGGG AAATGGGTAA GCCCCTCCAG
CAAGCGGCCG GCGAAGCAGA GTTCTCGGGC TCCATCGCTT CGGCCTTCGC CGAAAACGCC
GAGGAGTGGC TGGCAGATGA GCAACTCGAG GTCGCTGACG GTTTGCGGAG CTTCTTCCGC
TACCAGGGCC TCGGCGTGAT CCTGGGCATC ATGCCGTGGA ACTACCCCTA CTATCAGGTG
GCACGGTTCG CGATTCCCAA CATCATCCTT GGCAACACCG TCATTGTGCG GCACGCCAGC
CAGTGCCCGG AATCGGCGTT GGCCCTGGAG GAACTCTTCC GCGATGCAGG CTTCCCCGAG
GGCGCCTACG TCAACCTGTT CGCTACCCAC CAGCAGATCT CCAACATCAT TGCCGACGAC
CGGGTGCAGG GTGTGTCGCT GACGGGCTCA GAGCAAGTGG GCGCAATCGT GGCTGAACAG
GCCGGGCGCG CGCTGAAAAA GTGTGTCCTG GAGCTCGGCG GCGCCGACGT GTTCCTCGTT
CTGGACACTG ACGATGTGGA CCTCGCTGTG AAGAAGGCCG TCATGGGTCG CATGGGCAAC
ACGGGCCAGT CCTGCAACGG TTCCAAGAGG ATCGTGGTGC TGGATAAGTA CTTCGATGAG
TTTTCGGAGA AGTTCAAGGC TGCCATCGCC GGACAGTCCT ACGAGAACGG CGATTTCGGA
CCGATGTCTT CGGACTCGGC CACCAAGTTC CTGGCTTCCC AGGTGCAGGG TGCGCTGGAC
CAGGGTGCGG AAATTCTGGT GGGCAACAAC CAGCCCCAGG GCAACGTCTT CACTCCGACA
ATCATCACCA ACATCACGCC GTCCATGGAC GTCTACAGCG AGGAACTCTT CGGCCCCGTT
GCGCAGCTGT ACAAAGTCAG CAGCGACGCC GAGGCGATCA ATCTTGCCAA CTCCTCGCCG
TACGGCCTGG GTTCCGTAGT GATCTGCGAC GACGTCGAGC GCGCCGAGCG CGTCGGCAAC
CAGCTCGACG TCGGCATGGT ATTCGTCGGT GCCTACGACC TCAGCGGTGC GGACGTGCCG
TTCGGCGGCG TCAAGAAGTC CGGCTACGGA CGCGAACTGG GCAAGGTGGG CATGCTGGAA
TTCGCCAACA AGAAGCTGTT CCGCTTCGCC AAATAA
 
Protein sequence
MSAFAVIDPA TGTIHAEYPA ATDAEVEAGL AAAQNTYHEW SRTTTVAERA GMARRLAELF 
VERKDKLAAI INREMGKPLQ QAAGEAEFSG SIASAFAENA EEWLADEQLE VADGLRSFFR
YQGLGVILGI MPWNYPYYQV ARFAIPNIIL GNTVIVRHAS QCPESALALE ELFRDAGFPE
GAYVNLFATH QQISNIIADD RVQGVSLTGS EQVGAIVAEQ AGRALKKCVL ELGGADVFLV
LDTDDVDLAV KKAVMGRMGN TGQSCNGSKR IVVLDKYFDE FSEKFKAAIA GQSYENGDFG
PMSSDSATKF LASQVQGALD QGAEILVGNN QPQGNVFTPT IITNITPSMD VYSEELFGPV
AQLYKVSSDA EAINLANSSP YGLGSVVICD DVERAERVGN QLDVGMVFVG AYDLSGADVP
FGGVKKSGYG RELGKVGMLE FANKKLFRFA K