Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3673 |
Symbol | |
ID | 7295155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 4081950 |
End bp | 4085510 |
Gene Length | 3561 bp |
Protein Length | 1186 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643592079 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002489717 |
Protein GI | 220914408 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC CTCCTCCCAC TCAAGGAAAG ATCGGGAAAA CCATGACCCA CGTTGCAATG GAACCGGCAG TCACCCACGC GGCCACCCCG CAGACGGTCG ACGTCGACGT CCCCCAGGCC AAAGCCCTCG CCACTGAGGC GGTCGCCTTG GTCCGTCGCT GGCTTACCGA GGCCGCGAAG GTTCCGGTGG ATGCCTCCGC CGAACAGCTC GCCGGAGTGC TGAAGGACCC CAACGGCCTG GACTTCACTG TGGGCTTCGT GGACGGCGTA GTCCGCCCCG AGGACCTGAA CGTCGCCGCC CGCAACCTCG CCGCACTGGC CCCGAAGGTT CCGGCCTTCC TCCCCTGGTA CATGAAGAGC GCCGTGGCCC TTGGCGGCAC CATGGCCCCC GCCTTGCCGC AGGTGGTTAT CCCCGCGGCC CGCAAGGTCC TGCGCGAAAT GGTGGGCCAC TTGATTGTGG ACGCCACCGA TGCCAAGCTG GGCCCCGCCA TCGCCAAGAT CAAGAAGGAC GGCATCAAGC TCAACGTCAA CCTCCTGGGC GAGGCAGTCC TCGGCGAGCA CGAGGCATCC CGCCGGCTCG AAGGCACGCA CACGCTGCTG GCGCGCCCCG ACGTCGACTA CGTCTCCATC AAGGTCTCCT CCACCGTGGC CCCGCACTCC GCCTGGGCCT TCGACGAAGC CGTGGAACAC GTCGTCGAGA AGCTCACCCC GCTGTTCCAG CGCGCTGCCT CCTTCGCCGC CCCCGGCAGC AGCACCGGCG GCAAGGCCAA GTTCATCAAC CTGGACATGG AGGAATACAA GGACCTGGAC ATGACCATCG CGGTCTTCAC CAGGATCCTG GACAAGCCCG AGTTCAAGGA CCTCGAAGCC GGCATCGTGC TGCAGGCCTA CCTCCCGGAC GCACTGTCCG CCATGATCCG CCTGCAGGAC TGGGCCGCAG AGCGCCGGGC CAACGGCGGC GCCGGCATCA AGGTCCGCGT GGTCAAGGGC GCCAACCTGC CCATGGAGCA GGTGGAAGCC TCGCTGCACG ATTGGCCGCT GGCCACCTGG GGTTCGAAGC AGGACTCCGA CACCAGCTAC AAGAGCGTCA TCAACTACTC GCTGCACCCG GAGCGGATCA AGAACATCCG GATCGGCGTT GCCGGCCACA ATCTGTTCGA TATTGCCTTC GCCTGGCTGC TGGCCAAGCA GCGCGGGGTA GAGTTCGGCA TCGAGTTCGA GATGCTGCTG GGCATGGCGC AGGGGCAGGC CGAAGCCGTC AAGAAGGACG TCGGCTCGCT CCTGCTGTAC ACGCCGGTGG TGCATCCGGC TGAGTTCGAC GTCGCCATCG CCTACCTGAT CCGCCGCCTC GAAGAAGGCG CCAGCCAGGA CAACTTCATG TCCGCCGTCT TCGAGCTCGA TAAGAACGAA GCCCTGTTCG AGCGGGAAAA GCAGCGTTTC CTCTCCTCGC TGGAGTCCCT GGACAACACC GTGCCGCCGG CCAACCGCCA GCAGAACCGC AGCCTGACGC CGGTCGCGAT GCCGCACGAT AGGTTCAAAG ACACCCCGGA TACTGATCCC TCCCTGCCGG CCAACCGCAC CTGGGGCCGG GCCATCCTGG ACCGCGTTCC GGGCTCCACT CTGGGTAACG CGTCGGTGAA GGCGGCCTTC ATCAACGACG AAGCCACCCT CAACAAGGCC ATCGAAACCG CCGTCGACAA GGGCAAGGCC TGGGGCGCAC TGTCCGGCGA CGAACGTGCC GCCATCCTGC ACCGCGCCGG TGACGCCCTC GAGGCCCGCC GTGCCGACCT CCTCGAGGTC ATGGCCAGCG AGACCGGCAA GACCATCGAC CAGGGCGATC CCGAGGTCAG CGAGGCAGTG GACTTTGCCC ACTACTACGC CGAGTCCGCG CGCAAGCTGG ACGCGGTCGA CGGCGCCACG TTCGTCCCGG CGAAGCTCAC CGTCGTGACG CCGCCGTGGA ACTTCCCGGT CGCCATCCCG GCAGGGTCCA CTCTCGCGGC ACTCGCCGCC GGCTCCGCCG TCGTCATCAA GCCTGCCAAG CAGGCCGCCC GCAGCGGCGC CGTGATGGTC GAAGCCCTGT GGGAAGCCGG CGTCCCCAAG GACGTCCTCA CCATGGTGCA GCTGGGTGAG CGGGAGCTCG GCACCCAGCT GATCAGCCAC CCGGCCGTGG ACCGCGTCAT CCTCACCGGC GGCTACGAGA CCGCGGAACT GTTCCGCTCC TTCCGCAAGG ACCTGCCGCT GCTGGCCGAG ACGAGCGGCA AGAACGCCAT CATCGTCACC CCCAGCGCCG ACCTGGACCT GGCGGCAAAG GACGTGGCGT ACTCAGCGTT CGGCCACGCC GGCCAGAAGT GCTCCGCCGC TTCACTGGTG ATCCTGGTTG GCTCCGTAGC CAAATCCAAG CGGTTCCACA ACCAGCTGAT CGACGGCGTC ACCTCGCTGA AGGTTGGGTA CCCGCAGGAC CCCACCAGCC AAATGGGCCC CATCATCGAG CCCGCCAACG GCAAGCTCCT CAACGCGCTC ACCACCCTGG GCGAAGGCGA AAACTGGGCA GTTGAGCCGA AGAAGCTGGA CGGCACCGGC AAGCTCTGGA GCCCCGGCGT GCGCTACGGC GTCAAGCGCG GTTCCTACTT CCACCTCACC GAGTTCTTCG GCCCGGTCCT GGGTGTCATG ACCGCCGACA ACCTCGAGGA GGCCATCGCC ATCCAGAACC AGATCGAGTA CGGCCTCACC GCAGGGCTGC ACTCGCTCAA CTCCGAGGAA CTCGGCATCT GGCTGGACGG CATCCAGGCC GGCAACCTGT ACGTCAACCG CGGCATCACC GGCGCCATTG TGCAGCGGCA GCCGTTCGGC GGCTGGAAGA AGTCCGCTGT TGGCGCCGGA ACCAAGGCCG GTGGGCCGAA CTACCTGGCC GGGCTCGGCG ACTGGACCCC CGCCGAGGCA ACCGCAAAGG CTGAGGTTAC GCACCCCGGC GTCCGGCGGA TCATCAACGC CGCAGGCGCC GCCCTTGATC CGGCCCGCCT TGAGTCCGTG CAGCGGGCCC TGGCGTCCGA CGCCGAGGCC TGGGCCAACG AGTTCGGCAC GGCCAGGGAC GTCTCGGGCC TGAGCGCGGA GCGCAACATC TTCCGCTACA GGAACCTTCC GGTCACCATC CGCCTCTCCG AAGGTGCGCC GCTGGCGGAC CTGGTGCGGA CGGCGGCCGC CGGCGTACTG GCGGGCTCGC CGCTGACGGT GTCCACCGCT GTCGAACTCC CCGCCCAGCT GCGGGCCGTA CTCCTCGGCC TCGACGTGGA CCTGACGGTG GAGTCCGACG CCGGCTGGCT GGCTTCGGCT GGGCAGCTTG CCTCGGCGGG CAAGCTCTCC GGCGCCCGCA TCCGCCTGCT GGGGGGCGAC GCTACGGCCC TGGCGGAAGC AACCGGCGGC CGGCCCGACA TCGCTGTCTA CTCCCACGCG GTGACCGAAG CCGGCCGGGT TGAGCTGCTG CCGTTCCTGC ACGAGCAGGC TGTCAGCATC ACCGCACACC GCTTCGGCAC GCCGAACCAC CTTTCGGACG CTCTCATCTA A
|
Protein sequence | MKRPPPTQGK IGKTMTHVAM EPAVTHAATP QTVDVDVPQA KALATEAVAL VRRWLTEAAK VPVDASAEQL AGVLKDPNGL DFTVGFVDGV VRPEDLNVAA RNLAALAPKV PAFLPWYMKS AVALGGTMAP ALPQVVIPAA RKVLREMVGH LIVDATDAKL GPAIAKIKKD GIKLNVNLLG EAVLGEHEAS RRLEGTHTLL ARPDVDYVSI KVSSTVAPHS AWAFDEAVEH VVEKLTPLFQ RAASFAAPGS STGGKAKFIN LDMEEYKDLD MTIAVFTRIL DKPEFKDLEA GIVLQAYLPD ALSAMIRLQD WAAERRANGG AGIKVRVVKG ANLPMEQVEA SLHDWPLATW GSKQDSDTSY KSVINYSLHP ERIKNIRIGV AGHNLFDIAF AWLLAKQRGV EFGIEFEMLL GMAQGQAEAV KKDVGSLLLY TPVVHPAEFD VAIAYLIRRL EEGASQDNFM SAVFELDKNE ALFEREKQRF LSSLESLDNT VPPANRQQNR SLTPVAMPHD RFKDTPDTDP SLPANRTWGR AILDRVPGST LGNASVKAAF INDEATLNKA IETAVDKGKA WGALSGDERA AILHRAGDAL EARRADLLEV MASETGKTID QGDPEVSEAV DFAHYYAESA RKLDAVDGAT FVPAKLTVVT PPWNFPVAIP AGSTLAALAA GSAVVIKPAK QAARSGAVMV EALWEAGVPK DVLTMVQLGE RELGTQLISH PAVDRVILTG GYETAELFRS FRKDLPLLAE TSGKNAIIVT PSADLDLAAK DVAYSAFGHA GQKCSAASLV ILVGSVAKSK RFHNQLIDGV TSLKVGYPQD PTSQMGPIIE PANGKLLNAL TTLGEGENWA VEPKKLDGTG KLWSPGVRYG VKRGSYFHLT EFFGPVLGVM TADNLEEAIA IQNQIEYGLT AGLHSLNSEE LGIWLDGIQA GNLYVNRGIT GAIVQRQPFG GWKKSAVGAG TKAGGPNYLA GLGDWTPAEA TAKAEVTHPG VRRIINAAGA ALDPARLESV QRALASDAEA WANEFGTARD VSGLSAERNI FRYRNLPVTI RLSEGAPLAD LVRTAAAGVL AGSPLTVSTA VELPAQLRAV LLGLDVDLTV ESDAGWLASA GQLASAGKLS GARIRLLGGD ATALAEATGG RPDIAVYSHA VTEAGRVELL PFLHEQAVSI TAHRFGTPNH LSDALI
|
| |