Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2799 |
Symbol | |
ID | 7294279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 3125624 |
End bp | 3127054 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643591213 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002488853 |
Protein GI | 220913544 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03374] 1-pyrroline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00208609 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTCCAAA CCTTGCAGAA CTTCATCAAC GGGAAGTTCG TCACTCCCGC CGGCACCGAG GTGCTTGACG TGGTCAACCC CACCAACGGC GACGTCGTGG CGCACTCGCC GATCTCTGTG CAGGCTGACG TGGACGCTGC CATGGACGCC GCCGCGGAGG CGTTCAAGAG CTGGAAGCAC GCAACCCCCG GCCAGCGCCA GCTGATGCTG CTCAAGCTCG CCGACGCCGT CGAGGCCAAC AGCGACGAAC TTGTCGAGGC CCAGCACCGC AACACCGGCC AGGTCCGCTC CCTGATCGCG TCCGAGGAAG TTGCCGCCGG CGCTGACCAG CTGCGGTTCT TCGCCGGTGC CGCCCGCATC CTTGAGGGCA AGTCCGCCGG CGAGTACTTC GAGGGCCACA CCTCGTATGT CCGCCGCGAG CCCATTGGCG TCGTAGCCCA GGTGGCCCCG TGGAACTACC CGTTCCTTAT GGCCATCTGG AAGATCGGCC CGGCCCTGGC CGCCGGCAAC ACCGTGGTCC TCAAGCCCTC GGACACCACT CCGGAGTCCA CGCTGGTGCT GGCCCGGCTG GCCGGGGAGA TCCTGCCGGC GGGCGTCCTG AACGTTGTTC TCGGTACCGG CAAGACCGGC GCCATGATGG TTGACCACAA GGTCCCCGGC CTGGTGTCCA TCACCGGCTC GGTCCGCGCC GGCATTGCCG TCGCCTCCGG CGCCGCCAAG GGCCTCAAGC GCGCCCACCT GGAACTTGGC GGCAAGGCGC CCGCCATCGT CTTCAAGGAC GCCGACATCA AGAAGAGCGC CGCGGCCATC GCCGAATTCG CCTTTTTCAA CGCCGGCCAG GACTGTACCG CCATCACCCG GGTCCTGGTT GAGGATTCCG TGCACGATGA CGTCGTCGCC GCCATGGTGG AACACACCCG GACCCTGCAC ACCGGTTCGC AGAACGACGA AGACAACTAC TTCGGCCCGC TGAACAACGT GAACCACTTC AACGCCGTCA CCTCCGTGGT GGAAAACCTG CCGGCCAACT GCCGGATCGA AACCGGTGGA CACCGGGCGG GGGAGAAGGG CTACTTCTTC GAGCCCACCA TCATCACCGG CGCCAAGCAG ACCGATGACG TGGTGCAGCA GGAGACCTTC GGCCCGGTGA TCACGGTCCA GAAGTTCAGC ACCGAGGAAG AAGCCGTGGA GCTGGCCAAC GGGGTTGACT ACGCCCTCGC TTCCAGCGTC TGGACCACCA ACCATGGCAC CGCCATGCGC CTCAGCCGGG ACCTCGACTT CGGCGCTGTC TGGATCAACA CCCACATCCT CCTCACCGCC GAAATGCCCC ACGGCGGCTT CAAACAGTCC GGCTACGGCA AGGACCTCTC CATGTACGGC GTGGAGGACT ACACGCGCAT CAAGCACGTG ATGTCCGCCC TCGACTCCTA G
|
Protein sequence | MVQTLQNFIN GKFVTPAGTE VLDVVNPTNG DVVAHSPISV QADVDAAMDA AAEAFKSWKH ATPGQRQLML LKLADAVEAN SDELVEAQHR NTGQVRSLIA SEEVAAGADQ LRFFAGAARI LEGKSAGEYF EGHTSYVRRE PIGVVAQVAP WNYPFLMAIW KIGPALAAGN TVVLKPSDTT PESTLVLARL AGEILPAGVL NVVLGTGKTG AMMVDHKVPG LVSITGSVRA GIAVASGAAK GLKRAHLELG GKAPAIVFKD ADIKKSAAAI AEFAFFNAGQ DCTAITRVLV EDSVHDDVVA AMVEHTRTLH TGSQNDEDNY FGPLNNVNHF NAVTSVVENL PANCRIETGG HRAGEKGYFF EPTIITGAKQ TDDVVQQETF GPVITVQKFS TEEEAVELAN GVDYALASSV WTTNHGTAMR LSRDLDFGAV WINTHILLTA EMPHGGFKQS GYGKDLSMYG VEDYTRIKHV MSALDS
|
| |