Gene Information |
Locus tag | Arth_2001 |
Symbol | |
ID | 4445480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2255748 |
End bp | 2257268 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639689810 |
Product | aldehyde dehydrogenase |
Protein accession | YP_831482 |
Protein GI | 116670549 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
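The start, end, gene length, and protein length listed above are mutually consistent, and a few lines of arithmetic can confirm that. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(2255748, 2257268)
print(length_bp, protein_length(length_bp))  # 1521 506
```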
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.230641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCGG CCACGCACGA CGCTCCCGCT GTGGCCGCCC GTGCCGGGCT CAAGTTGCCG TATACGCACG TGGGGGACAT CTTCGTCGAC GGTGCCTGGA CCCCGGCACG CGGCACCGGC CGCAACCCGG TCACCGATCC CGCCACGGGC GAGGTCTGGG GCTCCGTTCC GGACGGATCC CCTGAAGACG TCGACGCCGC CGTCGGCTCC GCCCGCAGGG CTTTCGACGA CGGCATGTGG CCGCGGCTGA CGCCGTCCGA GCGGGCGGCG TACCTGCTGC GCATCGCCGA GGAAGTGGAG AAGCGCGCCG AGGAGCTCTC GCTGACGAAC ACCCGGGAGA ACGGCTCGCC GGTCTCGGAA TCCGCGGGGG CCGCGGCCAA CGCCGCGGGC ATCTTCCGGT ACTTCGCCAC CCTCGCCGGT TACCTTGAAC GCGAGGACGT GCGGGCTTTC CCCCAGGGCG GCGGCGAGTC CGTGGTCCGA CGCGAACCAA TCGGTGTGTG CGCGCTGATC GCGCCCTGGA ACTTCCCGAT CAACCTCGTC GTGATCAAGC TGGCTCCGGC GCTGCTGGCC GGCTGTACCG TGGTGATCAA GCCGGCGTCG CCAACGCCGC TCTCGCTCCG GGTGATCATC GACGCCGTGG CCGCGGCCGG CGTCCCGGCC GGCGTCGTCA ACCTGGTGAC CGGCTCCGGC CGGCTGGGGG ACTCCCTGGT GAAGCACCCG GGCGTGGACA AGGTCGCCTT CACCGGTTCC ACCCCGGTCG GCCGGAAGAT CGCGGCGGCC TGCGGCGAAC TGCTGCGCCC TGTCACGCTG GAGTTGGGCG GAAAGTCAAG CGCAATCGTG CTGCCGGACG CGGACCTGGA CGCGATGTCC AAGGTGCTGA TCCGGTCCTC GATGCGCAAC ACCGGGCAGA CCTGCTACAT CTCCACCCGG ATCCTGGCCC CCGCCAGCCG TTACGAGGAA GTGGTGGACA TGGTCACCAG CACCATCGCC GCCGGCAAGC AGGGAGACCC GCTGGATCCG GATACGGTCT TTGGCCCGTG CGCCACCGAG TCGCAGTATA GGACCGTGCT GGAGTACGTG GAATCGGGTC TGGCCGAAGG CGCCCGCGCC ACCACCGGCG GACGTGCGGC GTCACTGGGA GGCGGACTCG AAGGCGGGTA CTTCGTGGAA CCGACCGTGT TCGCCGACGT CACCCCGGAG ATGCGGATCT CCCGGGAGGA GATTTTCGGC CCGGTCATCT GCATCCTGAA GTACAACGAC GCCGGCGGCA GCGCTGATGA GGCCGTGGAG TTGGCGAACA ACACCGAGTT CGGTCTGGGC GGCCTGGTGT TCGGCGCAGA CCCGGAAGCG GCGCTGGCCG TGGCGGACCG GATGGACACC GGCTCGGTAG GCATCAACTT TTTCGCCTCC AACCACGCCG CGCCCTTCGG CGGACGTCAC GATTCCGGCC TCGGCACCGA GTACGGCGTC GAGGGCCTCA ACGCCTACCT GAGCTACAAA TCCATTCACC GGAAGGTGTA G
|
Protein sequence | MTAATHDAPA VAARAGLKLP YTHVGDIFVD GAWTPARGTG RNPVTDPATG EVWGSVPDGS PEDVDAAVGS ARRAFDDGMW PRLTPSERAA YLLRIAEEVE KRAEELSLTN TRENGSPVSE SAGAAANAAG IFRYFATLAG YLEREDVRAF PQGGGESVVR REPIGVCALI APWNFPINLV VIKLAPALLA GCTVVIKPAS PTPLSLRVII DAVAAAGVPA GVVNLVTGSG RLGDSLVKHP GVDKVAFTGS TPVGRKIAAA CGELLRPVTL ELGGKSSAIV LPDADLDAMS KVLIRSSMRN TGQTCYISTR ILAPASRYEE VVDMVTSTIA AGKQGDPLDP DTVFGPCATE SQYRTVLEYV ESGLAEGARA TTGGRAASLG GGLEGGYFVE PTVFADVTPE MRISREEIFG PVICILKYND AGGSADEAVE LANNTEFGLG GLVFGADPEA ALAVADRMDT GSVGINFFAS NHAAPFGGRH DSGLGTEYGV EGLNAYLSYK SIHRKV
|
| |
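The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11, and the GC content row can be compared against a direct count. The following is a minimal sketch, not the database's own method: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, the codon table is built from the NCBI table 11 amino acid string, and the input is assumed to contain only A, C, G, and T plus whitespace.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in codon order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# The first two blocks of the gene sequence give the first six residues.
print(translate("ATGACCGCGG CCACGCACGA"))  # MTAATH
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces) checks the two fields against each other. Passing it to `gc_percent` gives a value to compare with the GC content row.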