Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2451 |
Symbol | |
ID | 4445039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2746328 |
End bp | 2747395 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639690265 |
Product | alcohol dehydrogenase |
Protein accession | YP_831930 |
Protein GI | 116670997 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGTT TCGACCAGCT CCCGGCCACC TCGGCCGCCG TAGTAGCCCA CGCGGCGGGC GACCTCCGGA TCGAAGACGT TCCTGTGCCG CCTCCGGGTC CCGACGAAGC TGTCGTGGAA GTGGCCTTCG GCGGCATCTG CGGCTCCGAC CTGCACTACT GGCTCCACGG TGCCGCGGGT GAGTCCATCC TCCGCGTACC CATGGTCCTG GGGCATGAAA TTGTGGGAAC GGTCCTGCAC GCGGCTGCGG ACGGCACCGG ACCCGAAGCC GGCACCCCGG TCGCCGTCCA CCCCGCCACG CCCGGCCCGG GCGCCGCACG GTATCCCGAG GACCGGCCCA ACCTATCGCC GGGCTGCACC TACCTGGGCA GCGCCGCCCG CTACCCGCAC ACCGATGGGG CCTTCAGCCG CTACGCCACG CTGCCCGCCC GGATGCTCCG GCCGCTCCCG GACGGGCTCA GCCTGCGGAC TGCCGCGCTG GCGGAACCGG CCAGCGTTGC CTGGCATGCC GTTGCCCGCG CCGGGGACGT CACTGGAAAG ACGGCCCTGG TGATCGGTAG CGGCCCCATC GGTGCACTGG CCGTCGCCGT GCTCAAACGC GCCGGTGCCA GGCGGGTCGT GGCCGTGGAC ATGCACCCCA AGCCACTGGA AATAGCCCAG GCCGTCGGCG CCGACGAAGT CCTCAAGGCA GACGAAAGCG ACGCCATCGC AGCGGTGGAG GCGGACGTGG TCATCGAATC GTCCGGCAGC CACCACGGCC TTGCCTCCGC CATCAAGGGC GCTGTCCGCG GAGGCAAGGT GGTGATGGTG GGCCTGCTGC CGTCGGGGCC TCAGCCCGTC CTGATCTCGC TTGCCATCAC CCGAGAGCTG GAACTCCTGG GCTCGTTCCG CTTCAACGGC GAAATCGACG AGGTTATTGC GGCTCTCGCT GACGGCACCT TATTCGTTGA CCCCGTGGTC ACCCACGACT TCCCGCTGGA ACGCGGACTC GAGGCCTTCG AAGTCGCCAG GAACTCGGCC GAGTCGGGGA AGGTGCTGCT GGACTTTAGC CCTGCTGCAG GGGAATGA
|
Protein sequence | MPGFDQLPAT SAAVVAHAAG DLRIEDVPVP PPGPDEAVVE VAFGGICGSD LHYWLHGAAG ESILRVPMVL GHEIVGTVLH AAADGTGPEA GTPVAVHPAT PGPGAARYPE DRPNLSPGCT YLGSAARYPH TDGAFSRYAT LPARMLRPLP DGLSLRTAAL AEPASVAWHA VARAGDVTGK TALVIGSGPI GALAVAVLKR AGARRVVAVD MHPKPLEIAQ AVGADEVLKA DESDAIAAVE ADVVIESSGS HHGLASAIKG AVRGGKVVMV GLLPSGPQPV LISLAITREL ELLGSFRFNG EIDEVIAALA DGTLFVDPVV THDFPLERGL EAFEVARNSA ESGKVLLDFS PAAGE
|
| |