Gene Arth_2451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2451 
Symbol 
ID4445039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2746328 
End bp2747395 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content70% 
IMG OID639690265 
Productalcohol dehydrogenase 
Protein accessionYP_831930 
Protein GI116670997 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGTT TCGACCAGCT CCCGGCCACC TCGGCCGCCG TAGTAGCCCA CGCGGCGGGC 
GACCTCCGGA TCGAAGACGT TCCTGTGCCG CCTCCGGGTC CCGACGAAGC TGTCGTGGAA
GTGGCCTTCG GCGGCATCTG CGGCTCCGAC CTGCACTACT GGCTCCACGG TGCCGCGGGT
GAGTCCATCC TCCGCGTACC CATGGTCCTG GGGCATGAAA TTGTGGGAAC GGTCCTGCAC
GCGGCTGCGG ACGGCACCGG ACCCGAAGCC GGCACCCCGG TCGCCGTCCA CCCCGCCACG
CCCGGCCCGG GCGCCGCACG GTATCCCGAG GACCGGCCCA ACCTATCGCC GGGCTGCACC
TACCTGGGCA GCGCCGCCCG CTACCCGCAC ACCGATGGGG CCTTCAGCCG CTACGCCACG
CTGCCCGCCC GGATGCTCCG GCCGCTCCCG GACGGGCTCA GCCTGCGGAC TGCCGCGCTG
GCGGAACCGG CCAGCGTTGC CTGGCATGCC GTTGCCCGCG CCGGGGACGT CACTGGAAAG
ACGGCCCTGG TGATCGGTAG CGGCCCCATC GGTGCACTGG CCGTCGCCGT GCTCAAACGC
GCCGGTGCCA GGCGGGTCGT GGCCGTGGAC ATGCACCCCA AGCCACTGGA AATAGCCCAG
GCCGTCGGCG CCGACGAAGT CCTCAAGGCA GACGAAAGCG ACGCCATCGC AGCGGTGGAG
GCGGACGTGG TCATCGAATC GTCCGGCAGC CACCACGGCC TTGCCTCCGC CATCAAGGGC
GCTGTCCGCG GAGGCAAGGT GGTGATGGTG GGCCTGCTGC CGTCGGGGCC TCAGCCCGTC
CTGATCTCGC TTGCCATCAC CCGAGAGCTG GAACTCCTGG GCTCGTTCCG CTTCAACGGC
GAAATCGACG AGGTTATTGC GGCTCTCGCT GACGGCACCT TATTCGTTGA CCCCGTGGTC
ACCCACGACT TCCCGCTGGA ACGCGGACTC GAGGCCTTCG AAGTCGCCAG GAACTCGGCC
GAGTCGGGGA AGGTGCTGCT GGACTTTAGC CCTGCTGCAG GGGAATGA
 
Protein sequence
MPGFDQLPAT SAAVVAHAAG DLRIEDVPVP PPGPDEAVVE VAFGGICGSD LHYWLHGAAG 
ESILRVPMVL GHEIVGTVLH AAADGTGPEA GTPVAVHPAT PGPGAARYPE DRPNLSPGCT
YLGSAARYPH TDGAFSRYAT LPARMLRPLP DGLSLRTAAL AEPASVAWHA VARAGDVTGK
TALVIGSGPI GALAVAVLKR AGARRVVAVD MHPKPLEIAQ AVGADEVLKA DESDAIAAVE
ADVVIESSGS HHGLASAIKG AVRGGKVVMV GLLPSGPQPV LISLAITREL ELLGSFRFNG
EIDEVIAALA DGTLFVDPVV THDFPLERGL EAFEVARNSA ESGKVLLDFS PAAGE