Gene Arth_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1794 
Symbol 
ID4445693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2009450 
End bp2010481 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content65% 
IMG OID639689612 
Productalcohol dehydrogenase 
Protein accessionYP_831284 
Protein GI116670351 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0373035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCCCT CAACCACCCT CACCACCGGA ATCCGGGCCG CGGTTCTTTC CGCCGCCCAC 
CATTTCGAGG TCCAGCATGT CCCGAAACCC AGTCCGGGCC CCGGCGCAGC CCTGGTCCGC
GTGTCATACA CCGGCATCTG CGGTTCCGAC TTTCCAATTG TTGACGGCCG CCATCCCCGT
GCCGCAATGC CACTCATCCT GGGGCATGAG ATCACCGGCA TTCTGGAGGA ACCAGGCGGG
AGCGGAATTC CCGCGGGCAC GAGGGTCGCT GTCAATCCAC TGTTGCCTTG CGGTCAGTGC
GGTGCCTGCC TAAAAGGGCT GGGGCATGTC TGCCGGAACC TGCGCCTCCT GGGCATCGAC
GTCCCGGGCT CCATGACTGA AGTCCTGGCC GTTCCGGTGT CGAACCTCTT CGCGTTCTCC
GCCGACGCGC CGGCGACCGA AGCGGCTCTG GCCGAACCCC TGGCCGTGGC GGTCCACGCC
GTCCGTCGCT CGCGGTTGGC ACCGGGAGAG AAGGTACTAA TCTTCGGTGC CGGGCCGATA
GGAATCCTGG TGGCCCTCGT GGCGAGGTTT CGCGGCGCCA AGGATGTGCT CCTTGTCGAG
CCAAGTGAGC AGCGCCGGCA CATTGTTGAG GCACTCGGCT TCAGGGCCCT TGCTCCGCAG
GACTCTCCGG TCGCTCGCGA AAATCGCGAG GCCACGGCGG ACGTTGTGTT TGACTGCGCC
GGGCACTCCA GTGTCACGCC GGCACTAACG GAGGCGGCGC CGGTTCGAGG ACGCATCGTG
ATCGTCGCCG TGCACCACGG ACCGGCCAAT ATCGATCTGC GTGAGCTCGC CTTTGCCGAG
CAGGAAATCA TCGGCGTCCG GGTTTACGAA CCGGCCGATT TCGCCGAATC CGTGCAGCTC
ATCGGAAACC GGGCACTTGG ACTCGCAGGA GTCCCGATAT CCGAATATCC CCTCGAGGCC
GTTGCCGATG CCTTTGCGGA GGCGCGCTCC GCCGCCGGGG CAGTCAAGGT GATCGTGCGC
AGCAACAATT AG
 
Protein sequence
MSPSTTLTTG IRAAVLSAAH HFEVQHVPKP SPGPGAALVR VSYTGICGSD FPIVDGRHPR 
AAMPLILGHE ITGILEEPGG SGIPAGTRVA VNPLLPCGQC GACLKGLGHV CRNLRLLGID
VPGSMTEVLA VPVSNLFAFS ADAPATEAAL AEPLAVAVHA VRRSRLAPGE KVLIFGAGPI
GILVALVARF RGAKDVLLVE PSEQRRHIVE ALGFRALAPQ DSPVARENRE ATADVVFDCA
GHSSVTPALT EAAPVRGRIV IVAVHHGPAN IDLRELAFAE QEIIGVRVYE PADFAESVQL
IGNRALGLAG VPISEYPLEA VADAFAEARS AAGAVKVIVR SNN