Gene Arth_1804 details


Gene Information

Locus tag: Arth_1804
Symbol:
ID: 4445668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2019012
End bp: 2020061
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 62%
IMG OID: 639689622
Product: alcohol dehydrogenase
Protein accession: YP_831294
Protein GI: 116670361
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
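
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions (the record does not state them), and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, that the final codon is a stop codon.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2019012, 2020061)
print(length)                     # 1050
print(protein_length_aa(length))  # 349
```

Under these assumptions the computed values agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields.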


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCCG CACGGCTTCA CTCCCCCGGA AATATCAGGG TCGATGACAT CCCCAGACCG 
TCCGCCGACG CGGGTGACAT CATCATCAGA GTCCGGGCCG CGTCGATCTG CGGCACAGAC
CGTCGGATTG CCGCCAACGG GCATTTCAAG CTTCCGGAAG GGACTCCGCG TGTTCTTGGG
CACGAGTTTG CCGGCGAGAT TGTGGAGGCG GGCAGCGAGG TCAGTGGTTA CGCCGTCGGA
GACCGCGTCA GCGTTACGCC CAACGTCGGC TGCGGGACAT GCCCCAACTG CCTCGTTGGG
CTGAACAACA TGTGCCCCTC CTATGAAGCC TTCGGCATCA CGATGGACGG GGGCTTCCAG
GAGTACGTCC GGATACCCCG CTTTGCCCTC AACCGAGGCA ACGTGTTCCA CCTTCCGGAG
ACTGTGGGCT ATGCCGAGGC CGCACTGGTC GAACCACTCT CGTGCTGCTA CAACGCGGTC
AGCAAACTTG ATGTCCGACC GGACTCCACC GTGCTGATCA TGGGTGCCGG ACCCATCGGG
GCCTGTCACG TCATGCTGGC AAAGCTCTAC GGCGCCCGGA AAGTCATCGT TTCGAACAAC
CGGCAACCGC GGCTCGACTT CGCGGGTACT CTCGGCGCCG ATGTGCTGGT CAACCTCACC
GAACGCGACC TGGCCACTGT CGTGGCCGAG GAAACCGGTG GTCTGGGAGT CGATGTTGCC
CTGACCTGCG TCTCCAAGCC CGAGGTACAG GCTCAGGCCG TCGACCTGCT GGCAACGCAC
GGAAGAGTCA ATTTCTTTGC CGGACTCGGC AAAGCGCAAC CTGTTGCCCT TGACACCAAC
CGGGTCCACT ACCAGGGGCT GACTCTGACC GGTACAACGG GTTCCAGCAA TTCCGATTAT
GCGTCCGCCC TCAGCCTCGT GGGGGAGGGC AGGCTGGACC TCTCGCCACT GATCAGCCAG
ACGTTCACAC TGGATGACAT CGAAAAGGCC ATGGACTACG CCGGATCAGG CCAAGGGATG
AAGGCCATGA TCCTCTTCGA ATCGAACTAA
 
Protein sequence
MKAARLHSPG NIRVDDIPRP SADAGDIIIR VRAASICGTD RRIAANGHFK LPEGTPRVLG 
HEFAGEIVEA GSEVSGYAVG DRVSVTPNVG CGTCPNCLVG LNNMCPSYEA FGITMDGGFQ
EYVRIPRFAL NRGNVFHLPE TVGYAEAALV EPLSCCYNAV SKLDVRPDST VLIMGAGPIG
ACHVMLAKLY GARKVIVSNN RQPRLDFAGT LGADVLVNLT ERDLATVVAE ETGGLGVDVA
LTCVSKPEVQ AQAVDLLATH GRVNFFAGLG KAQPVALDTN RVHYQGLTLT GTTGSSNSDY
ASALSLVGEG RLDLSPLISQ TFTLDDIEKA MDYAGSGQGM KAMILFESN
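
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, checked for GC content, and translated is given below. It uses only the Python standard library; the helper names are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codons in NCBI order (T, C, A, G with the first base varying slowest),
# paired with the amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied with the original spacing.
print(translate("ATGAAAGCCG CACGGCTTCA CTCCCCCGGA"))  # MKAARLHSPG
```

Passing the full gene sequence to translate and gc_percent allows the results to be compared against the protein sequence and the GC content field in the record.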