Gene Arth_3962 details


Gene Information

Locus tag: Arth_3962
Symbol:
ID: 4447622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4476902
End bp: 4477957
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 63%
IMG OID: 639691793
Product: alcohol dehydrogenase
Protein accession: YP_833437
Protein GI: 116672504
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
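
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based and inclusive and that the protein length does not count the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is not counted."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4476902, 4477957)
print(length_bp, protein_length(length_bp))
```

With the values from this record the sketch gives 1056 bp and 351 aa, in agreement with the Gene Length and Protein Length fields.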


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCGC TCGTCTACGG CGGTCCCGGC GAAAAGTCAT GGACCGACGT TCCGGATCCC 
GCCATCCAGA ACCCCAGCGA CGTAATCGTC AAGGTGGACA CCACCACCAT CTGTGGAACG
GACCTGCACA TCCTCAAGGG GGACGTGCCC GCAGTTCAGA AAGGCCGGAT CCTGGGGCAT
GAGGGCGTGG GAACCATCAC CGAAGTGGGC TCCTCGGTCA CCAGCCTGAA AGTAGGGGAC
CGGGTCATCA TCTCCTGCAT CAAGTCCTGC GGCCACTGCG CCAACTGCAA GACCGGTCTT
TATTCGCACT GCATGGGCGA GGAAGGCGCA GCAGGTATCG GCTGGGTCTT CGGACACCTG
ATCGACGGTA CGCAGGCCGA ATACGTGCGG GTCCCATACG CGCAGAACTC GCTGCACCTT
CTCCCCGAAG GGGTCAGCGA CGACCAGGCC GTGATGCTCT CCGACATCCT GCCCACCGGC
TTTGAAATCG GTGTGCAGTA CGGGCGGGTC AAGCCGGGGG ACACCGTGGC GGTTGTAGGC
GCGGGGCCGG TCGGGTTGGC AGCAATCGCC ACCGCCGGGC TGTACGGCGC GGCAACCATC
ATCGCGATCG ACCTTGACGC CAACCGGCTT GAAAAGTCCC GCGAATTCGG CGCCACGGAC
GTCGTGCTCT CCGGCGACGC CGACTGGAAG GAACAGGTGC TGGCGCTCAC GGACGGACAG
GGCGTGGATG TGGCCATAGA AGCGGTGGGC ATCCCGGCGA CCTTCGGAAT GTGCACGGAG
ATCGTGCGCC CCGGCGGCAA CGTGGCCAAC GTCGGCGTGC ATGGAAAGTC CGTCGAACTC
CATGTGGAGA ACCTCTGGAT CCAGAACATC AACATCAGCA TGGGCCTGGT CAACGCCAAC
ACCACGCCGA TGCTCCTCAA GCTGGTGGCG CAGAGGAAGG TTCCCGCGGA GAAATTCGCC
ACCCACCATT TCACGTTCGA CCAGTTCATG GACGCCTACG ACACCTTCGC CCGCGCAGCC
GAAACCAAGG CACTCAAAGT CGTGATCACG GCGTGA
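
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation such as GC content. The following is a minimal Python sketch of that cleanup, assuming GC content is defined as the fraction of G and C bases over the whole sequence. The helper names are hypothetical, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGAAAGCGC TCGTCTACGG CGGTCCCGGC GAAAAGTCAT GGACCGACGT TCCGGATCCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field.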
 
Protein sequence
MKALVYGGPG EKSWTDVPDP AIQNPSDVIV KVDTTTICGT DLHILKGDVP AVQKGRILGH 
EGVGTITEVG SSVTSLKVGD RVIISCIKSC GHCANCKTGL YSHCMGEEGA AGIGWVFGHL
IDGTQAEYVR VPYAQNSLHL LPEGVSDDQA VMLSDILPTG FEIGVQYGRV KPGDTVAVVG
AGPVGLAAIA TAGLYGAATI IAIDLDANRL EKSREFGATD VVLSGDADWK EQVLALTDGQ
GVDVAIEAVG IPATFGMCTE IVRPGGNVAN VGVHGKSVEL HVENLWIQNI NISMGLVNAN
TTPMLLKLVA QRKVPAEKFA THHFTFDQFM DAYDTFARAA ETKALKVVIT A
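
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, assuming the gene sequence is given in coding orientation and relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as starts, which this sketch does not handle). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGAAAGCGC TCGTCTACGG CGGTCCCGGC"))  # prints MKALVYGGPG
```

The output matches the first block of the protein sequence, and the final codons of the gene (GCG TGA) translate to a terminal A followed by a stop.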