Gene Saro_1476 details

Gene Information

Locus tag: Saro_1476
Symbol:
ID: 3916141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1518089
End bp: 1519171
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 63%
IMG OID: 640444219
Product: alcohol dehydrogenase, zinc-containing
Protein accession: YP_496753
Protein GI: 87199496
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
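
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the record itself does not state either convention. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1518089, 1519171)
print(length)                     # 1083
print(protein_length_aa(length))  # 360
```

Both values agree with the Gene Length and Protein Length fields of this record.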


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.209599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGGAC GTGCATCGGT GCTGGTAAAA CCGAACCAAC TGGAGACTTG GGACGTCAAA 
GTCGCCGATC CGGAACCGGG CGGTGCCCTG GTTTCGATCG TGCTGGGCGG GGTATGCGGC
AGCGACGTCC ACATCTTGAC CGGCGAGGCT GGCGTGATGC CCTTCCCCAT CATTCTGGGC
CATGAGGGCG TGGGACGCAT CGAGAAGCTG GGGCACGGCG TCAGCACTGA CTACGCTGGC
GAGGAACTTA AGCCCGGCGA TCTGGTATAT TGGTCGCCGA TCGCGCTGTG CCATCGATGC
TATTCCTGCA ATGTTCTCGA CGAGACACCT TGCGAAAATA CCCAGTTCTT CGAAGATGCT
TCCAAGCCGA ACTGGGGTAG CTACGCCGAT TATGCATGGC TGCCCAACGG CATGCCGTTC
TATAAGCTAC CAGCCCAAGC GCAGCCCGAA GCGGTCGCTG CGCTTGGCTG CGCCCTTCCA
ACCGCCCTGC GCGGCTTTGA TCGCTGCGGC TCGGTTCGCG TCGGTGAAAC TGTGGTTGTC
CAAGGTGCAG GCCCTGTCGG CCTGTCGGCG GTGCTCGTGG CGGCGCAGGC CGGGGCGCGT
GACGTGATCG TTATTGACGG CTCACCACTT CGTCGCGAAG CGGCCACCGC ATTGGGCGCC
TCGCTGACGA TCGGCCTCGA CGTCGCGCCC GAGGAACGGC GCCGAATGAT TTACGATCGC
GTTGGTCGCA ATGGCCCCAA TGTCGTCATC GAGGCAGCCG GAGTTCTGCC AGCGTTCCCC
GAAGGCGTGG ACCTGACCGG CAACCACGGC CGCTACATCG TGCTAGGACT TTGGGGCGCC
ATAGGCACTC AACCGATCAG CCCGCGCGAC CTCACAATCA AGAACCTGAC TATCGCTGGT
GCGACCTTCC CCAAGCCCAA GCATTATTAC CAGGCCTTGC ATTTGGCGAC AGCCCTGCAG
GACCGTGTAC CGCTAGCCGG TCTGGTCAGC CACCGGTTCG GCGTCAGCCA GGCGGGCGAA
GCGCTGAGTC TCACCAAGAG TGGCACAGCG ATCAAGGCCG TGATCGATCC GACGATCACC
TGA
 
Protein sequence
MLGRASVLVK PNQLETWDVK VADPEPGGAL VSIVLGGVCG SDVHILTGEA GVMPFPIILG 
HEGVGRIEKL GHGVSTDYAG EELKPGDLVY WSPIALCHRC YSCNVLDETP CENTQFFEDA
SKPNWGSYAD YAWLPNGMPF YKLPAQAQPE AVAALGCALP TALRGFDRCG SVRVGETVVV
QGAGPVGLSA VLVAAQAGAR DVIVIDGSPL RREAATALGA SLTIGLDVAP EERRRMIYDR
VGRNGPNVVI EAAGVLPAFP EGVDLTGNHG RYIVLGLWGA IGTQPISPRD LTIKNLTIAG
ATFPKPKHYY QALHLATALQ DRVPLAGLVS HRFGVSQAGE ALSLTKSGTA IKAVIDPTIT
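
The sequences above are printed in blocks of 10 characters, so they need the whitespace removed before any analysis. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes a GC fraction, and translates DNA using the standard codon assignments, which translation table 11 shares with table 1 for amino acids. The helper names are hypothetical. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTTGGGAC GTGCATCGGT GCTGGTAAAA CCGAACCAAC TGGAGACTTG GGACGTCAAA"
print(translate(clean_sequence(first_line)))  # MLGRASVLVKPNQLETWDVK
```

The printed residues match the first two blocks of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.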