Gene Saro_2512 details

Gene Information

Locus tag: Saro_2512
Symbol:
ID: 3916833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2716714
End bp: 2717754
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 68%
IMG OID: 640445269
Product: alcohol dehydrogenase
Protein accession: YP_497782
Protein GI: 87200525
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
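
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2716714
end_bp = 2717754
reported_gene_length = 1041      # bp
reported_protein_length = 346    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.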


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.881464
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCCC TGCGCTACTA CGGTGCCCGC GACATCCGCC ATGAATCGAT GGATGATCCG 
ACGCCGCAAT CGGACCGCGA CGCAATCGTG AAGGTCGATG CCTGCTCGAT CTGCGGCTCC
GACCTTCACA TCTACCACGG CCACGGCTTT TCCGAGGATA TCGGCTTCTG CGTGGGCCAT
GAAGCGGTGG GCGAAGTGGT CGAGGTCGGG CGCGGCGTCC ACCGGCTCAA GGTCGGGCAA
AAGGTGATGA TCCCCGCCGC GGTCGGCTGC GGGGCCTGCC GCTCGTGCCT CGCAGGGGTG
GTCAACACCT GCGAAAACAA TGGCTCGGGC TGCTACGGCC TGTCCGCGAA GCTACAGGGA
TCGCAGGCAG AGGCGGTGCG CGTTCCCGCT GCGGATGCCA ATGCGGTCGC CATTCCCGAA
GGCGTCAGCA CCGAACAGGC GCTGATGATG ACCGACGCGC TCGCCACGGC ATGGTTCGGT
GCACGCCAGG CCGATATCCG CCCCGGCAGT TCGGTCGGCA TCATCGGCCT CGGGCCGATC
GGCCTCATGG CGGCGGAGAG CGCATTCGTG ATGGGCGCAC ATGTTGTCTA TGCGATCGAT
CCCGTGCCGG AACGCCGCGC CATCGCGGAA AGCCTCGGGG CCATTGCCTT GCATCCGGAC
GAGGCTTCCG CGCGGATCAA GGAGGACACG CACGGCAGGC GCCTCGATTG CGTGGTGGAA
GTCGTCGGAT CGGATGCCAC CGTCGACATG GCCCTGCGGC TCGTGCGCGT GCGCGGCACG
GTCTCGGTGA TCGGCGTCCA GCAATCGCGC CGCTTTCCCT TCCCGCTCGA GCGGGCCTTC
GCCGGCGGAC TCACCTTCCG CGTGGGCACC TGCTCGGTCC CGGAGGAACT GCCAGCTCTG
TTCCCGCTTG TCGCTTCGGG CCGCCTGCGC CCCGAACGCT ACATCAGCCA CCGCCTGCCC
CTGTCGCAGG GCGCCGAAGC CTACCGCATG TTCGAGGCGC GCGAGGCAGG CGCGCTCAAG
ATGGTGCTTG TGCCGGACTG A
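
The gene sequence is displayed in space-separated groups of 10 bases, so the grouping has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line of the sequence is used to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (60 bases), copied from the record.
first_line = ("ATGAAGGCCC TGCGCTACTA CGGTGCCCGC "
              "GACATCCGCC ATGAATCGAT GGATGATCCG")

seq = clean_sequence(first_line)
print(len(seq), gc_percent(seq))
```

The reported GC content of 68% refers to the whole gene. To compare against it, pass the full 1041-base sequence to `clean_sequence` instead of the first line only.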
 
Protein sequence
MKALRYYGAR DIRHESMDDP TPQSDRDAIV KVDACSICGS DLHIYHGHGF SEDIGFCVGH 
EAVGEVVEVG RGVHRLKVGQ KVMIPAAVGC GACRSCLAGV VNTCENNGSG CYGLSAKLQG
SQAEAVRVPA ADANAVAIPE GVSTEQALMM TDALATAWFG ARQADIRPGS SVGIIGLGPI
GLMAAESAFV MGAHVVYAID PVPERRAIAE SLGAIALHPD EASARIKEDT HGRRLDCVVE
VVGSDATVDM ALRLVRVRGT VSVIGVQQSR RFPFPLERAF AGGLTFRVGT CSVPEELPAL
FPLVASGRLR PERYISHRLP LSQGAEAYRM FEAREAGALK MVLVPD