Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2512 |
Symbol | |
ID | 3916833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2716714 |
End bp | 2717754 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640445269 |
Product | alcohol dehydrogenase |
Protein accession | YP_497782 |
Protein GI | 87200525 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.881464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCCC TGCGCTACTA CGGTGCCCGC GACATCCGCC ATGAATCGAT GGATGATCCG ACGCCGCAAT CGGACCGCGA CGCAATCGTG AAGGTCGATG CCTGCTCGAT CTGCGGCTCC GACCTTCACA TCTACCACGG CCACGGCTTT TCCGAGGATA TCGGCTTCTG CGTGGGCCAT GAAGCGGTGG GCGAAGTGGT CGAGGTCGGG CGCGGCGTCC ACCGGCTCAA GGTCGGGCAA AAGGTGATGA TCCCCGCCGC GGTCGGCTGC GGGGCCTGCC GCTCGTGCCT CGCAGGGGTG GTCAACACCT GCGAAAACAA TGGCTCGGGC TGCTACGGCC TGTCCGCGAA GCTACAGGGA TCGCAGGCAG AGGCGGTGCG CGTTCCCGCT GCGGATGCCA ATGCGGTCGC CATTCCCGAA GGCGTCAGCA CCGAACAGGC GCTGATGATG ACCGACGCGC TCGCCACGGC ATGGTTCGGT GCACGCCAGG CCGATATCCG CCCCGGCAGT TCGGTCGGCA TCATCGGCCT CGGGCCGATC GGCCTCATGG CGGCGGAGAG CGCATTCGTG ATGGGCGCAC ATGTTGTCTA TGCGATCGAT CCCGTGCCGG AACGCCGCGC CATCGCGGAA AGCCTCGGGG CCATTGCCTT GCATCCGGAC GAGGCTTCCG CGCGGATCAA GGAGGACACG CACGGCAGGC GCCTCGATTG CGTGGTGGAA GTCGTCGGAT CGGATGCCAC CGTCGACATG GCCCTGCGGC TCGTGCGCGT GCGCGGCACG GTCTCGGTGA TCGGCGTCCA GCAATCGCGC CGCTTTCCCT TCCCGCTCGA GCGGGCCTTC GCCGGCGGAC TCACCTTCCG CGTGGGCACC TGCTCGGTCC CGGAGGAACT GCCAGCTCTG TTCCCGCTTG TCGCTTCGGG CCGCCTGCGC CCCGAACGCT ACATCAGCCA CCGCCTGCCC CTGTCGCAGG GCGCCGAAGC CTACCGCATG TTCGAGGCGC GCGAGGCAGG CGCGCTCAAG ATGGTGCTTG TGCCGGACTG A
|
Protein sequence | MKALRYYGAR DIRHESMDDP TPQSDRDAIV KVDACSICGS DLHIYHGHGF SEDIGFCVGH EAVGEVVEVG RGVHRLKVGQ KVMIPAAVGC GACRSCLAGV VNTCENNGSG CYGLSAKLQG SQAEAVRVPA ADANAVAIPE GVSTEQALMM TDALATAWFG ARQADIRPGS SVGIIGLGPI GLMAAESAFV MGAHVVYAID PVPERRAIAE SLGAIALHPD EASARIKEDT HGRRLDCVVE VVGSDATVDM ALRLVRVRGT VSVIGVQQSR RFPFPLERAF AGGLTFRVGT CSVPEELPAL FPLVASGRLR PERYISHRLP LSQGAEAYRM FEAREAGALK MVLVPD
|
| |