Gene Saro_1724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1724 
Symbol 
ID3916299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1812584 
End bp1814701 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content65% 
IMG OID640444465 
Productshort chain dehydrogenase 
Protein accessionYP_496998 
Protein GI87199741 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.616533 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCTC AGACCGAGAC CTTGATTGCC GCTGCCGCCA TTCCGTTCGC GGTTCCGACC 
AGCCGTTGGG ACGATGCCGT CGCGGCAAAG CTCGGCCCGG CAGAACTTCT GCTCTATCGC
TCGAACCTGC TCGGTTCCGA TCTTACCGTC ACCAACTTCG GTGGCGGCAA TACGTCGGCC
AAGCTTGAAG AGATGGATCC GCTGACTGGT GAGCCGGTCG AAGTGCTGTG GGTCAAGGGT
TCGGGCGGTG ACATCGGCTC TATGAAGATC GACGGCTTTG CCACTCTTTA CCAGGCAAAG
CTGCTTGGCC TCGAGGCGCA CTATGCCGGG CCGCAGGACG ACGACAAGAT GGTCGGCTTC
CTGCCGCACT GCACGTTCAA CCTCAACTGT CGCGCGGCCA GCATTGATAC GCCGCTGCAC
TCGCTGCTGC CCTTCGCGCA TGTCGATCAC GTCCATCCCG ATGCGATCAT CGCGCTGGCC
GCTTCCTCGG GGGGCGAGGC CGCCACGAAG GAAATCTGGG GCGGCCGGAT CGGCTGGCTG
CCGTGGAAGC GTCCCGGCTA CACCCTTGGC GTGATGCTCC GCGATTTCGT CAAGGCTAAC
CCGGGCGTCG AAGGTGTCAT GCTTGCCGGC CATGGCATCA TCTGCTGGGC CGACAGCGCC
AAGGCCTGCT ATGAACATAC CGTTCGGCTG ATCGCGGACG CGGCCGGCTA TCTCAATGCC
CGGCTCGCCG AAAAGCCAGC GTTCGGAGGT CGGAAAGTGG CGCCGAACCC GGATCGGGCA
AAGATCGCCG CCGACCTCAT GCCTCGCCTG CGTGGCTTCA TGACCGGTGC GCGCAACAAG
CTTGGGCACT TCTCGGACGA TGCCGAGGCG CTGGAGTTCG TTGGCTCGGT GGACTTCGAG
CGTCTCGCCG CGCTTGGCAC CTCGTGCCCC GACCATTTCC TGCGCACCAA GATCGCGCCG
CTGACGCTCG ATCCCTCGCG GCTGCAAGAC GACGACTACC TCGCGCGGAA GATTGCCGGC
TATCGCGATC TCTATGCGGC CTATTATGAA CGCTGCAAGC GCCCGAACTC GCCGGCAATG
CGCGATTCCA ACCCTGTCGT CGTGCTCGTC CCGGGCGTCG GACGCATCAC GTTCGCCACC
GACAAGACCA CCGCGCGGCT CGCTGGCGAA TTCTACGGCA ACGCCATCAA CGTGATGCGC
GGGGCCGAAG CCATCGGCGA TTACATTGCG CTCGATGAGC AGGAAGCCTT CGACATCGAA
TACTGGCTGC TCGAAGAGGC CAAGCTCCAG CGCATGCCTG CGCCCCGGCC TCTGGTCGGC
AAGATCGCGC TGGTCACCGG CGGGGCAGGG GGCATCGGCG CGGCATCGGC CGCCCGCCTG
CTGCGCGAAG GCGCCTGCGT CGTGCTGGCC GATCGTGCCG CCGACGCGGT CGAGGACGTC
CGCGCCGGTT TCGCAAGGCA GTTCGGCAAC GACGTCGTGC GCGCGGCCGT CTGCGACGTG
ACCGACGAGG CGCAGGTCCA GGCTGCTTTC GACGTGGCCG CACGTGAATT TGGCGGGCTC
GACATTCTGG TCGCCAACGC CGGCATCGCA TCTTCTGCAC CGCTCGAGGA AACGACCGTC
GATCTGTGGA ACCGCAACTA CGACGTCCTC GCGCAGGGGT ATTTCCTGAC CTCCCGCTCC
GCCTGGCCGC TCATGAAGCG CATGAAGGAG CAGGGCGGCG CGTCTGTCGT GTTCATCGGT
TCCAAGAACG GCGTTGCCGC CGCTACGAAC GCCAGTGCCT ATGCTTCCGC GAAGGCTGCC
GCGAACCATC TCGCGCGGTG CCTCGCGCTT GAAGGCGCGC CGTTCGGCAT CCGCGTCAAT
ACCGTCAACC CCGATGCCGT CATCAAGGGC AGCAAGATCT GGGACGGCGA CTGGCGCAAG
GAACGCGCCG GGGCCCACGG CATCGACAGC GGCAAGGAAC TGGAAGAGCA CTACCGCCAG
CGCTCGATGC TCAAGCGCGA TGTTCTGCCC GAAGATATCG CGGAAGCAGT CTATTTCCTC
GCTTCGGACA TGTCGGCAAA ATCCACCGGC AACATGATCA ACGTTGATGC GGGGAACGCC
CAGGCCTTCA CTCGCTGA
 
Protein sequence
MNAQTETLIA AAAIPFAVPT SRWDDAVAAK LGPAELLLYR SNLLGSDLTV TNFGGGNTSA 
KLEEMDPLTG EPVEVLWVKG SGGDIGSMKI DGFATLYQAK LLGLEAHYAG PQDDDKMVGF
LPHCTFNLNC RAASIDTPLH SLLPFAHVDH VHPDAIIALA ASSGGEAATK EIWGGRIGWL
PWKRPGYTLG VMLRDFVKAN PGVEGVMLAG HGIICWADSA KACYEHTVRL IADAAGYLNA
RLAEKPAFGG RKVAPNPDRA KIAADLMPRL RGFMTGARNK LGHFSDDAEA LEFVGSVDFE
RLAALGTSCP DHFLRTKIAP LTLDPSRLQD DDYLARKIAG YRDLYAAYYE RCKRPNSPAM
RDSNPVVVLV PGVGRITFAT DKTTARLAGE FYGNAINVMR GAEAIGDYIA LDEQEAFDIE
YWLLEEAKLQ RMPAPRPLVG KIALVTGGAG GIGAASAARL LREGACVVLA DRAADAVEDV
RAGFARQFGN DVVRAAVCDV TDEAQVQAAF DVAAREFGGL DILVANAGIA SSAPLEETTV
DLWNRNYDVL AQGYFLTSRS AWPLMKRMKE QGGASVVFIG SKNGVAAATN ASAYASAKAA
ANHLARCLAL EGAPFGIRVN TVNPDAVIKG SKIWDGDWRK ERAGAHGIDS GKELEEHYRQ
RSMLKRDVLP EDIAEAVYFL ASDMSAKSTG NMINVDAGNA QAFTR