Gene Saro_1561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1561 
Symbol 
ID3917236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1618665 
End bp1620215 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content67% 
IMG OID640444301 
Productshort chain dehydrogenase 
Protein accessionYP_496835 
Protein GI87199578 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.636243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAGCGA AACAAAGGGA AGGGCGTGCG CGCCTCGTTC TGGTGACCGG CGCGGCCCGC 
GGCATCGGTC TGGCGTGTGC CCACCGCTTT GCCAGGGCCG GTGATCGCGT GGTCATGGCC
GACCGCGACT TGGCGGCGTG CACTTTCGAG GCTGAAAGGC TCGGCTCGCG GCATGTCGCG
CTGCAACTGG ACGTTTCGGA CGAAGCGGCG GTCGAGCACG CGATGGACGG TCTTCTGCAG
CAGTTCGGCG CGTTCGATGT CGTGATCAAC AACGCCGGGG TGGTGGATCG CTTTGCCCGG
CCGCTTCTTG ACGTACCGCC GGAGGATATC GACCGGCTGA TAGGCGTCAA TCTCGAAGGT
CCCTATCTGG TTGTGCGCGC TGCGCTGCGG ACGATCCTTG CCGGACGGCG TGGCGCGGCA
ATCGTCAACG TCGCATCGGG CGCGGCACTG CGCGCGCTGC CGGGCCGTGC CGCCTACAGC
ATGACAAAGG CGGGCGTCAT CGGCATGACC CGCGCGATGG CGATAGAGCT TGGCCCCCAG
GGTATCGCCG TCAACGCCGT GCTGCCGGGA TACATCGACA CCGAAATTCT CCTTGCTCTG
GAGCGGGAGG GCAAGTTCGA CCGCGCCGCC GCTGCCGGCG CAATACCGAT GGGAAGGCTC
GGCCGGACAG ATGAGATTGC CGAGGCGGTC CATTACCTCG CGCGCGGGGG TTATCATTGC
GGCAGCCTGC TTTCGGTCGA TGGTGGGGTC GATGCGTATG GCGGTTCGGG CAAGGCCTCC
ACCGCCGTCA TGCCGCACCG CCCGGTGCGC GCGGGCGACG TCGCTTGCGT GACCGGCGGG
GCGAGCGGCA TCGGCGCCGT TGTGGCAGAC CGGCTTGCCG GGCTCGGCTG GCTCGTGGCG
ATAATCGACA GCCGGGAAAT CGCGGACGGA CCACACCCTG CGTGGCAGGC CGACATCGCC
AGCGAAGCCT CGGTCGAGAG CGCGATGGCA GGCATCGCTG GCCAGCTCGG CCCGGTGACG
CTGCTGGTCA ACAATGCCGG TATCGTCGAA CCCATGGCGA AGTCTGCCGA CCAGGCGCTT
GCCGACTTCC GCCGCACGAT CGACGTGAAC GTGAAGGGCA CTATCCATGC ATCGCGCGCG
GCTGCGCGGC AGATGATCGG CGCGGGCGGT GGGGCCATTG TCAATCTTTC CTCCATCACG
GCATCGCTCG GTTTGCCGGG GCGCAATGCC TATTGCGCGT CGAAATCTGC CGTCACCATG
CTCACCCGCA GTCTCGCCTG CGAATGGGCC GCGCATGGCA TCCGGGTGAA TGCGGTCGCG
CCAGGATACA TCCTGACCCC CGCAGTGCAG GCCTTGCTGG CTTCGGGAGA GCGCGACATG
AACTCCGTCG TCCGGCGCAT ACCGGTGGCG CGCCTTGGTC AGCCTGACGA AGTGGCGGAC
GCCATCGCGT TTCTGGCCTC GGATGCGGCA TCCTATGTTA CCGGCGCCAC GCTTCAGGTG
GATGGCGGCT ATCTTGCCAG CGGGCATCCG CCCGATGGAC CGATGCCCTG A
 
Protein sequence
MEAKQREGRA RLVLVTGAAR GIGLACAHRF ARAGDRVVMA DRDLAACTFE AERLGSRHVA 
LQLDVSDEAA VEHAMDGLLQ QFGAFDVVIN NAGVVDRFAR PLLDVPPEDI DRLIGVNLEG
PYLVVRAALR TILAGRRGAA IVNVASGAAL RALPGRAAYS MTKAGVIGMT RAMAIELGPQ
GIAVNAVLPG YIDTEILLAL EREGKFDRAA AAGAIPMGRL GRTDEIAEAV HYLARGGYHC
GSLLSVDGGV DAYGGSGKAS TAVMPHRPVR AGDVACVTGG ASGIGAVVAD RLAGLGWLVA
IIDSREIADG PHPAWQADIA SEASVESAMA GIAGQLGPVT LLVNNAGIVE PMAKSADQAL
ADFRRTIDVN VKGTIHASRA AARQMIGAGG GAIVNLSSIT ASLGLPGRNA YCASKSAVTM
LTRSLACEWA AHGIRVNAVA PGYILTPAVQ ALLASGERDM NSVVRRIPVA RLGQPDEVAD
AIAFLASDAA SYVTGATLQV DGGYLASGHP PDGPMP