Gene Saro_3235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3235 
Symbol 
ID3917493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3456553 
End bp3457611 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID640446019 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_498504 
Protein GI87201247 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTAACC TGCTTGTCAC CGGCGGCGCC GGCTTCATCG GCGGCAATTT CGTCCACTAT 
TGGGCGCAGC AGCACCCTGA CGACACGATC GTCGTGCTCG ACTGCCTGAC TTATGCTGGC
AACCGGTCGA CCATTGCGGG GGTGGAACAG GCCGAACTCG TGGTGGGCGA CATTCGCGAC
ACCGACCTCG TCGAGAAGCT GCTGCGCGAG CGGGACATCG CGACGCTCGT CCACTTCGCC
GCAGAAAGCC ATGTCGACCG TTCGATTACC GGACCGGACG CCTTCATCGA AACCAATATC
CTTGGGACCA ACAGCCTGCT CAAGGCCGCG CGCAAGGTCT GGCTGGACGA AGGTTCGGGC
CGCGCCCACC GCTTCCACCA CATCTCGACC GACGAAGTAT ACGGGTCGCT CGGTCCCAGC
GATCCGGCCT TCTCGGAAAC CACGCAGTAC CAGCCGAACT CGCCCTATTC GGCGTCGAAG
GCCGCATCGG ACCACCTCGT GCGCGCCTAT CACCATACCT ATGGTCTGGA TGTGACGACG
ACGAACTGTT CGAACAATTA TGGGCCGTAC CATTACCCGG AAAAGCTGAT CCCGCTGTTC
ATCCTCAACG CGCTGTCGGG CAAGCCGCTG CCGATCTACG GCGACGGCAT GAACGTGCGC
GACTGGCTTT ACGTCGAGGA CCACTGCCGG GGAATCGAGG CGGCGCTGAA GAACGGCAAG
GCCGGCGAGA CCTACAACAT CGGTGGCGGC GAGGAACTGC CCAACATGGC GGTTATCGAC
CGTATCTGCG CGGAAGTGGA TCGGGCATTC GTCGAAGTCG AGGGGCTTGC GGAGCGTTAT
CCGGATGCGC CCGCCGCCAA GGGCCGGGCG ACCAGCGAAC TCAAGACCTT CGTCGAGGAC
CGCAAGGGGC ACGATCGCCG ATATGCAATC GACGAGACCA AGGCGCGTGC GGAGCTGGGC
TATGTGCCGC AGCACGACTT CGAGACAGGC CTTCGCGGCA CCCTGCGCTG GTACTTCGAC
AACGAAGCGT GGTGGCGGCC GCTCAAGGAT CGCGGCTGA
 
Protein sequence
MSNLLVTGGA GFIGGNFVHY WAQQHPDDTI VVLDCLTYAG NRSTIAGVEQ AELVVGDIRD 
TDLVEKLLRE RDIATLVHFA AESHVDRSIT GPDAFIETNI LGTNSLLKAA RKVWLDEGSG
RAHRFHHIST DEVYGSLGPS DPAFSETTQY QPNSPYSASK AASDHLVRAY HHTYGLDVTT
TNCSNNYGPY HYPEKLIPLF ILNALSGKPL PIYGDGMNVR DWLYVEDHCR GIEAALKNGK
AGETYNIGGG EELPNMAVID RICAEVDRAF VEVEGLAERY PDAPAAKGRA TSELKTFVED
RKGHDRRYAI DETKARAELG YVPQHDFETG LRGTLRWYFD NEAWWRPLKD RG