Gene Saro_1595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1595 
Symbol 
ID3918703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1655345 
End bp1658650 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content68% 
IMG OID640444335 
Productpolysaccharide deacetylase 
Protein accessionYP_496869 
Protein GI87199612 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGCC CGATCTTCTA CGACCCGACC GGTCGCCGCA GCCTTCTCGC CAAGCGTTCG 
CTGGCAGTCG TCTTCCTTGC CGTGATCCTT GCGGCGCTGG CCTTCGCGAC GACACTCGTC
GCGGTGCCGC GACAGCGCGC GCTGTCGCTG CCGCAGCCCC ATTTCGAAGC CGCCAGCCTG
CACCGGGGCC ATGGGCTGTC CCGCCTCCGC TCCTGGCTGC CCCGGCTGCA CCTGCAACGC
AGCCAGGAAG CCCTGGCGAT CGGCTTCTAC CAGCCTCGCA GCGATGCCAG CTTCCAGTCG
CTTTCGCGCC ACGGCAACCA GCTCGACTGG GTCGTGCCCG CGCTGCTCAA CGTCGCGGGG
CCGGAACACC GCCTTGCCGT GTCCAATGAC CCGCGCTTCG ACAGCTTCGT CGCGGCAGGG
CATCACCACT ACAAGGTCTT GCCGATGGTC CAGAACGTCG GCGGCGACGG CTGGGACGGC
GCGAACGTGG CCCGCATTCT GGAAAACGCC ACCGAGCGCC ATGCGCTCGA ACGCCAGTTG
GCCGACTACG TGGCGCGACG CGGCGACGCA GGCCTTGTGC TCGACTTCGA GGACCTGCCA
GCCACGGCCA TGCGCCCCTA TGTCGCGTTT CTGCGCGAAA CGAACGCCCT CCTGCCCGCC
GGAGCCCAGC TTGCGGTGAC AGCCCCCGCG GGCGAGGACG ACTGGCCGAT TGCCGCTCTT
GGCCGTGCCG CCGACAAGGT CATCCTCATG GCCTACGACG AACACTGGCA GAATGGTGCT
GCTGGTCCGA TCGCATCGCA AACGTGGTTC ACGAACGAGG TCGAGAAGTC CGCCCGTGCC
CTGCCCGCCG ATCGCCTGGT CATCGCGCTT GGCAGCTATG GCTACGACTG GCACCAGGGC
GGCACCGATG CGCTGTCTCT CAGCGACGCC TGGCTCGCCG CTCATGACAG CCACGCTCCG
ATCATCTTCG ACAAGGCCAG CGGCAACGCG GGCTTCGCTT ACGAACAGGA CGGCGTATCC
CACACCGTCT GGATGCTCGA TGCCGCCACC GACTGGAACG AGATGCTGGC GGTCCAACGC
CTCGGCATCG GCAATCTAGC GCTCTGGCGC CTCGGCACTG AGGACCCGGG CTTCTGGAAT
GACCTCGCGG CCCTGCGCTC GGGCAAGGCG CCCGATCTGT CGCGCCTCGT CTCCGAAGGC
GGCAACGACG TCGAAGGCAC CGGCGAGATC CTGCGCATCA CCGCCACCCC CGTCGCGGGC
AGCAGGACCG TCGCCTTCGG GGACCAGCAC CTGATCCGCG ACGAACGCTA TGGCGACTTG
CCCACGCCCT TCGTCGTGCG CCGTACCGGG GCCGCCAACC CCAAACTCCT CGCCCTCACC
TTCGATGACG GTCCCGATCC GAAGTGGACC CCCAAGGTCC TTTCCACGCT CGAAGCCGCC
CATGTGCCCG CGACCTTCTT CGTCATCGGC GAAAACGCGC TTGTCCGACC CGACCTTCTT
CGCCGCATGG TCGCCGACGG CGACGAGATC GGCAACCATA CCTACACCCA CCCCAACCTG
GCAGGTTCAT CGGAAGCCGA GACCCGGCTC GAAATCAACG CCACCCAGCG CCTCGTCCAG
GCCTACACCG GCCGCAGCAT GACCCTTTTC CGCGCACCCT ACTTCGGCGA CGCGGAGCCG
ACGACCGCAG ACGAACTACA GCCGGCCCTT CGCGCACAGC AGGCCGGCTA TACCATCGTC
GGCCTTCATG TGGACCCGAA CGACTGGCAG AATCCCGGTA CGGACGAAAT CGTCCGCCAA
GTCGTCAACC AGATCCACAA CGCGACGGAA CAATCCAGCA GCAACGTCAT CCTGCTACAC
GACGGCGGCG GCGACCGCTC GCAGACCGTC GCCGCCCTCC CGCGTATCGT CGCGGTGCTG
CGGGCCGAAG GCTACAGCTT CGTCACCGCG TCGCAACTCG TCGGCATCCC GCCGCAAGCA
GCCATGCCCC GGCTGTCGTC ATCCGATCTC GCCGCGGTGC GGGTCGACGT CGCCGCCTTC
CTCGCGCTCG ACGGCCTGGT CACGCTGTTC TCGTGGCTGT TCTTCGTCGC GATCGCGCTC
GGTATTGCCA GGGCGGTCAT CATGGCTGGC CTGGCTTGGT GGCAAAGCCG CTCGCCCCGC
GCCGAGCCTC CCGCGTTCGA ACCCACCGTC TCTGTCATAA TCCCCGCCTG GAACGAGGAA
CGCGTGATCG CCGCGTCGGT CGAGCGCGTG CTCGCCAGCG ATTACCCCGC CCTTCAGGTG
ATCGTCGCCG ATGACGGGTC GAAGGACGCC ACCAGCGCGG TTGTCGCCCG CCATTTTGGT
CATGACCCGC GCGTCACGCT CCTCACGCTT GCGAACGGAG GCAAGGCGGC GGCGCTGAAC
CGTGCCCTGC GCCACGCCAC CGGCGAGGTC GTGATCGCGC TCGATGCCGA TACGCAGTTC
GAACCGCTGA CCATTCGCCG CCTTGCGCGC TGGTTTGCCG ACCCCCGGAT CGGCGCGGTC
GCAGGCGATG CCCGCGTCGG CAACCGGGTG AACCTCGTCA CGCGCTGGCA GGCGGTGGAA
TACATCACCG CCCAGAACCT CGAACGCCGC GCGCTTGCGG GCTTCGACGC CATGACCGTC
GTGCCCGGTG CCGTGGGTGC ATGGCGTCGC GCGGCGCTCG ACAGCGTGGG CGGCTATCCC
GAAAACACGC TGGCCGAAGA CCAGGATCTC ACCATCGCCA TCCAGCGCAA GGGCTGGCGC
GTGACATACG ACCCCCGTGC CGTCGCCTGG ACAGAGGCGC CCCAGACCTT CCGCGCGCTC
GCTCGCCAGC GCTATCGCTG GGCCTTCGGC ACGCTCCAGT GCCTGTGGAA GCATCGCAAG
GTCATCACTT CCCGCAAGCC CGCCGGTCTT GGCCTTGTCG GTCTCCCGCA GGCCTGGCTG
TTCCAGATCG CCTTCGCCGC GATCTCGCCG CTGATCGACG GTGCCTTGAT CGCCTCGATC
ATCTCCACGG TCGTCAGGGT GGTCCAGCAT GGCTGGGCCC AGACCCAAGG CGATCTTGGT
CGCATGGCGA TCTACTGGAG CCTCTTCACC GCCATCGACG TCATCTGCGG CTGGATTGCC
TACCGCCTCG ACGACAAGCG GCCACCTTAT CCGGCGCACC TGCTGGTCGC CCAGCGCATC
GTCTACCGCC AGATCATGTA CTGGGTTGTC CTGCGAGCCC TTGCCTCCGC CATCGGAGGC
TGGATCGTCG GTTGGGGAAA GCTCGAGCGT ACCGGGACCG TCAGTCCCGC CCCGGAACGC
GCTTGA
 
Protein sequence
MTRPIFYDPT GRRSLLAKRS LAVVFLAVIL AALAFATTLV AVPRQRALSL PQPHFEAASL 
HRGHGLSRLR SWLPRLHLQR SQEALAIGFY QPRSDASFQS LSRHGNQLDW VVPALLNVAG
PEHRLAVSND PRFDSFVAAG HHHYKVLPMV QNVGGDGWDG ANVARILENA TERHALERQL
ADYVARRGDA GLVLDFEDLP ATAMRPYVAF LRETNALLPA GAQLAVTAPA GEDDWPIAAL
GRAADKVILM AYDEHWQNGA AGPIASQTWF TNEVEKSARA LPADRLVIAL GSYGYDWHQG
GTDALSLSDA WLAAHDSHAP IIFDKASGNA GFAYEQDGVS HTVWMLDAAT DWNEMLAVQR
LGIGNLALWR LGTEDPGFWN DLAALRSGKA PDLSRLVSEG GNDVEGTGEI LRITATPVAG
SRTVAFGDQH LIRDERYGDL PTPFVVRRTG AANPKLLALT FDDGPDPKWT PKVLSTLEAA
HVPATFFVIG ENALVRPDLL RRMVADGDEI GNHTYTHPNL AGSSEAETRL EINATQRLVQ
AYTGRSMTLF RAPYFGDAEP TTADELQPAL RAQQAGYTIV GLHVDPNDWQ NPGTDEIVRQ
VVNQIHNATE QSSSNVILLH DGGGDRSQTV AALPRIVAVL RAEGYSFVTA SQLVGIPPQA
AMPRLSSSDL AAVRVDVAAF LALDGLVTLF SWLFFVAIAL GIARAVIMAG LAWWQSRSPR
AEPPAFEPTV SVIIPAWNEE RVIAASVERV LASDYPALQV IVADDGSKDA TSAVVARHFG
HDPRVTLLTL ANGGKAAALN RALRHATGEV VIALDADTQF EPLTIRRLAR WFADPRIGAV
AGDARVGNRV NLVTRWQAVE YITAQNLERR ALAGFDAMTV VPGAVGAWRR AALDSVGGYP
ENTLAEDQDL TIAIQRKGWR VTYDPRAVAW TEAPQTFRAL ARQRYRWAFG TLQCLWKHRK
VITSRKPAGL GLVGLPQAWL FQIAFAAISP LIDGALIASI ISTVVRVVQH GWAQTQGDLG
RMAIYWSLFT AIDVICGWIA YRLDDKRPPY PAHLLVAQRI VYRQIMYWVV LRALASAIGG
WIVGWGKLER TGTVSPAPER A