Gene Saro_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1895 
Symbol 
ID3917116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2007263 
End bp2009083 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content66% 
IMG OID640444639 
Productphosphogluconate dehydratase 
Protein accessionYP_497169 
Protein GI87199912 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.488647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATGA ACCCAACTGT CGAGCGCGTC ACCCAGCGCA TCATCGAGCG TTCGCGCAGC 
ACCCGCGCCG CATATCTCGA TCTTGTGGAA CGTTCGCGCG ACCAGGGGCT CAACCGGCCC
AGGCTTTCGT GCGGCAACCT GGCCCACGGC TTTGCCGCAT CGGGCGAGGA CAAGCCGGCG
ATCAAGTCGG GCAAGGCGAT GAACATCGGC ATCGTCACTG CCTACAACGA TATGCTTTCG
GCGCATCAGC CGTATGGCCG CTATCCCGAG CAGATCAAGA TATTCGCCCG GGAAGTGGGC
GCGACGGCGC AGGTTGCAGG CGGCGTTCCA GCGATGTGCG ATGGCGTCAC GCAGGGCCAG
GATTCGATGG AGCTGTCGCT CTTCAGCCGC GACGTGATCG CAATGGCGAC GACAGTCGGG
CTAAGCCACG CCATGTTCGA AGGCGCGCTC CTGCTCGGAA TCTGCGACAA GATCGTGCCC
GGGCTGCTGA TCGGCAGCCT GCGCTTTGGC CACCTGCCGA CCATTCTCGT GCCGGCGGGC
CCCATGCCGA CAGGCCTTCC CAACAAGGAG AAGGTCCGTA TCCGCCAGCT ATATGCAGAG
GGCAAGGTCG GTCGCGACGA ACTGCTCGAA AGCGAGAGCG CCAGCTATCA CTCCGCCGGC
ACGTGCACCT TCTATGGCAC GGCCAACTCC AACCAGATGA TGATGGAGAT GATGGGGCTG
CACATGCCCG GCTCCAGCTT CGTCCTGCCC GGCACGAAGA TCCGTCAGGA ACTGACGCGG
GCGGCGACGC ATCGCATCGC CCAGATTGGT TGGGATGGCG ACGATTATCG TCCTCTCGGC
AGGTGCGTCG ACGAGAAGGC CATCGTCAAT GCGATCGTCG GCCTGCTGGC AACAGGTGGC
TCGACCAACC ACGTGATCCA CCTGCCGGCC ATCGCGCGGG CCGCCGGCAT CCAGATAGAC
TGGAACGACA TGGACGACCT GTCGCGCGTC GTCCCGCTTA TCGCCAGCGT CTATCCCAAT
GGCGCGGGCG ACGTGAACTA CTTCGCAGCG GCGGGCGGCA TGCCCTATGT GATCCGCGAG
CTGATCGGGT CCGGCCTTGC CCATCCGGAT ATCCTGACGG TCTACGGCCA GTCGCTGGAG
GAAGGCGCCC AGCAGCCTGT CATGGAAGGC GACAACCTGC GCTGGGATCC GGCGCCCGAG
GTTTCGGGAG ACGACAGCAT GCTGCGCCCT GTTTCGGCGC CGTTCCAGCC CGAAGGCGGC
TTCAGGTTGC TGAAAGGCAA CCTCGGCCGG GGTACGATCA AGGTCAGCGC GGTCGATCCC
TCACGCTGGA CGATCGAGGC GCCTTGCCGG GTGTTCGAGG ACCAAAATGC CGTGCTCGAC
GCGTTCAAGG CCGGCGAACT GGAGCGTGAC GTCATCGTTG TCGTACGCTT CCAGGGGCCT
GCCGCAAACG GCATGCCCGA ACTGCACAAG CTGACCCCGC CGCTTGGCGT CCTGCAGGAT
CGCGGGTTCA AGGTCGCGCT CGTCACCGAT GGCCGTATGT CGGGCGCTTC GGGCAAGGTG
CCTGCCGCAA TCCATGTCTC GCCCGAAGCC AAGCTCGGTG GCCCGCTGGC AAGGCTGCGC
GACGGCGACG TGGTGCGGGT ATGCGCCAAC AGCGGCGAGC TTGTCGCGGT CGTGCCCGCC
GAGGAGTGGA GCGCGCGCGA GGAAGCAGTT GCCCCGGCTA GTGCTCCCGG CGTAGGCCGC
GAACTCTTCG CGCTCATGCG GCAGCATTCC GATCCCGCCG AGCGCGGCGG ATCGGCGATG
CTCGCGGCGG CGGGGCTCTG A
 
Protein sequence
MAMNPTVERV TQRIIERSRS TRAAYLDLVE RSRDQGLNRP RLSCGNLAHG FAASGEDKPA 
IKSGKAMNIG IVTAYNDMLS AHQPYGRYPE QIKIFAREVG ATAQVAGGVP AMCDGVTQGQ
DSMELSLFSR DVIAMATTVG LSHAMFEGAL LLGICDKIVP GLLIGSLRFG HLPTILVPAG
PMPTGLPNKE KVRIRQLYAE GKVGRDELLE SESASYHSAG TCTFYGTANS NQMMMEMMGL
HMPGSSFVLP GTKIRQELTR AATHRIAQIG WDGDDYRPLG RCVDEKAIVN AIVGLLATGG
STNHVIHLPA IARAAGIQID WNDMDDLSRV VPLIASVYPN GAGDVNYFAA AGGMPYVIRE
LIGSGLAHPD ILTVYGQSLE EGAQQPVMEG DNLRWDPAPE VSGDDSMLRP VSAPFQPEGG
FRLLKGNLGR GTIKVSAVDP SRWTIEAPCR VFEDQNAVLD AFKAGELERD VIVVVRFQGP
AANGMPELHK LTPPLGVLQD RGFKVALVTD GRMSGASGKV PAAIHVSPEA KLGGPLARLR
DGDVVRVCAN SGELVAVVPA EEWSAREEAV APASAPGVGR ELFALMRQHS DPAERGGSAM
LAAAGL