Gene Saro_0161 details


Gene Information

Locus tag: Saro_0161
Symbol:
ID: 3918296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 158147
End bp: 160069
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 65%
IMG OID: 640442886
Product: 1-deoxy-D-xylulose-5-phosphate synthase
Protein accession: YP_495444
Protein GI: 87198187
COG category: [H] Coenzyme transport and metabolism; [I] Lipid transport and metabolism
COG ID: [COG1154] Deoxyxylulose-5-phosphate synthase
TIGRFAM ID: [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase
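
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(158147, 160069)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.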


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAAG AACCGGCAAC GCCGTTGCTA GACACGGTCA AGACCCCAGA CGACCTTCGC 
AAGCTTGCGC CAACGCAGCT CCGCCAACTG GCGGACGAAC TGCGTGTCGA GATGATATCC
GCCGTCGGCC AGACGGGCGG GCACCTCGGT TCCGGCCTGG GCGTGGTCGA GCTGACCGTG
GCAATCCACT ATGTGTTCAA CACGCCCGAG GACAGGCTTG TGTGGGACGT GGGCCACCAG
GCCTATCCTC ACAAGATCCT GACCGGGCGG CGCGACCGGA TCCGCACCCT GCGTCAGGCA
GGCGGCCTTT CCGGCTTCAC CAAGCGCAGC GAGAGCGAGT ACGACCCTTT CGGGACGGCG
CACTCGTCCA CCTCGATTTC AGCGGCGCTC GGCTTTGCCA TCGCTAACAA GCTTTCGGGC
AGACTGGGCA AGGGCATCGC GGTGATCGGC GATGGCGCCA TGAGCGCGGG CATGGCCTAC
GAAGCGATGA ACAACGCCGA GGCCGCAGGC AATCGGCTGA TCGTCATTCT CAACGACAAC
GACATGTCCA TCGCGCCGCC CGTCGGCGGG CTTTCGGCCT ATCTGGCGCG GCTGGTTTCG
TCCGGTCCGT TTTTGGGGCT CCGCGACATT GCCCGCAGGC TTTCGCGCAA GCTGCCCCGC
CCGCTGCACG AAGCCGCGCG CAAGACCGAC GAGTTCGCTC GCGGCATGGC GATGGGCGGT
ACCCTGTTCG AGGAGCTTGG CTTCTATTAC GTCGGCCCGA TTGACGGCCA CAACATCGAC
CAGCTCATCC CGGTTCTCGA AAACGTGCGC GATGCGGCCG AAGGGCCGTG TCTGATCCAT
GTGGTGACGC AGAAGGGCAA GGGGTATGCC CCTGCCGAAG CCGCGGCCGA CAAGTATCAC
GGCGTGCAGA AGTTCGACGT CATCACTGGT GAGCAGGTGA AGGCCAAGGC TGCCGCGCCC
GCCTATCAGA ACGTGTTCGG CGAGACGCTG GCCAAGCTGG CGGACGCCGA CCCGACGATC
TGCGCGATCA CCGCCGCAAT GCCCAGCGGC ACCGGCGTCG ACAAGTTTGC CAAGGCTCAT
CCCGACCGCA CCTTCGATGT CGGCATTGCC GAACAGCATG CGGTGACCTT TGCTGCGGGC
CTCGCCGCAG AAGGGATGCG GCCGTTCTGC GCGATCTATT CGACCTTCCT GCAGCGCGCT
TTCGACCAGG TCGTCCACGA CGTGGCGATC CAGAACCTGC CGGTGCGCTT CGCCATCGAC
CGCGCAGGCC TGGTGGGCGC GGATGGTGCA ACCCACGCCG GTTCGTTCGA CGTGACCTAT
CTGGCAACGT TGCCGAACCT GGTCGTCATG GCTGCTGCCG ACGAGGCGGA ACTGGTCCAC
ATGACCTATA CCGCGGCACT GCATGACAGC GGCCCGATCG CTTTCCGCTA TCCGCGCGGA
AACGGTGTGG GCGTGCCACT GCCCGAGGTT CCCGAGCGGC TCGAGATTGG CAAGGGCCGG
ATCATCAGGC AGGGTAGCAA GGTCGCGCTG CTGTCGTTGG GTACGCGGCT GGCAGAGGCG
CTCAAGGCTG CCGATCAGCT CGACGCCAGG GGATTGTCGA CGACTGTCGC CGACCTGCGC
TTTGCCAAGC CGCTGGACGT GGCGCTGATC CGTCAGCTGA TGACCACGCA TGACGTGATC
GTGACGGTGG AGGAAGGCTC GATCGGCGGC CTGGGCGCGC ACGTCCTGAC CATGGCGAGC
GACGAGGGAC TGGTGGACGG GGGCCTCAAG ATCAGGACCA TGCGCTTGCC CGATCTGTTC
CAGGACCACG ACGCGCCTGA AAAGCAGTAT GACGAGGCGG GGCTCAACGC GCCGCATATC
GTCGATACCG TACTGAAGGC GCTGCGGCAC AACAGCGCCG GGGTAAGTGA AGCGCGGGCC
TGA
 
Protein sequence
MSQEPATPLL DTVKTPDDLR KLAPTQLRQL ADELRVEMIS AVGQTGGHLG SGLGVVELTV 
AIHYVFNTPE DRLVWDVGHQ AYPHKILTGR RDRIRTLRQA GGLSGFTKRS ESEYDPFGTA
HSSTSISAAL GFAIANKLSG RLGKGIAVIG DGAMSAGMAY EAMNNAEAAG NRLIVILNDN
DMSIAPPVGG LSAYLARLVS SGPFLGLRDI ARRLSRKLPR PLHEAARKTD EFARGMAMGG
TLFEELGFYY VGPIDGHNID QLIPVLENVR DAAEGPCLIH VVTQKGKGYA PAEAAADKYH
GVQKFDVITG EQVKAKAAAP AYQNVFGETL AKLADADPTI CAITAAMPSG TGVDKFAKAH
PDRTFDVGIA EQHAVTFAAG LAAEGMRPFC AIYSTFLQRA FDQVVHDVAI QNLPVRFAID
RAGLVGADGA THAGSFDVTY LATLPNLVVM AAADEAELVH MTYTAALHDS GPIAFRYPRG
NGVGVPLPEV PERLEIGKGR IIRQGSKVAL LSLGTRLAEA LKAADQLDAR GLSTTVADLR
FAKPLDVALI RQLMTTHDVI VTVEEGSIGG LGAHVLTMAS DEGLVDGGLK IRTMRLPDLF
QDHDAPEKQY DEAGLNAPHI VDTVLKALRH NSAGVSEARA
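
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, and the amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may serve as start codons, and this gene starts with ATG).

```python
from itertools import product

# Standard genetic code in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (60 nt, 20 codons).
first_line = "ATGAGCCAAG AACCGGCAAC GCCGTTGCTA GACACGGTCA AGACCCCAGA CGACCTTCGC"
print(translate(first_line))  # MSQEPATPLLDTVKTPDDLR
```

The output matches the first two ten-residue blocks of the protein sequence. Passing the whole gene sequence, spaces and line breaks included, to `translate` works the same way.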