Gene Saro_3836 details


Gene Information

Locus tag: Saro_3836
Symbol:
ID: 5077447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 2809
End bp: 3795
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 63%
IMG OID: 640480946
Product: dihydrodipicolinate synthetase
Protein accession: YP_001165608
Protein GI: 146275447
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID:
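
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence includes its terminal stop codon (both assumptions are consistent with the values in this record). The function name cds_lengths is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS whose last
    codon is a stop codon that is not counted in the protein.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
    codons = gene_length // 3             # complete codons in the span
    protein_length = codons - 1           # drop the terminal stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_bp, protein_aa = cds_lengths(2809, 3795)
print(gene_bp, protein_aa)  # 987 328
```

With the record's coordinates this reproduces the listed 987 bp and 328 aa.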


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGCGTG AACTGCTGAC TGCTGCTGAT GTGAAGGGTG CTTGGGCGAT CGTGCCGACT 
CCAGCCAAGG AAGGTGCCTC CGACTGGCGC GCCGCTGATA CTGTTAATGT CGAAGAAGCA
GCCCGCATGA TCGACGGCCT GATCGAGGCC GGGGTCGATG GCATTCTCAG TATGGGGACG
CTGGGCGAGG CCGCAACCAT GACACTTGAC GAAAAGCTCG TCTTTATGAA GACTATCGTC
GATACCGCCG CGGGCCGGGT GCCGGTGTTT GTCGGGACAA CCTGCATCAA CACCCGCGAC
ACCATTGCGC TAACCCGCAA GGCAGTCGAT ATCGGCGCGA CCGGGACGAT GTTGGGCGTA
CCGATGTGGT GCGCCCCCAG TGTCGACGTC GCGGTACAGT TCTATCGCGA CGTTGCCGAG
GCGGTGCCCG ACATCAACAT CGCGATCTAT GCCAACCCCG AAGCCTTCAA GTTCGATTTC
CCGCGCACGT TCTGGGGCCA GGTCGCCGAA ATTCCGCAGG TTGTCACCGC CAAGTACATC
GGCGTCGGCA CCCTGCTGCC CGATCTCGCG GCGATCAAGG GGCGGATCAA GCTGCTGCCG
ATCGATTTCG ATTACTACGG CGCGGCGCGG ATGGACGATT CGATCGACGC CTTCTGGACC
AGCGGCGCGG TCTGCCACCC GCTGGTCAGC ACGACCCTGC GCGATGTGGT CGCCGCAGCG
CGCGCGAGCG GCGACTGGAG CGCTGCCAAG GCTTTCATGG GCCGGCTCGC GCCGACTGCG
GCGACGCTTT TCCCGAACGG CAGCTTCAAG GAATTCTCGA CCTACAACAT CCCCCTGGAA
AAGGCGCGGA TGACCGCCGG CGGCTGGATG AATGCCGGGC CTTGCCGTCC GCCCTATCAC
CTGTGTCCCG AAAACTATCT CGAAGGCGCG CGCAATTCAG GCAGGATGTG GGCTGAACTG
GGCAAGGCGC TCGAGGCAGA GCGTTGA
 
Protein sequence
MARELLTAAD VKGAWAIVPT PAKEGASDWR AADTVNVEEA ARMIDGLIEA GVDGILSMGT 
LGEAATMTLD EKLVFMKTIV DTAAGRVPVF VGTTCINTRD TIALTRKAVD IGATGTMLGV
PMWCAPSVDV AVQFYRDVAE AVPDINIAIY ANPEAFKFDF PRTFWGQVAE IPQVVTAKYI
GVGTLLPDLA AIKGRIKLLP IDFDYYGAAR MDDSIDAFWT SGAVCHPLVS TTLRDVVAAA
RASGDWSAAK AFMGRLAPTA ATLFPNGSFK EFSTYNIPLE KARMTAGGWM NAGPCRPPYH
LCPENYLEGA RNSGRMWAEL GKALEAER
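
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), which uses the standard amino acid assignments but allows alternative start codons such as GTG to be read as methionine at initiation. The sketch below shows one way to check the gene sequence against the protein sequence and to compute GC content. The helper names (clean, gc_percent, translate_cds) are hypothetical, and the code simply assumes the first codon is a valid start codon rather than validating it.

```python
from itertools import product

# Standard amino acid assignments (shared by NCBI translation table 11),
# listed in TCAG codon order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq):
    """Translate a CDS, reading the first codon as M.

    Assumption: the first codon is a valid start codon (e.g. ATG or GTG).
    A trailing stop codon, if present, is dropped. Codons containing
    non-ACGT characters raise KeyError.
    """
    s = clean(seq)
    codons = [s[i:i + 3] for i in range(0, len(s) - len(s) % 3, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if protein:
        protein[0] = "M"          # initiator codon is read as methionine
    if protein and protein[-1] == "*":
        protein.pop()             # terminal stop is not part of the protein
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = ("GTGGCGCGTG AACTGCTGAC TGCTGCTGAT "
              "GTGAAGGGTG CTTGGGCGAT CGTGCCGACT")
print(translate_cds(first_line))  # MARELLTAADVKGAWAIVPT
```

The printed fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to gc_percent gives a value that can be compared against the GC content field of the record.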