Gene Saro_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0939 
SymbolastD 
ID3918025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp985781 
End bp987196 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content68% 
IMG OID640443673 
Productsuccinylglutamic semialdehyde dehydrogenase 
Protein accessionYP_496218 
Protein GI87198961 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03240] succinylglutamic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCACGGG CCGAAATCGT TTCGCATGAA CCCGCAACCG GCGCCGAAGT GTGGCGCGGC 
AAGGTTGGTG ACGTCGAAGA GGTCGTGGCG CGGGCCCGCC GGGCCTGGCC TGCCTGGGCA
GCGCAACCCC TGGCCACGCG CATCGAACTC GTCAGGCGCT TCGCCAACGA AGTGCGCAAG
GACGCGGACA ATCTGGCGAC GATGATCTCG CGCGAGACGG GCAAGCCGCT ATGGGAAGCC
CGCACCGAAG TGGACAGCGT GGTCAACAAG GTCGAGATCT CGATCCGCGC CTATGCCGAC
CGCACATCCC AGCGAAAGCT CGACTCCGCA CTCCAGGGCA CCGCCGCGCT GCGGCACAAG
CCGCATGGCG TGCTGGCGGT GCTGGGGCCG TACAATTTCC CCGCGCATCT GCCCAACGGC
CACATCGTGC CCGCGCTGAT CGCCGGTAAC GCGGTGGTCT TCAAGCCTTC GGAAAAGACG
CCGGCGACGG GCGAGATGCT GGCCCAGTGC TTCCATCGCG CGGGCATTCC GGCAGCCGTG
GTGCAGGTCC TGATAGGCGG TCCGGAAGAG GGCCAGGCGC TGGTCGCGCA TGACGGCATC
GACGGCGTGC TGTTCACCGG CTCGGCCCAC GCGGGCATCG CGATCAACCG CAAGCTCGCG
TCGAACCCGG GCAAGATCGT GGCGCTGGAG ATGGGCGGCA ACAACCCCAT CGTCGTCTGG
GATACGCCCA AGATCGAGGA CGCGGCCACG CTGATCGTCC AGTCGGCCTT CACCAGCGCC
GGCCAGCGTT GCACGGCAGC ACGCCGGTTG ATCATCAAGG CCTCGATGTT CGACGAGGTG
ATCGACCACG TGAAGCGGCT TGCGGACCGC ATCATCGTCG GCGCGCCGTT CGACGATCCG
GCACCCTTCA TGGGCCCGGT GATCGACAAT CGCACCGCCG ACGGGCTGAC CGAAAGCTTC
GTCTACCTGC TGTCGTCGGG CGGGCGGCCA ATCAAGCACA TGGTCCGCCT GCAGGAAGAC
CGCCCGTTCC TTTCGCCAGC GATCATCGAC GTGACCGCCG TTGCCGACCG GCCCGACGTG
GAACTGTTCG GCCCCCTCCT GCAGGTCGTC AGGGTCGACG ATTTCGACGA GGCCATCGCC
GAGGCGAACA ACACGCGCTT CGGGCTGTCG GCGTCGCTGA TCGGCGGCGA CCCGCAGGAC
TACAACCGGT TCTGGGCGAA CATCCGCGCG GGCGTGGTCA ACTGGAACCG GCCGACCAAC
GGCGCGTCAT CGGCCGCACC GTTCGGCGGC GTCGGGTTGT CGGGCAATCA TCGGCCCAGC
GCCTATTACG CGGCGGACTA TTGCGCCTAT CCGGTCGCCT CGACCGAAGT CGATCAGCCG
CGCGCCAGCA TCGGCGTCGG CCTGCGTAGC GACTAG
 
Protein sequence
MARAEIVSHE PATGAEVWRG KVGDVEEVVA RARRAWPAWA AQPLATRIEL VRRFANEVRK 
DADNLATMIS RETGKPLWEA RTEVDSVVNK VEISIRAYAD RTSQRKLDSA LQGTAALRHK
PHGVLAVLGP YNFPAHLPNG HIVPALIAGN AVVFKPSEKT PATGEMLAQC FHRAGIPAAV
VQVLIGGPEE GQALVAHDGI DGVLFTGSAH AGIAINRKLA SNPGKIVALE MGGNNPIVVW
DTPKIEDAAT LIVQSAFTSA GQRCTAARRL IIKASMFDEV IDHVKRLADR IIVGAPFDDP
APFMGPVIDN RTADGLTESF VYLLSSGGRP IKHMVRLQED RPFLSPAIID VTAVADRPDV
ELFGPLLQVV RVDDFDEAIA EANNTRFGLS ASLIGGDPQD YNRFWANIRA GVVNWNRPTN
GASSAAPFGG VGLSGNHRPS AYYAADYCAY PVASTEVDQP RASIGVGLRS D