Gene Saro_1326 details

Gene Information

Locus tag: Saro_1326
Symbol:
ID: 3917775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1369010
End bp: 1370335
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 70%
IMG OID: 640444063
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_496604
Protein GI: 87199347
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
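
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1369010, 1370335)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```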


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCCTC GCCGCTTCAC CGCCAATGGC CCGCTCAAGG GCCGCATCGG CGTGCCCGGC 
GACAAGTCGA TCAGCCACCG CTCGATCATG CTCGGCGCGC TGGCAGTGGG CGAGACGCGC
GTGACCGGCC TGCTCGAAGG CGAGGACGTC CTTTCCACCG CCGCCGCGAT GCGCGCGATG
GGCGCGACGA TCGAACGCGA CGCGGACGGC ATGTGGCACG TTCACGGCGT TGGCGTGGGC
GGCCTGCTCC AGCCGCAACA GGCGCTGGAC ATGGGCAATT CGGGCACGTC GACCCGCCTG
CTGATGGGCC TTGTCGCAAC CCACCCGATC ACGGCGACGT TCGTGGGCGA TGCTTCGCTG
TCGAAGCGCC CGATGGGCCG CGTGATCGAT CCGCTCTCGA CGATGGGCGC CGAGTTCACC
GCATCGCCGG GTGGCCGCCT GCCCCTTACC CTGCGCGGAA TTTCACCTGC CGTGCCAATC
GAATACCGCC TCCCCGTCGC ATCGGCGCAG GTGAAGAGCG CGGTCCTGCT CGCGGGCCTC
AACACGCCCG GCGTGACCAC GGTAATCGAA CCGATCCCCA CCCGCGACCA TTCCGAACGC
ATGCTGCGCG GCTTCGGCGC GGAGCTGACC GTCGATGTCG CCGCCGATGG CGCGCGCGTC
ATCAGGGTGC GCGGCGAGGC CGAACTCAAG CCGCAGGACA TCGCCGTCCC CGGCGACCCG
TCATCCGCCG CGTTCTTCGT GGTGGCGGCG CTGCTGGTCG AAGGCTCGGA CCTCGTCGTC
GAGAACGTCG GCCTCAACCC CACCCGCGCC GCGCTGTTCG ACGTGCTGCG CCTGATGGGC
GGCTCCATCG AGGAGCTGAA CCGGCGCGAA GTGGGCGGCG AACCGGTGGC GGACCTGCGC
GTGCGCCACT CGCTGCTGAC CGGCATCGAT GTCGATCCCG CCGTAGTGCC GAGCATGGTC
GACGAATTCC CGGTGCTGTT CGTCGCCGCC GCCCTTGCCA AGGGCCGCAC GGTGACGACC
GGCCTCGAGG AACTGCGCGT GAAGGAAAGC GACCGCATCA GCGCGATGCG CGCCGCGCTC
GAACTGGCAG GCGCGACCGT CACCGAGACC GAGGACGGCC TGATCATCGA CGGCACCGGC
GGCGACCCCC TGCCCGGCAC CGCAGAGGGC GCGAGCGTCG TCACGCACCT CGACCACCGC
ATCGCGATGA GCATGGCGAT TGCCGGCATC GCCAGCCGCA ACGGCGTGGA AGTGGATGAC
ACCCGCCCCA TCGCCACCAG CTTCCCGGTG TTCGAGAGCC TGCTGGAAAG CGCGACCAGG
CCGTGA
 
Protein sequence
MRPRRFTANG PLKGRIGVPG DKSISHRSIM LGALAVGETR VTGLLEGEDV LSTAAAMRAM 
GATIERDADG MWHVHGVGVG GLLQPQQALD MGNSGTSTRL LMGLVATHPI TATFVGDASL
SKRPMGRVID PLSTMGAEFT ASPGGRLPLT LRGISPAVPI EYRLPVASAQ VKSAVLLAGL
NTPGVTTVIE PIPTRDHSER MLRGFGAELT VDVAADGARV IRVRGEAELK PQDIAVPGDP
SSAAFFVVAA LLVEGSDLVV ENVGLNPTRA ALFDVLRLMG GSIEELNRRE VGGEPVADLR
VRHSLLTGID VDPAVVPSMV DEFPVLFVAA ALAKGRTVTT GLEELRVKES DRISAMRAAL
ELAGATVTET EDGLIIDGTG GDPLPGTAEG ASVVTHLDHR IAMSMAIAGI ASRNGVEVDD
TRPIATSFPV FESLLESATR P
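
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs mainly in the start codons it permits, and this sketch does not handle alternative start codons). All function and variable names are illustrative.

```python
# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(s.count(base) for base in "GC") / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGCCCTC GCCGCTTCAC CGCCAATGGC CCGCTCAAGG GCCGCATCGG CGTGCCCGGC"
print(translate(first_line))  # MRPRRFTANGPLKGRIGVPG
print(gc_percent(first_line))
```

The translated fragment matches the first two blocks of the protein sequence. Passing the full gene sequence to the same functions is one way to cross-check the listed protein length and GC content.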