Gene Saro_0009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0009 
Symbol 
ID3916051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp7692 
End bp8762 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content68% 
IMG OID640442734 
Productchorismate synthase 
Protein accessionYP_495292 
Protein GI87198035 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTCA ACAGCTTCGG TCACATCTTC CGTTTCACAA CCTGGGGCGA GAGCCACGGG 
CCGGCGCTTG GCGCCGTGGT CGACGGTTGC CCTCCCGGCC TTGCGCTGAC CGAAGCGCAG
ATCCAGCCTT TTCTCGACGC CCGGCGCCCT GGCCAGTCGC GCTTCACCAC GCAGCGGCAG
GAGCCGGATC AGGTGCGCAT CCTGTCCGGC GTGTTCGAAG GCCGCACCAC CGGCACTCCG
ATCAGCCTGA TGATCGAGAA CGTCGACCAG CGTTCGAAGG ACTATGGCGA TGTCGCCAAG
GCCTATCGCC CCGGCCATGC CGACTATGCC TATGACGCCA AGTACGGCTT TCGCGACTAT
CGCGGCGGCG GGCGTTCCTC GGCGCGGGAA ACGGCTGCGC GCGTAGCCGC GGGCGCCGTT
GCCCGCCTCG TGATCCCGGA AGTCTCGATC CTGGCCTGGG TCAGCGAGAT CGGCGGTGAC
CGCATCGACA TGGACCATTT CGATGCGGCA GAGATCGCCC GCAACCCGTT CTTCTGCCCG
GATTCCTGGG CCGCGGCCCG TTGGGAGAAG CTGGTCGACG ATGCCCGCAA GTCTGGCTCC
TCGCTCGGCG CGGTGGTCGA ATGCGTCGCA ACCGGCGTAC CGGCCGGCTG GGGCGCGCCG
CTCTACGCCA AGCTCGATGC CGAACTGGCC CATGCGATGA TGGGCATCAA CGCGGTCAAG
GGCGTTGAGA TCGGCGATGG CTTTGCCGCC GCGCGCAATA CCGGCGAAGG CAATGCCGAT
CCGATGCGGC CGGGCGCTGG CGTTCCGGAA TTCCTTGCCA ACCATGCCGG CGGCATCGCG
GGCGGCATAT CCACCGGCCA GCCGGTGACG GTTCGCGTGG CGTTCAAGCC GACTTCGTCG
ATTCTCACGC CGATGCCCAC GATCACGCGC GAGGGCGAGG CGACCGAGTT GCTGACCAAA
GGCCGCCACG ATCCCTGCGT GGGCATTCGC GGCGTGCCCG TGGTCGAGGC GATGATGGCG
CTCGTCCTGG CGGACCAGAA ACTGCTTCAC CGCGGCCAGT GCGGCGGTTG A
 
Protein sequence
MSLNSFGHIF RFTTWGESHG PALGAVVDGC PPGLALTEAQ IQPFLDARRP GQSRFTTQRQ 
EPDQVRILSG VFEGRTTGTP ISLMIENVDQ RSKDYGDVAK AYRPGHADYA YDAKYGFRDY
RGGGRSSARE TAARVAAGAV ARLVIPEVSI LAWVSEIGGD RIDMDHFDAA EIARNPFFCP
DSWAAARWEK LVDDARKSGS SLGAVVECVA TGVPAGWGAP LYAKLDAELA HAMMGINAVK
GVEIGDGFAA ARNTGEGNAD PMRPGAGVPE FLANHAGGIA GGISTGQPVT VRVAFKPTSS
ILTPMPTITR EGEATELLTK GRHDPCVGIR GVPVVEAMMA LVLADQKLLH RGQCGG