Gene Information |
Locus tag | Saro_0009 |
Symbol | |
ID | 3916051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 7692 |
End bp | 8762 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640442734 |
Product | chorismate synthase |
Protein accession | YP_495292 |
Protein GI | 87198035 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCA ACAGCTTCGG TCACATCTTC CGTTTCACAA CCTGGGGCGA GAGCCACGGG CCGGCGCTTG GCGCCGTGGT CGACGGTTGC CCTCCCGGCC TTGCGCTGAC CGAAGCGCAG ATCCAGCCTT TTCTCGACGC CCGGCGCCCT GGCCAGTCGC GCTTCACCAC GCAGCGGCAG GAGCCGGATC AGGTGCGCAT CCTGTCCGGC GTGTTCGAAG GCCGCACCAC CGGCACTCCG ATCAGCCTGA TGATCGAGAA CGTCGACCAG CGTTCGAAGG ACTATGGCGA TGTCGCCAAG GCCTATCGCC CCGGCCATGC CGACTATGCC TATGACGCCA AGTACGGCTT TCGCGACTAT CGCGGCGGCG GGCGTTCCTC GGCGCGGGAA ACGGCTGCGC GCGTAGCCGC GGGCGCCGTT GCCCGCCTCG TGATCCCGGA AGTCTCGATC CTGGCCTGGG TCAGCGAGAT CGGCGGTGAC CGCATCGACA TGGACCATTT CGATGCGGCA GAGATCGCCC GCAACCCGTT CTTCTGCCCG GATTCCTGGG CCGCGGCCCG TTGGGAGAAG CTGGTCGACG ATGCCCGCAA GTCTGGCTCC TCGCTCGGCG CGGTGGTCGA ATGCGTCGCA ACCGGCGTAC CGGCCGGCTG GGGCGCGCCG CTCTACGCCA AGCTCGATGC CGAACTGGCC CATGCGATGA TGGGCATCAA CGCGGTCAAG GGCGTTGAGA TCGGCGATGG CTTTGCCGCC GCGCGCAATA CCGGCGAAGG CAATGCCGAT CCGATGCGGC CGGGCGCTGG CGTTCCGGAA TTCCTTGCCA ACCATGCCGG CGGCATCGCG GGCGGCATAT CCACCGGCCA GCCGGTGACG GTTCGCGTGG CGTTCAAGCC GACTTCGTCG ATTCTCACGC CGATGCCCAC GATCACGCGC GAGGGCGAGG CGACCGAGTT GCTGACCAAA GGCCGCCACG ATCCCTGCGT GGGCATTCGC GGCGTGCCCG TGGTCGAGGC GATGATGGCG CTCGTCCTGG CGGACCAGAA ACTGCTTCAC CGCGGCCAGT GCGGCGGTTG A
|
Protein sequence | MSLNSFGHIF RFTTWGESHG PALGAVVDGC PPGLALTEAQ IQPFLDARRP GQSRFTTQRQ EPDQVRILSG VFEGRTTGTP ISLMIENVDQ RSKDYGDVAK AYRPGHADYA YDAKYGFRDY RGGGRSSARE TAARVAAGAV ARLVIPEVSI LAWVSEIGGD RIDMDHFDAA EIARNPFFCP DSWAAARWEK LVDDARKSGS SLGAVVECVA TGVPAGWGAP LYAKLDAELA HAMMGINAVK GVEIGDGFAA ARNTGEGNAD PMRPGAGVPE FLANHAGGIA GGISTGQPVT VRVAFKPTSS ILTPMPTITR EGEATELLTK GRHDPCVGIR GVPVVEAMMA LVLADQKLLH RGQCGG
|
| |
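The length and composition fields in this record can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence is an unspliced CDS ending in a stop codon (the record lists "Is gene spliced: No", and the displayed sequence ends in TGA). It does not describe how the database computes its own values.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is ignored because the record prints the sequence
    in space-separated blocks of 10 bases.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(7692, 8762)
    print(length)                  # compare with the "Gene Length" field
    print(protein_length(length))  # compare with the "Protein Length" field
```

With the coordinates above, the two results match the listed Gene Length (1071 bp) and Protein Length (356 aa). Passing the full "Gene sequence" string to `gc_percent` gives a value to compare with the listed GC content.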