Gene Information |
Locus tag | Swit_4770 |
Symbol | |
ID | 5200910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 5249324 |
End bp | 5250394 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640584326 |
Product | chorismate synthase |
Protein accession | YP_001265245 |
Protein GI | 148557663 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
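The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Swit_4770.
start_bp, end_bp = 5249324, 5250394

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) rows of the record.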
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00112305 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0417622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCTTCA ACAGCTTCGG CCGCGTCTTT CGCTTCTCGA CCTGGGGGGA GAGCCATGGG CCCGCCATCG GCGCGGTGGT CGACGGCTGT CCGCCGGGCC TCGAGCTGAG CGAGGCCGAT ATCCAGCCGT GGCTCGACAA GCGCCGGCCG GGCACCTCGC GCTTCACCAC CCAGCGGCAG GAGCCCGACC AGGTCCGCAT CCTGTCGGGC GTGTTCGAAG GGCGGACCAC CGGCACGCCG ATCAGCCTGA TGATCGACAA TGTCGACCAG CGCTCGAAGG ATTATTCGGA GGTCGCGCTC GCCTATCGGC CCGGCCATGC CGACTATGCC TATGATGCCA AATATGGCTT CCGCGACCAT CGCGGCGGCG GGCGATCCTC GGCGCGCGAG ACGGCGTCGC GGGTGGCGGC GGGCGCGGTC GCGCGGCTGG TGATTCCAGA GGTCAGGATC CGGGCCTATC TGATCGAACT GGGCGGCGAC AGGATCGATC CTGCCGCGTT CGACGATGCC GCGATCGACG AGAATCCGTT CTTCTGCCCC GACCGCGCGG CGGCGGCGCG CTGGGAAGCG ATCGTCGACG ATGCGCGCAA GGCCGGCTCC TCGGTCGGCG CCGTCGTCGA ATGCGTCGCG GAGGGCGTGC CGGCCGGCTG GGGCGCGCCG CTCTACGCCA AGCTCGACAG CGAGCTGGCG GCGGCCTGCA TGTCGATCAA TGCGGTCAAG GGCGTCGAGA TCGGCGACGG CTTCGCCGCG GCGCGGCTGA CCGGCGAGAC CAACGCCGAT CCGATGCGGC CCGGCAATGA CGGCAAGCCG GTGTTCCTCG CCAACCATGC CGGCGGGATC GCGGGCGGCA TCGCCACCGG CCAGCCGGTG GTGGTGCGGA TCGCGCTCAA GCCGACCTCG TCGATCCTGA CCCCGGTCGA GACGATCGGG CGCGACGGCA AGGCCGCCGA CATCCGCACC AAGGGACGCC ACGATCCCTG TGTCGGCATC CGCGCCGCGC CGGTGCTGGA GGCGATGGTC GCGCTGGTGC TGGCCGACCA GAAGCTGCTG CACCGGGCGC AGATCGGATG A
Protein sequence | MSFNSFGRVF RFSTWGESHG PAIGAVVDGC PPGLELSEAD IQPWLDKRRP GTSRFTTQRQ EPDQVRILSG VFEGRTTGTP ISLMIDNVDQ RSKDYSEVAL AYRPGHADYA YDAKYGFRDH RGGGRSSARE TASRVAAGAV ARLVIPEVRI RAYLIELGGD RIDPAAFDDA AIDENPFFCP DRAAAARWEA IVDDARKAGS SVGAVVECVA EGVPAGWGAP LYAKLDSELA AACMSINAVK GVEIGDGFAA ARLTGETNAD PMRPGNDGKP VFLANHAGGI AGGIATGQPV VVRIALKPTS SILTPVETIG RDGKAADIRT KGRHDPCVGI RAAPVLEAMV ALVLADQKLL HRAQIG
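The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the record does not say how its 70% figure was computed or rounded, so treat this as an illustration rather than the database's method.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence, copied from the record.
first_blocks = "ATGAGCTTCA ACAGCTTCGG"

print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.5
```

Passing the full Gene sequence field to `gc_fraction` gives the value for the whole gene, and `len(clean_sequence(...))` on that field can be compared with the Gene Length row.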