Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1952 |
Symbol | |
ID | 5208914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2420275 |
End bp | 2421351 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640595561 |
Product | chorismate synthase |
Protein accession | YP_001276290 |
Protein GI | 148656085 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGGAA ACACATTTGG ACAGGTTTTT CGATTGACAA CCTGGGGCGA ATCGCACGGA CCCGCAGTTG GGTGCGTGGT CGATGGGTGC CCGGCAGGTA TCGAGATTTC GGAAGCGTTC ATCCAGCGCG AACTGGATCG TCGCCGGGTC GGGCAGAGCC GGGTAACATC GGCGCGTCAG GAACCCGATC AGGTGCAGAT CCTGTCGGGG GTGTTCGAGG GACGTTCGAC CGGCGCCCCC ATCAGCATGC TGGTCTTCAA TACCGATGCG AAGCCGGGGC ACTACGATAC CATCAAGCAC CTCTACCGCC CCGGTCACGC CGATTACACG TGGGACGCGA AGTATGGCTT TCGCGACTGG CGCGGCGGTG GACGGAGCAG CGCACGCGAG ACGATCGGGC GTGTCGCTGG CGGCGCGATT GCGAAACTGC TCCTTGCGCG CTACGGCATT TCGGTCATTG CGTGGACATC GCAACTCGGC GATCTGAAAG CCGAGGTTAT TGATGAGAGC GAAATCGAGC GCAACATCAT GCGCTGCCCG GATGCGCGGG TTGCCGCCCT GATGGTCGAG CGTGTCGAAC AGGCGCGGCG CAGCCTCGAC TCGCTCGGCG GCGTGGTCGA AGTGCGCGCC CGTGGCGTTC CTCCCGGTCT CGGCGAGCCG GTCTTCGACA AATTGCAGGC GGATATTGGC AAGGCAATGT TCAGCATCCC GGCGATCAAA GGGGTTGAGT TCGGCGAGGG GTTCGGTGTC GCATATATGA CCGGCTCGAC CCACAATGAC CCGTTCGTGC GCCGCGATGA TGGCACAATC GGAACCGCGT CCAACCATCA CGGCGGTATT CTCGGCGGCA TCAGCACCGG CGAAGAGATC GTGCTGCGCA TCGCTGCCAA ACCGCCAGCG TCCATCGCCC GTCCGCAACA CACGGTCGAT CGCGCCGGAA ATCCCGCTGC GATCGAAATC CACGGTCGCC ACGACCCGAC CGTGCTCCCA CGGCTGGTTC CAATCGCCGA GGCGATGCTG GCGCTGGTGC TCGCCGATCA CCTGCTGCGG CAACGCCTGG CACGGGTGGA CGCTTGA
|
Protein sequence | MPGNTFGQVF RLTTWGESHG PAVGCVVDGC PAGIEISEAF IQRELDRRRV GQSRVTSARQ EPDQVQILSG VFEGRSTGAP ISMLVFNTDA KPGHYDTIKH LYRPGHADYT WDAKYGFRDW RGGGRSSARE TIGRVAGGAI AKLLLARYGI SVIAWTSQLG DLKAEVIDES EIERNIMRCP DARVAALMVE RVEQARRSLD SLGGVVEVRA RGVPPGLGEP VFDKLQADIG KAMFSIPAIK GVEFGEGFGV AYMTGSTHND PFVRRDDGTI GTASNHHGGI LGGISTGEEI VLRIAAKPPA SIARPQHTVD RAGNPAAIEI HGRHDPTVLP RLVPIAEAML ALVLADHLLR QRLARVDA
|
| |