Gene Information |
Locus tag | Rcas_3830 |
Symbol | |
ID | 5541333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5004407 |
End bp | 5005483 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640895940 |
Product | chorismate synthase |
Protein accession | YP_001433886 |
Protein GI | 156743757 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
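
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 5004407
END_BP = 5005483
GENE_LENGTH_BP = 1077
PROTEIN_LENGTH_AA = 358


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced CDS whose length includes the stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: 5005483 - 5004407 + 1 = 1077 bp, and 1077 / 3 - 1 = 358 aa.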
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.929225 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCTGGAA ATACCTTTGG ACAGGTTTTT CGGTTGACTA CCTGGGGGGA GTCGCACGGA CCGGCAGTGG GGTGCGTGGT GGATGGATGC CCCGCAGGTC TCGACATCTC GGAAGACTAT ATTCAGCATG AACTGAATCG CCGACGGGTC GGGCAGAGCC GGGTGACATC GGCGCGTCAA GAATCCGACC AGGTGCAGAT TCTGTCTGGC GTCTTCGAGG GCCGCGCGAC CGGCGCGCCC ATCAGTATGC TGGTGTTCAA CACCGATGCG AAGCCGGGGC ACTACGAAAA TATCAAAGAC CTCTACCGCC CCGGTCATGC CGATTACACC TGGGATGTCA AATATGGCTT CCGCGACTGG CGTGGTGGCG GGCGTAGCAG CGCGCGCGAG ACGATAGGGC GCGTTGCCGG CGGCGCAGTT GCGAAACGCC TCCTGGCGCA GCACGGCGTA TCGATTATTG CCTGGACGGC ACAACTCGGC GATCTGAAGG CCGAGGTGAT CGACGAGAGC GAAATCGAGC GTAATATCAT GCGCTGCCCG GATGCGCGCG TCGCAGCGCT GATGGTCGAG CGGGTCGAAC AGGCGCGCCG CAGCCTTGAC TCGCTCGGTG GTATCGTCGA AGTGCGAGCG CGCGGCGTTC CCCCCGGTCT CGGCGAACCG GTCTTCGACA AACTTCAGGC GGATATTGGC AAGGCAATGT TCAGCATCCC GGCAATTAAG GGCGTTGAGT TCGGCGAAGG GTTCGGTGTA GCGCATATGA CCGGCTCTGT CCACAATGAT CCTTTCGAGC GTCGCGCCGA TGGCACAATT GGAACATCGT CCAACCACCA CGGTGGCATT CTCGGCGGGA TCAGCACCGG TGAAGAAATT GTGCTGCGCA TTGCTGCCAA GCCTCCCGCT TCGATTGCTC GACTGCAACG CACCGTTGAC CGTGAGGGAA ATCCGACGGA GATCGAAATC CACGGGCGCC ACGACCCAAC GGTGCTCCCG CGGTTGGTTC CAATCGCCGA GGCGATGCTG GCGTTGGTGC TCGCCGATCA TCTGCTGCGT CAGCGCCTGG CACGGATGGA GAGATGA
Protein sequence | MPGNTFGQVF RLTTWGESHG PAVGCVVDGC PAGLDISEDY IQHELNRRRV GQSRVTSARQ ESDQVQILSG VFEGRATGAP ISMLVFNTDA KPGHYENIKD LYRPGHADYT WDVKYGFRDW RGGGRSSARE TIGRVAGGAV AKRLLAQHGV SIIAWTAQLG DLKAEVIDES EIERNIMRCP DARVAALMVE RVEQARRSLD SLGGIVEVRA RGVPPGLGEP VFDKLQADIG KAMFSIPAIK GVEFGEGFGV AHMTGSVHND PFERRADGTI GTSSNHHGGI LGGISTGEEI VLRIAAKPPA SIARLQRTVD REGNPTEIEI HGRHDPTVLP RLVPIAEAML ALVLADHLLR QRLARMER
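
The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so the relationship between the two sequence fields can be sketched in a few lines. This is an illustrative sketch rather than the database's own pipeline: it assumes the space-separated blocks of ten are purely cosmetic, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' for stop)."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGCCTGGAA ATACCTTTGG ACAGGTTTTT"))  # MPGNTFGQVF
    # Last three blocks, which end in the TGA stop codon.
    print(translate("CAGCGCCTGG CACGGATGGA GAGATGA"))  # QRLARMER*
```

The two printed fragments match the first ten and last eight residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.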