Gene RoseRS_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1952 
Symbol 
ID5208914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2420275 
End bp2421351 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content64% 
IMG OID640595561 
Productchorismate synthase 
Protein accessionYP_001276290 
Protein GI148656085 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGAA ACACATTTGG ACAGGTTTTT CGATTGACAA CCTGGGGCGA ATCGCACGGA 
CCCGCAGTTG GGTGCGTGGT CGATGGGTGC CCGGCAGGTA TCGAGATTTC GGAAGCGTTC
ATCCAGCGCG AACTGGATCG TCGCCGGGTC GGGCAGAGCC GGGTAACATC GGCGCGTCAG
GAACCCGATC AGGTGCAGAT CCTGTCGGGG GTGTTCGAGG GACGTTCGAC CGGCGCCCCC
ATCAGCATGC TGGTCTTCAA TACCGATGCG AAGCCGGGGC ACTACGATAC CATCAAGCAC
CTCTACCGCC CCGGTCACGC CGATTACACG TGGGACGCGA AGTATGGCTT TCGCGACTGG
CGCGGCGGTG GACGGAGCAG CGCACGCGAG ACGATCGGGC GTGTCGCTGG CGGCGCGATT
GCGAAACTGC TCCTTGCGCG CTACGGCATT TCGGTCATTG CGTGGACATC GCAACTCGGC
GATCTGAAAG CCGAGGTTAT TGATGAGAGC GAAATCGAGC GCAACATCAT GCGCTGCCCG
GATGCGCGGG TTGCCGCCCT GATGGTCGAG CGTGTCGAAC AGGCGCGGCG CAGCCTCGAC
TCGCTCGGCG GCGTGGTCGA AGTGCGCGCC CGTGGCGTTC CTCCCGGTCT CGGCGAGCCG
GTCTTCGACA AATTGCAGGC GGATATTGGC AAGGCAATGT TCAGCATCCC GGCGATCAAA
GGGGTTGAGT TCGGCGAGGG GTTCGGTGTC GCATATATGA CCGGCTCGAC CCACAATGAC
CCGTTCGTGC GCCGCGATGA TGGCACAATC GGAACCGCGT CCAACCATCA CGGCGGTATT
CTCGGCGGCA TCAGCACCGG CGAAGAGATC GTGCTGCGCA TCGCTGCCAA ACCGCCAGCG
TCCATCGCCC GTCCGCAACA CACGGTCGAT CGCGCCGGAA ATCCCGCTGC GATCGAAATC
CACGGTCGCC ACGACCCGAC CGTGCTCCCA CGGCTGGTTC CAATCGCCGA GGCGATGCTG
GCGCTGGTGC TCGCCGATCA CCTGCTGCGG CAACGCCTGG CACGGGTGGA CGCTTGA
 
Protein sequence
MPGNTFGQVF RLTTWGESHG PAVGCVVDGC PAGIEISEAF IQRELDRRRV GQSRVTSARQ 
EPDQVQILSG VFEGRSTGAP ISMLVFNTDA KPGHYDTIKH LYRPGHADYT WDAKYGFRDW
RGGGRSSARE TIGRVAGGAI AKLLLARYGI SVIAWTSQLG DLKAEVIDES EIERNIMRCP
DARVAALMVE RVEQARRSLD SLGGVVEVRA RGVPPGLGEP VFDKLQADIG KAMFSIPAIK
GVEFGEGFGV AYMTGSTHND PFVRRDDGTI GTASNHHGGI LGGISTGEEI VLRIAAKPPA
SIARPQHTVD RAGNPAAIEI HGRHDPTVLP RLVPIAEAML ALVLADHLLR QRLARVDA