Gene RSP_1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_1389 
SymbolaroC 
ID3720796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp3167682 
End bp3168782 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content71% 
IMG OID640072616 
Productchorismate synthase 
Protein accessionYP_354470 
Protein GI77464966 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTACA ACACTTTCGG CCACATCTTC CGCGTCACCA CCTGGGGCGA GAGCCATGGG 
CCCGCGCTCG GCGCGACGGT GGATGGCTGC CCGCCCGGCG TCGCGATCGA GGCCGAGGCG
ATCCAGCACT GGCTCGACCG CCGGAAGCCC GGCCAGAACC GCTTCACCAC CCAGCGGCAG
GAGCCGGATG CGGTCAGGAT ACTGTCGGGC ACCTTCGAGG GCCGCTCGAC CGGCACGCCG
ATCCAGCTCA TGATCGAGAA CACCGACCAG CGGTCGAAGG ACTATGGCGA GATCGCCCGG
AGCTTCCGGC CGGGTCATGC CGACATCGCC TATCACTGGA AATACGGGCT GCGCGACTAT
CGCGGGGGCG GGCGCTCCTC GGCGCGCGAG ACGGCGGCGC GGGTCGCGGC GGGCGGTGTC
GCCCGGGCGG CGCTGGCGGC CTTGGTTCCC GGCCTGCGGA TCGAGGGCTA CATGGTCCAG
ATCGGGCCGC ATGCTATCGA CCGCGCCCGG TTCGACGCGG ACGAGATCGA GCGCAACCCC
TTCTGGTGCC CCGATCCCGA TACGGCCGCG CTCTGGGCCG ACTATCTCGA CGGACTGCGC
AAGGCGCACG ATTCGGTGGG CGCCATCGTC GAGGTGCGGG CCTCGGGCGT GCCGGCAGGG
CTCGGCGCGC CGATCTACGG CAAGCTCGAC AGCGACCTCG CCGCGGCCAT GATGACGATC
AACGCGGTGA AGGGTGTCGA GATCGGCGAG GGGATGGCCG CGGCCTGCCT CACCGGCAGC
GCCAATGCCG ACGAAATCCG CATGGGCCCC GAGGGCCCCG AGTTCCTGAC CAACCATGCG
GGCGGCATCC TCGGCGGCAT CTCGACCGGG CAGGATGTGG TGGTGCGCTT TGCGGTGAAG
CCCACCTCCT CGATCCTGAC CCCGCGCCGC TCGGTCACGA CCGACGGGCG CGAGGTGGAG
GTGGTGACGA AGGGCCGCCA CGATCCCTGC GTGGGCATCC GCGCGGTGCC GGTGGGCGAG
GCGATGATGG CCTGCGTGCT GCTCGACCAT CTGCTGCTCG ACCGCGGCCA GACCGGCGGC
CTGCGCGGGA CGATCGGCTA G
 
Protein sequence
MSYNTFGHIF RVTTWGESHG PALGATVDGC PPGVAIEAEA IQHWLDRRKP GQNRFTTQRQ 
EPDAVRILSG TFEGRSTGTP IQLMIENTDQ RSKDYGEIAR SFRPGHADIA YHWKYGLRDY
RGGGRSSARE TAARVAAGGV ARAALAALVP GLRIEGYMVQ IGPHAIDRAR FDADEIERNP
FWCPDPDTAA LWADYLDGLR KAHDSVGAIV EVRASGVPAG LGAPIYGKLD SDLAAAMMTI
NAVKGVEIGE GMAAACLTGS ANADEIRMGP EGPEFLTNHA GGILGGISTG QDVVVRFAVK
PTSSILTPRR SVTTDGREVE VVTKGRHDPC VGIRAVPVGE AMMACVLLDH LLLDRGQTGG
LRGTIG