Gene Rsph17029_0057 details


Gene Information

Locus tag: Rsph17029_0057
Symbol:
ID: 4897234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 66841
End bp: 67941
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 71%
IMG OID: 640110633
Product: chorismate synthase
Protein accession: YP_001041949
Protein GI: 126460835
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
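
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 66841
end_bp = 67941
gene_length_bp = 1101
protein_length_aa = 366


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks agree with the record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```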


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.14254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTACA ACACTTTCGG CCACATCTTC CGCGTCACCA CCTGGGGCGA GAGCCACGGG 
CCCGCGCTCG GCGCGACGGT GGATGGCTGC CCGCCCGGCG TCGCGATCGA GGCCGAGGCG
ATCCAGCACT GGCTCGACCG CCGGAAGCCC GGCCAGAACC GCTTCACCAC CCAGCGGCAG
GAGCCGGATG CGGTCAGGAT CCTTTCGGGC ACCTTCGAGG GCCGCTCGAC CGGCACGCCG
ATCCAGCTCA TGATCGAGAA CACCGACCAG CGGTCGAAGG ACTATGGCGA GATCGCCCGG
AGCTTCCGGC CGGGTCATGC CGACATCGCC TATCACTGGA AATACGGGCT GCGCGACTAT
CGCGGCGGCG GGCGCTCCTC GGCGCGCGAG ACGGCGGCGC GGGTCGCGGC GGGCGGTGTC
GCCCGGGCAG CGCTGGCGGC CTTGGTGCCC GGCCTGCGGA TCGAGGGCTA CATGGTCCAG
ATCGGGCCGC ATGCCATCGA CCGCGCCCGG TTCGACGCGG ACGAGATCGA GCGCAACCCC
TTCTGGTGCC CGGATTCCGA TACGGCCGCG CTCTGGGCCG ACTATCTCGA CGGGCTGCGC
AAGGCGCACG ATTCGGTGGG CGCCATCGTC GAGGTGCGGG CCTCGGGCGT GCCGGCAGGG
CTCGGCGCGC CGATCTACGG CAAGCTCGAC AGCGACCTCG CCGCGGCCAT GATGACGATC
AACGCGGTGA AGGGTGTCGA GATCGGCGAG GGAATGGCCG CGGCCTGCCT CACCGGCAGC
GCCAATGCCG ACGAGATCCG CATGGGCCCC GAAGGGCCCG AGTTCCTGAC CAACCATGCG
GGCGGCATCC TCGGCGGCAT CTCGACCGGG CAGGATGTGG TGGTGCGGTT TGCGGTGAAG
CCCACCTCCT CGATCCTGAC CCCGCGCCGC TCGGTCACGA CCGACGGGTG CGAGGTGGAG
GTGGTGACGA AGGGCCGCCA CGATCCCTGC GTGGGCATCC GCGCGGTGCC GGTGGGCGAG
GCGATGATGG CCTGCGTGCT GCTCGACCAT CTGCTGCTGG ACCGCGGCCA GACCGGCGGC
CTGCGCGGGA CGATCGGCTA G
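
The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. As a minimal sketch (the helper names are hypothetical), the following strips the block formatting and computes a GC percentage of the kind reported in the GC content field. Only the first two blocks of the sequence are used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence in this record.
sample = clean_sequence("ATGAGCTACA ACACTTTCGG")
print(len(sample), gc_content(sample))  # 20 45.0
```

Pasting the full gene sequence into `clean_sequence` in place of the sample gives the value for the whole gene.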
 
Protein sequence
MSYNTFGHIF RVTTWGESHG PALGATVDGC PPGVAIEAEA IQHWLDRRKP GQNRFTTQRQ 
EPDAVRILSG TFEGRSTGTP IQLMIENTDQ RSKDYGEIAR SFRPGHADIA YHWKYGLRDY
RGGGRSSARE TAARVAAGGV ARAALAALVP GLRIEGYMVQ IGPHAIDRAR FDADEIERNP
FWCPDSDTAA LWADYLDGLR KAHDSVGAIV EVRASGVPAG LGAPIYGKLD SDLAAAMMTI
NAVKGVEIGE GMAAACLTGS ANADEIRMGP EGPEFLTNHA GGILGGISTG QDVVVRFAVK
PTSSILTPRR SVTTDGCEVE VVTKGRHDPC VGIRAVPVGE AMMACVLLDH LLLDRGQTGG
LRGTIG
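
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the tool the database used: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three and last three blocks of the gene sequence in this record.
print(translate("ATGAGCTACA ACACTTTCGG CCACATCTTC"))  # MSYNTFGHIF
print(translate("CTGCGCGGGA CGATCGGCTA G"))           # LRGTIG
```

The two outputs match the first and last residues of the protein sequence listed above.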