Gene Rsph17025_3880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3880 
Symbol 
ID5085428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp779527 
End bp780864 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content72% 
IMG OID640485439 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001170040 
Protein GI146279882 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGAC ACGGGGCCGC GCAACCGATG ACCGCCCGCC GCTCGGGGCC GCTCAGGGGG 
AGGGCCGAGA TCCCGGGCGA CAAGTCGATC AGCCACCGCG CGCTGATCCT GGGCGCGATG
GCTGTTGGCG AGACGCGGAT CACGGGCCTG CTCGAGGGGC AGGACGTGCT CGATACGGCC
AAGGCGATGC GCGCCTTCGG GGCCGAGCTG ATCCAGCACG GCCCGGGCGA CTGGTCGGTG
CATGGGGTGG GAGTGGGCGG CTTTACCGAA CCCGCCGAGG TGATCGACTG CGGCAACTCG
GGAACGGGGG TGCGGCTCAT CATGGGGTCG ATGGCGACGT CGCCGATCAC CGCCACCTTC
ACGGGCGACG CCTCGTTGCG CAAGCGGCCG ATGGGGCGGG TGACCGATCC GCTGGCGCTG
TTCGGGGCGC GCGCCTACGG GCGCAAGGGC GGGCGGTTGC CGATGACGCT GGTGGGGGCG
GCCGAGCCGG TGCCGGTGCA CTACACGGTG CCGGTGCCGT CGGCGCAGGT GAAGTCGGCC
GTCCTGCTCG CGGGGTTGAA CGCGCCGGGC CAGACGGTGG TCGTCGAACG CGAGGCCACG
CGGGACCATT CCGAGCGGAT GCTGCGCGGC TTCGGGGCGG AACTGACGGT CGAGGCCGCG
CCCGAAGGGC AGATCATCAC CCTGACGGGG CAGCCCGAGC TGCGGCCGCA GACGGTGGCG
GTGCCGCGCG ATCCGTCCTC GGCGGCCTTT CCGGTCTGCG CCGCGCTGAT CGTGGAAGGG
TCGGAGATCC TCGTGCCGGG GGTCAGCCGG AATCCGACGC GGGATGGCCT TTATGTGACG
CTGCTCGAGA TGGGGGCGGA CATCGCCTTC GAGAACGAGC GCGAGGAAGG GGGCGAGCCG
GTCGCGGACC TCCGCGTGCG CGCCTCGGAG CTGAAGGGGG TGGAGGTGCC GCCCGAGCGC
GCGCCGTCGA TGATCGACGA ATATCCGATC CTGTCGGTGG TGGCGGCCTT CGCGGACGGC
ACCACCATCA TGCGCGGTGT GAAGGAGTTG CGCGTGAAGG AGAGCGACCG GATCGACGCC
ATGGCGCGCG GCCTCGAGGC CTGCGGCGTG CGGATCGAGG AGGACGAGGA CACGCTGGTC
GTGCACGGGA GGGGGAGCGT TCCGGGAGGG GCGACCTGCG CCACCCACCT CGACCACCGC
ATCGCGATGA GCTTCCTCGT GCTCGGCATG GCCGCGGAGG CGCCGGTCGC GGTGGACGAC
GGCTCGCCCA TCGAGACCTC CTTTCCGATC TTCATGGGGT TGATGCGCAC GCTCGGGGCG
GATCTGTCGG ACGGTTGA
 
Protein sequence
MSGHGAAQPM TARRSGPLRG RAEIPGDKSI SHRALILGAM AVGETRITGL LEGQDVLDTA 
KAMRAFGAEL IQHGPGDWSV HGVGVGGFTE PAEVIDCGNS GTGVRLIMGS MATSPITATF
TGDASLRKRP MGRVTDPLAL FGARAYGRKG GRLPMTLVGA AEPVPVHYTV PVPSAQVKSA
VLLAGLNAPG QTVVVEREAT RDHSERMLRG FGAELTVEAA PEGQIITLTG QPELRPQTVA
VPRDPSSAAF PVCAALIVEG SEILVPGVSR NPTRDGLYVT LLEMGADIAF ENEREEGGEP
VADLRVRASE LKGVEVPPER APSMIDEYPI LSVVAAFADG TTIMRGVKEL RVKESDRIDA
MARGLEACGV RIEEDEDTLV VHGRGSVPGG ATCATHLDHR IAMSFLVLGM AAEAPVAVDD
GSPIETSFPI FMGLMRTLGA DLSDG