Gene RPC_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1142 
Symbol 
ID3969537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1241910 
End bp1242998 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content67% 
IMG OID637924252 
Productchorismate synthase 
Protein accessionYP_531024 
Protein GI90422654 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.579597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.271094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTCA ACACCTTCGG CCACATGTTT CGCGTCACCA CCTTCGGCGA GAGCCATGGC 
GTGGCGATCG GCTGCGTGGT CGACGGCTGT CCGCCCTTGA TCGCGCTTAC CGAGGCCGAC
ATCCAGCGCG ACCTCGACCG CAGGCGGCCG GGGCAGTCGC GCTTCACCAC CCAGCGCCAG
GAAGCCGACC AGGTGAAGAT CCTGTCCGGG GTGATGGTGC ATCCGCAGAG CGGCTTGCAG
GTCACCACCG GCGCGCCGAT CGCGCTCTTG ATCGAGAACA CCGACCAGCG CTCGAAAGAC
TATTCCGAGA TCAAGGACAA GTTTCGCCCC GGCCACGCCG ACTTCACCTA TGAGGCGAAA
TACGGCATCC GCGATTATCG CGGCGGCGGC CGTTCCTCGG CGCGCGAGAC CGCGACCCGC
GTCGCCGCCG GTGCGATCGC CCGCAAAGTG GTGCCCGGCA TCACCGTGCG CGCCGCTTTG
GTGCAGATGG GGCCGCACCA GATCGACCGC GACAACTGGG ATTGGGAGGA GGTCGGCAAC
AATCCGTTCT TCTGCCCGGA CAAGGACAAG GCGAAATTCT TCGAGGACTA TCTCGACGGC
ATCCGCAAGA ACGGCTCCTC GATCGGCGCG GTGATCGAGG TGGTCGCCGA CGGCGTGCCG
GCGGGGTGGG GCGCGCCGAT CTACGCCAAG CTCGACACCG ACATCGCCGC GGCGCTGATG
AGCATCAACG CGGTGAAGGG CGTCGAGATC GGCGACGGCT TCGCCACCGC AGCACTCACC
GGCGAGCAGA ACGCCGACGA AATGCGCGCC GGCAATGATG GCCCGAGCTT CCTGTCGAAC
CACGCCGGCG GCATTTTGGG CGGCATCTCC ACCGGGCAGC CGGTGGTGGC GCGGTTTGCG
GTGAAGCCGA CCTCCTCGAT CCTGGCGCCG CGCAAGACCG TGGATCGCGA CGGCCACGAC
ACCGACATTC TCACCAAGGG CCGCCACGAC CCCTGCGTCG GCATCCGCGC GGTGTCGGTG
GCCGAAGCCA TGGTCGCCTG CGTGCTCGCC GATCACCTGA TCCGCCACCG CGGCCAGATC
GGCGGGTAG
 
Protein sequence
MSFNTFGHMF RVTTFGESHG VAIGCVVDGC PPLIALTEAD IQRDLDRRRP GQSRFTTQRQ 
EADQVKILSG VMVHPQSGLQ VTTGAPIALL IENTDQRSKD YSEIKDKFRP GHADFTYEAK
YGIRDYRGGG RSSARETATR VAAGAIARKV VPGITVRAAL VQMGPHQIDR DNWDWEEVGN
NPFFCPDKDK AKFFEDYLDG IRKNGSSIGA VIEVVADGVP AGWGAPIYAK LDTDIAAALM
SINAVKGVEI GDGFATAALT GEQNADEMRA GNDGPSFLSN HAGGILGGIS TGQPVVARFA
VKPTSSILAP RKTVDRDGHD TDILTKGRHD PCVGIRAVSV AEAMVACVLA DHLIRHRGQI
GG