Gene Rpic12D_1657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpic12D_1657 
Symbol 
ID8019309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia pickettii 12D 
KingdomBacteria 
Replicon accessionNC_012856 
Strand
Start bp1760907 
End bp1762007 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content64% 
IMG OID644830438 
Productchorismate synthase 
Protein accessionYP_002981612 
Protein GI241663252 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.112196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0287757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGCA ATACCCTGGG CCTGCTGTTT TCCGTCACCA CCTTCGGCGA GTCGCATGGC 
CCGGCCATCG GCGCTGTCAT CGACGGCTGC CCGCCCGGCA TGACGCTGTC TGCAGAAGAC
ATCCAGCCCG ATCTGGATCG GCGCAAGCCC GGCACCTCGC GCCACGTGAC GCAGCGCAAG
GAGGAAGACC TCGTCGAGAT CCTCTCAGGC GTGTACGAAG GGAAAACCAC GGGCACCCCG
ATCTGCCTGC TGATCCGCAA TACCGACCAG CGCAGCAAGG ATTACAGCAA CATCGCCGAG
ACATTCCGTC CCGGCCATGC CGATTACACG TACTGGCACA AATACGGCAT CCGCGACCCG
CGTGGCGGCG GCCGTTCTTC CGCACGCCTG ACTGCGCCGA CGGTGGCCGC AGGTGCAGTC
GCCAAGAAAT GGCTGCGCGA GAAGTTCGGC ATCGAAATCC ACGGCTTCAT GTCGCAACTG
GGCGATATCC AGATCCCGTT CATGGACTGG AACGAAGTTC CGAACAACCC GTTCTTCGCG
CCAAATGCAG AGATCGTTCC CGAACTGGAA ACCTACATGG ATGCACTGCG CAAAGACGGC
GATTCCGTCG GCGCGCGCAT TGAAGTGGTG GCCACGGGCG TGCCCGTCGG CTGGGGCGAG
CCGCTGTTCG ATCGCCTGGA TGCAGACATT GCCCACGCCA TGATGGGTCT GAACGCAGTC
AAGGGCGTTG AGATCGGTGC AGGCTTCCAC GCGGTCAGCC AGCGCGGCTC CGAGCATGGT
GATGAGCTGA CGCCTGAGGG CTTTGTCGGC AACAACGCCG GCGGCATCTT GGGCGGGATT
TCCACGGGGC AGGACATCTC CGTGTCGCTG GCGATCAAGC CGACGTCGAG CATTCGCACG
CCGCGTCGTT CGATCGACAA GGCGGGTGAT CCGGCTGTGG TCGAAACGTT CGGCCGCCAT
GACCCGTGCG TTGGCATCCG TGCCACGCCG ATCGCCGAGG CGCTGCTCGC GCTGGTGCTG
ATCGACCACG CGCTGCGCCA TCGTGCGCAA TGCGGCGACG TGTCGGTCGA GACGCCGGCC
ATCGCGGCAA AAGCGTCCTG A
 
Protein sequence
MSGNTLGLLF SVTTFGESHG PAIGAVIDGC PPGMTLSAED IQPDLDRRKP GTSRHVTQRK 
EEDLVEILSG VYEGKTTGTP ICLLIRNTDQ RSKDYSNIAE TFRPGHADYT YWHKYGIRDP
RGGGRSSARL TAPTVAAGAV AKKWLREKFG IEIHGFMSQL GDIQIPFMDW NEVPNNPFFA
PNAEIVPELE TYMDALRKDG DSVGARIEVV ATGVPVGWGE PLFDRLDADI AHAMMGLNAV
KGVEIGAGFH AVSQRGSEHG DELTPEGFVG NNAGGILGGI STGQDISVSL AIKPTSSIRT
PRRSIDKAGD PAVVETFGRH DPCVGIRATP IAEALLALVL IDHALRHRAQ CGDVSVETPA
IAAKAS