Gene PA14_42760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_42760 
SymbolaroC 
ID4381409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp3802844 
End bp3803935 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content68% 
IMG OID639326000 
Productchorismate synthase 
Protein accessionYP_791565 
Protein GI116049630 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCA ACACCTACGG CAAGCTCTTC ACCGTCACCA CCGCAGGCGA AAGCCACGGC 
CCGGCGCTGG TCGCCATCGT CGATGGGTGC CCCCCGGGGC TGGAACTGTC CGCCCGGGAC
CTGCAACGCG ACCTCGACCG GCGCAAGCCC GGCACCAGCC GGCACACCAC CCAGCGCCAG
GAAGCCGACG AGGTGGAGAT TCTTTCCGGG GTGTTCGAGG GCAAGACCAC CGGCACGCCG
ATCGGCCTGC TGATCCGCAA CACCGACCAG AAGTCCAAGG ACTACTCGGC GATCAAGGAC
CTGTTCCGCC CGGCCCACGC CGACTACACC TACCACCACA AGTACGGCGT GCGCGACTAC
CGCGGCGGCG GCCGTTCTTC GGCGCGCGAG ACCGCCATGC GCGTGGCCGC CGGGGCTATT
GCCAAGAAAT ACCTGGCGGG CCTGGGCATC CAGGTGCGCG GCTACATGAG CCAGCTCGGG
CCGATCGAGA TTCCGTTCAG GAGCTGGGAC AGCGTCGAGC AGAATGCCTT CTTCAGCCCC
GACCCGGACA AGGTGCCGGA GCTGGAGGCC TACATGGACC AATTGCGCCG CGACCAGGAT
TCGGTCGGGG CGAAGATCAC CGTGGTTGCC GAAGGCGTGC CGCCGGGCCT GGGCGAGCCG
ATCTTCGACC GCCTGGACGC CGAACTGGCG CATGCGCTGA TGAGCATCAA CGCGGTGAAG
GGCGTGGAGA TCGGCGCCGG CTTCGCCAGC ATCGCCCAGC GCGGCACCGA GCACCGCGAC
GAACTGACCC CGCAAGGCTT CCTGTCGAAC AATGCCGGCG GCATCCTCGG CGGGATCTCC
TCTGGCCAGC CGATCGTCGC CCACCTGGCG CTGAAGCCGA CCTCCAGCAT CACCACTCCC
GGGCGCTCGA TCGATACCGC CGGCGAGCCG GTGGACATGA TCACCAAGGG CCGTCACGAC
CCGTGCGTCG GCATCCGCGC CACGCCGATC GCCGAGGCGA TGATGGCCAT CGTCCTGCTC
GACCAGTTGC TGCGCCAGCG TGGGCAGAAC GCCGACGTGC GCGTCGACAC GCCGGTCCTG
CCGCAGCTGT GA
 
Protein sequence
MSGNTYGKLF TVTTAGESHG PALVAIVDGC PPGLELSARD LQRDLDRRKP GTSRHTTQRQ 
EADEVEILSG VFEGKTTGTP IGLLIRNTDQ KSKDYSAIKD LFRPAHADYT YHHKYGVRDY
RGGGRSSARE TAMRVAAGAI AKKYLAGLGI QVRGYMSQLG PIEIPFRSWD SVEQNAFFSP
DPDKVPELEA YMDQLRRDQD SVGAKITVVA EGVPPGLGEP IFDRLDAELA HALMSINAVK
GVEIGAGFAS IAQRGTEHRD ELTPQGFLSN NAGGILGGIS SGQPIVAHLA LKPTSSITTP
GRSIDTAGEP VDMITKGRHD PCVGIRATPI AEAMMAIVLL DQLLRQRGQN ADVRVDTPVL
PQL