Gene PA14_07940 details


Gene Information

Locus tag: PA14_07940
Symbol: trpE
ID: 4385485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 683802
End bp: 685280
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 65%
IMG OID: 639323184
Product: anthranilate synthase component I
Protein accession: YP_788782
Protein GI: 116054337
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
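
The gene and protein lengths in this record follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(683802, 685280)
print(length)                     # 1479
print(protein_length_aa(length))  # 492
```

Both values agree with the Gene Length and Protein Length fields of the record.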


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.00127058
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATCGCG AAGAATTCCT GCGGCTGGCC GCCGATGGCT ACAACCGCAT CCCGCTGTCC 
TTCGAGACCC TTGCCGACTT CGACACGCCG CTGTCGATCT ACCTGAAGCT GGCCGACGCG
CCGAACTCCT ACCTGCTGGA GTCGGTGCAG GGCGGCGAGA AATGGGGGCG CTATTCGATC
ATCGGCCTGC CGTGTCGCAC GGTGCTGCGG GTCTACGACC ATCAAGTGCG GATCAGCATC
GATGGCATGG AAACCGAGCG CTTCGATTGC GCCGACCCGC TGGCTTTCGT CGAGGAGTTC
AAGGCGCGCT ACCAGGTGCC CACCGTGCCC GGCTTGCCAC GTTTCGATGG AGGCCTGGTC
GGCTATTTCG GTTACGACTG CGTGCGCTAC GTGGAAAAAC GCCTGGCCAC CTGTCCGAAC
CCGGACCCGC TGGGCAACCC GGATATCCTG TTGATGGTGT CCGATGCCGT AGTGGTATTC
GACAACCTGG CTGGGAAGAT CCACGCCATC GTCCTCGCCG ATCCCTCCGA GGAAAATGCC
TACGAGCGCG GCCAGGCACG TCTGGAGGAG CTGCTGGAGC GTCTGCGCCA GCCGATCACC
CCGCGTCGCG GCCTCGACCT CGAGGCGGCC CAGGGCCGGG AGCCGGCGTT TCGTGCCAGC
TTCACCCGCG AGGACTATGA AAACGCGGTA GGAAGGATCA AGGACTACAT CCTGGCCGGC
GACTGCATGC AGGTGGTGCC GTCGCAGCGC ATGTCCATCG AGTTCAAGGC GGCGCCCATC
GACCTGTACC GCGCGCTGCG CTGTTTCAAT CCGACGCCCT ACATGTACTT CTTCAACTTC
GGCGACTTCC ATGTCGTGGG CAGCTCGCCG GAGGTGCTGG TACGGGTCGA GGATGGCCTG
GTGACGGTGC GCCCGATCGC CGGTACCCGT CCGCGCGGGA TCAACGAAGA GGCCGACCTG
GCGCTGGAGC AGGATCTGCT GTCGGACGCC AAGGAGATCG CCGAGCACCT GATGCTGATC
GACCTGGGGC GCAACGACGT GGGGCGGGTG TCCGACATCG GCGCGGTGAA GGTCACCGAA
AAAATGGTGA TCGAACGTTA CTCCAACGTC ATGCACATCG TGTCCAACGT CACCGGGCAA
TTGCGCGAGG GGCTCAGCGC GATGGACGCG CTGCGGGCGA TCCTGCCGGC GGGTACGCTG
TCCGGCGCGC CGAAGATCCG CGCCATGGAG ATCATCGACG AGCTGGAGCC GGTCAAGCGT
GGAGTCTACG GCGGCGCGGT CGGCTACCTG GCATGGAACG GCAACATGGA CACCGCCATT
GCCATCCGCA CCGCGGTGAT CAAGAACGGT GAACTCCACG TGCAGGCCGG CGGCGGTATC
GTTGCCGACT CGGTGCCGGC GCTGGAGTGG GAAGAAACCA TCAACAAGCG CCGGGCGATG
TTCCGCGCCG TGGCGCTGGC CGAGCAGAGC GTCGAGTAA
 
Protein sequence
MNREEFLRLA ADGYNRIPLS FETLADFDTP LSIYLKLADA PNSYLLESVQ GGEKWGRYSI 
IGLPCRTVLR VYDHQVRISI DGMETERFDC ADPLAFVEEF KARYQVPTVP GLPRFDGGLV
GYFGYDCVRY VEKRLATCPN PDPLGNPDIL LMVSDAVVVF DNLAGKIHAI VLADPSEENA
YERGQARLEE LLERLRQPIT PRRGLDLEAA QGREPAFRAS FTREDYENAV GRIKDYILAG
DCMQVVPSQR MSIEFKAAPI DLYRALRCFN PTPYMYFFNF GDFHVVGSSP EVLVRVEDGL
VTVRPIAGTR PRGINEEADL ALEQDLLSDA KEIAEHLMLI DLGRNDVGRV SDIGAVKVTE
KMVIERYSNV MHIVSNVTGQ LREGLSAMDA LRAILPAGTL SGAPKIRAME IIDELEPVKR
GVYGGAVGYL AWNGNMDTAI AIRTAVIKNG ELHVQAGGGI VADSVPALEW EETINKRRAM
FRAVALAEQS VE
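
The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be analysed. The following Python sketch shows one way to do that, compute a GC fraction, and translate the gene sequence. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (to my understanding, table 11 differs mainly in the permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with their amino acids.
# Assumption: table 11 uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAATCGCG AAGAATTCCT GCGGCTGGCC"
print(translate(first_blocks))  # MNREEFLRLA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is a quick consistency check against the Protein Length and GC content fields.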