Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PA14_07940 |
Symbol | trpE |
ID | 4385485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 683802 |
End bp | 685280 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639323184 |
Product | anthranilate synthase component I |
Protein accession | YP_788782 |
Protein GI | 116054337 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00127058 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATCGCG AAGAATTCCT GCGGCTGGCC GCCGATGGCT ACAACCGCAT CCCGCTGTCC TTCGAGACCC TTGCCGACTT CGACACGCCG CTGTCGATCT ACCTGAAGCT GGCCGACGCG CCGAACTCCT ACCTGCTGGA GTCGGTGCAG GGCGGCGAGA AATGGGGGCG CTATTCGATC ATCGGCCTGC CGTGTCGCAC GGTGCTGCGG GTCTACGACC ATCAAGTGCG GATCAGCATC GATGGCATGG AAACCGAGCG CTTCGATTGC GCCGACCCGC TGGCTTTCGT CGAGGAGTTC AAGGCGCGCT ACCAGGTGCC CACCGTGCCC GGCTTGCCAC GTTTCGATGG AGGCCTGGTC GGCTATTTCG GTTACGACTG CGTGCGCTAC GTGGAAAAAC GCCTGGCCAC CTGTCCGAAC CCGGACCCGC TGGGCAACCC GGATATCCTG TTGATGGTGT CCGATGCCGT AGTGGTATTC GACAACCTGG CTGGGAAGAT CCACGCCATC GTCCTCGCCG ATCCCTCCGA GGAAAATGCC TACGAGCGCG GCCAGGCACG TCTGGAGGAG CTGCTGGAGC GTCTGCGCCA GCCGATCACC CCGCGTCGCG GCCTCGACCT CGAGGCGGCC CAGGGCCGGG AGCCGGCGTT TCGTGCCAGC TTCACCCGCG AGGACTATGA AAACGCGGTA GGAAGGATCA AGGACTACAT CCTGGCCGGC GACTGCATGC AGGTGGTGCC GTCGCAGCGC ATGTCCATCG AGTTCAAGGC GGCGCCCATC GACCTGTACC GCGCGCTGCG CTGTTTCAAT CCGACGCCCT ACATGTACTT CTTCAACTTC GGCGACTTCC ATGTCGTGGG CAGCTCGCCG GAGGTGCTGG TACGGGTCGA GGATGGCCTG GTGACGGTGC GCCCGATCGC CGGTACCCGT CCGCGCGGGA TCAACGAAGA GGCCGACCTG GCGCTGGAGC AGGATCTGCT GTCGGACGCC AAGGAGATCG CCGAGCACCT GATGCTGATC GACCTGGGGC GCAACGACGT GGGGCGGGTG TCCGACATCG GCGCGGTGAA GGTCACCGAA AAAATGGTGA TCGAACGTTA CTCCAACGTC ATGCACATCG TGTCCAACGT CACCGGGCAA TTGCGCGAGG GGCTCAGCGC GATGGACGCG CTGCGGGCGA TCCTGCCGGC GGGTACGCTG TCCGGCGCGC CGAAGATCCG CGCCATGGAG ATCATCGACG AGCTGGAGCC GGTCAAGCGT GGAGTCTACG GCGGCGCGGT CGGCTACCTG GCATGGAACG GCAACATGGA CACCGCCATT GCCATCCGCA CCGCGGTGAT CAAGAACGGT GAACTCCACG TGCAGGCCGG CGGCGGTATC GTTGCCGACT CGGTGCCGGC GCTGGAGTGG GAAGAAACCA TCAACAAGCG CCGGGCGATG TTCCGCGCCG TGGCGCTGGC CGAGCAGAGC GTCGAGTAA
|
Protein sequence | MNREEFLRLA ADGYNRIPLS FETLADFDTP LSIYLKLADA PNSYLLESVQ GGEKWGRYSI IGLPCRTVLR VYDHQVRISI DGMETERFDC ADPLAFVEEF KARYQVPTVP GLPRFDGGLV GYFGYDCVRY VEKRLATCPN PDPLGNPDIL LMVSDAVVVF DNLAGKIHAI VLADPSEENA YERGQARLEE LLERLRQPIT PRRGLDLEAA QGREPAFRAS FTREDYENAV GRIKDYILAG DCMQVVPSQR MSIEFKAAPI DLYRALRCFN PTPYMYFFNF GDFHVVGSSP EVLVRVEDGL VTVRPIAGTR PRGINEEADL ALEQDLLSDA KEIAEHLMLI DLGRNDVGRV SDIGAVKVTE KMVIERYSNV MHIVSNVTGQ LREGLSAMDA LRAILPAGTL SGAPKIRAME IIDELEPVKR GVYGGAVGYL AWNGNMDTAI AIRTAVIKNG ELHVQAGGGI VADSVPALEW EETINKRRAM FRAVALAEQS VE
|
| |