Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PA14_41920 |
Symbol | aroF-1 |
ID | 4381830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 3742858 |
End bp | 3743934 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639325930 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_791495 |
Protein GI | 116049700 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATT TACAGATCGA CGACCTTAAC GTTGCCTCCA ACGAGACCCT GATCACGCCG GAGCAGCTCA AGCGTGAAAT TCCCCTGACC GACAAGGCCC TGCAGACCGT GGCCCATGGT CGCCAGGTGG TGCGCGACAT CCTGGATGGC AAGGACCACC GTCTGTTCGT GGTGATCGGC CCCTGCTCCA TCCACGACAT CAAGGCCGCC CACGAATACG CCGACCGCCT CAAGGCGCTC GCGGCCGAAG TGGCGGATAC GCTGTTCCTG GTGATGCGCG TGTACTTCGA GAAGCCGCGT ACCACGGTGG GCTGGAAAGG CCTGATCAAC GATCCGTACC TGGACGACTC GTTCAAGATC CAGGATGGCC TGCACATCGG TCGCCAACTG CTCCGCGACC TCGCCGAGAA AGGCTTGCCC ACCGCCACCG AAGCGCTCGA CCCGATTTCC CCGCAGTACC TGCAGGACCT GATCAGCTGG TCGGCGATCG GCGCCCGTAC CACCGAATCG CAGACCCACC GCGAGATGGC CTCCGGCCTG TCTTCCGCGG TCGGCTTCAA GAACGGCACC GATGGCAGCC TGACCGTGGC GATCAATGCC CTGCAGTCGG TCTCCAGCCC GCATCGCTTC CTCGGCATCA ACCAGCAGGG CGGCGTATCC ATCGTCACCA CCAAGGGCAA CCGCTACGGT CACGTGGTGT TGCGCGGCGG CAACGGCAAG CCGAACTACG ATTCGGTCAG CGTCGCGCTC TGCGAGCAGG ACCTGAACAA GGCGAAAATC CCGCTGAACA TCATGGTCGA CTGCAGCCAC GCCAACTCCA ACAAGGACCC GGCCCTGCAA CCGCTGGTGA TGGACAACGT CAGCAACCAG ATCGTCGAAG GCAACAACTC GATCGTCGGC CTGATGGTGG AAAGCCACCT GGGCTGGGGC AGCCAGCCGA TTCCGAAGGA TCTCGACCAA CTTCAGTACG GCGTCTCCAT CACCGACGCC TGCATCGACT GGGCGACCAC CGAGAAGAGC ATCCGCAGCA TGCACGCCAA GCTCAAGGAC GTGCTGCCGA AACGCCAGCG CGGCTGA
|
Protein sequence | MADLQIDDLN VASNETLITP EQLKREIPLT DKALQTVAHG RQVVRDILDG KDHRLFVVIG PCSIHDIKAA HEYADRLKAL AAEVADTLFL VMRVYFEKPR TTVGWKGLIN DPYLDDSFKI QDGLHIGRQL LRDLAEKGLP TATEALDPIS PQYLQDLISW SAIGARTTES QTHREMASGL SSAVGFKNGT DGSLTVAINA LQSVSSPHRF LGINQQGGVS IVTTKGNRYG HVVLRGGNGK PNYDSVSVAL CEQDLNKAKI PLNIMVDCSH ANSNKDPALQ PLVMDNVSNQ IVEGNNSIVG LMVESHLGWG SQPIPKDLDQ LQYGVSITDA CIDWATTEKS IRSMHAKLKD VLPKRQRG
|
| |