Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PA14_32690 |
Symbol | gtdA |
ID | 4380554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | - |
Start bp | 2838300 |
End bp | 2839361 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639325198 |
Product | gentisate 1,2-dioxygenase |
Protein accession | YP_790767 |
Protein GI | 116050414 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000000796263 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000100367 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATAACG ACAATTCCAC CGAGAAGCGC GCCGACTTCT ATCGACGCAT CCGCCAGCAG CACCTCACCC CGCTCTGGGA AGCCCTGCAC AACCTGGTCC CGGCGCAACC CGCCGGCGGC TGCCAGGCGG CGCTGTGGCG CTATCGCGAA CTGCGCCCGT TCCTGCTCGA GGCAGGCGAC CTGATCAGCG CCGAGGAAGC GGTGCGCCGC GTGCTGGTGC TGGAGAACCC GGACCTTCCC GGCCAGTCGG CGATCACCCC GAGCCTCTAC GCCGGCCTGC AACTGATCCT GCCGGGCGAG ATCGCCCCCA GCCACCGGCA CACCCAGTCG GCCCTGCGCT TCGTCGTCGA GGGCTACGGC GCCTACACCT CGGTGGACGG CGAACGCACC CGCATGGAGC CGGGCAACTT CATCATCACT CCGTCCTGGA CCTGGCACGA CCACGGCAAC CGCCCCGCCG CGGAAGGCGG CGAGCCGGTG GTCTGGCTGG ACGGCCTGGA CATCCCGATG CTGCGTTTCT TCGGCGCCAC CTTCGCCGAG AGCTACGGCG AACCGGTGCA GCCGTTGCGA CGTGGCGAAG GCGACAGCCT CGCCCGCTAC GGCAGCAACA TGCTGCCGCT GCGCCACCAG CCGACGACGC CGACCTCGCC GCTGTTCAGT TACCCCTACG CGCGCAGCCG CGAGGCCCTC GAGCGCCTGG CGCGCCTCGA ACGGCCCGAT CCCTGGGAAG GCCACAAGCT GCGCTACGTC AACCCGGCCA CCGGCGGCTG GGCCATGCCG ACCATCGCCA CCTGCCTGCA ACTGCTGCCG GCCGGCTTCC TCGGCCAACC CGCGCGCAGC ACCGACGCCA GCGTCTATTC GGTGGTGGAA GGCAGCGGCG TGGCGCAAAT CGGCGGCCGG CGCTTCGCCT TCGAGGCCAA GGATCTGTTC GTCGTGCCCT CCTGGGCCGA GCTGCGCCTG GAAGCCGGCG CCACCGACTG CGTGCTGTTC AGTTTCTCCG ACCGTCCTGT GCAGCAGGCG CTCGGCATCC TCCGCGAATC CCGCGAACCC CTCGCCCACT GA
|
Protein sequence | MHNDNSTEKR ADFYRRIRQQ HLTPLWEALH NLVPAQPAGG CQAALWRYRE LRPFLLEAGD LISAEEAVRR VLVLENPDLP GQSAITPSLY AGLQLILPGE IAPSHRHTQS ALRFVVEGYG AYTSVDGERT RMEPGNFIIT PSWTWHDHGN RPAAEGGEPV VWLDGLDIPM LRFFGATFAE SYGEPVQPLR RGEGDSLARY GSNMLPLRHQ PTTPTSPLFS YPYARSREAL ERLARLERPD PWEGHKLRYV NPATGGWAMP TIATCLQLLP AGFLGQPARS TDASVYSVVE GSGVAQIGGR RFAFEAKDLF VVPSWAELRL EAGATDCVLF SFSDRPVQQA LGILRESREP LAH
|
| |