Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPA7_3903 |
Symbol | |
ID | 5357638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa PA7 |
Kingdom | Bacteria |
Replicon accession | NC_009656 |
Strand | - |
Start bp | 4038852 |
End bp | 4040543 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640812953 |
Product | arylsulfatase |
Protein accession | YP_001349257 |
Protein GI | 152984576 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.313687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCAA TTTCCCGCAA GGCCCTGGGC CTGTTGTTCC CCTTGCTGTG CTGCGTAGCC CAGAGCCAGG CCGCGGCACC GTCGCGTCCG AACGTCCTGC TGATCGTCGC CGACGACCTC GGCTTCTCCG ATACCTCGCC CTTCGGCGGG GAAATCTCCA CGCCCAGCCT GCAGGGGCTG GCCGACGAGG GCGTGCGCCT GACCAACTTC TATGCCGGTC CGACCTGCTC GGTGAGCCGT TCGATGCTGC TCACCGGGGT CGACAATCAC CAGGCCGGTC TCGGCACCAT GGCCGAATAC CTGCAACCGG AACAGAAGGG CAAGCCGGGC TACGAAGGGC AGCTGAATCG CCGCGTGGCG ACGCTCGCCG AGTTGCTCCG CGACGTCGGC TACAACACCT TCATGAGCGG CAAGTGGCAC CTCGGCGCGA CCCCGCAGAG CAACGCCGCG GCGCGTGGCT TCGAGCGCTC CTATACCCTG ATGCCAGGGG GCGCCAGCCA CATGGACAAG ACCCAGATGT TCCCCGGCAA CTACAAGGCG CGCTACCTGG AGGACGGCAA GGACGTCTCG ATCCCCGACG ATTTCTACTC CAGCGACTTC TACACCGACC GCCTGCTCGG CTACCTGGAG CGCGACCGCC AGCCGGGCAA GCCGTTCTTC GCCTACCTGG CGTTCACCGC GCCACACTGG CCGCTGCAGG CGCCGGACGA GTACCTGGAC AGGTATCGCG GCAACTACGC GGACGGCTAC GAAGCGCTGC GCCGCAAGCG CCTGGCGCGG ATGATCGAAC TGGGGATACT GCCGGCCGGG ACCCGGCCCA ACGATCCGCT GGCGGACAAG CTGCCGACCT GGGAGCAGTT GACCGAGGCA CAGCGCCAGG AACAGACGCG GATCATGCAG ATCTATGCGG CGATGGTCGA CAACATGGAC CACAACATCG GGCGCGTGCT CGAGCACCTG CGCAAGCGCG GCGAGCTGGA CAATACCTTC GTCCTGTTCA TGTCCGACAA CGGCCCGGAG TCGGCCAGCC CAGAATCGCT GGGCACTACC GCAGACCGCA ACGGCATCCG CGAATGGGTC GATGCGACCT TCGACAACAG CCCGCGGAAC ATGGGCCGCA AGGGCTCGTA CGTGACCCTC GGACCGGGTT GGGCGCAGGT CGGCGCCACG CCGTTCCCGT ACTTCAAGAG CTTCACCGCG AAAGGCGGGA TCCAGGTACC GGCGATCGTG CGCTATCCCG CGGCGACCCC GAAGGGGGCG ATCAGTGGCG AGGTCCTGCA CGTGAAGGAC TTCGTGCCGA CCATCCTCGC CCTGGCCGGG GCGAACTATC CCGCACGGTA CCGGGGAGAG GCGCTGTTGC CGCTGCAGGG ACGCTCCATG CTCCCGGCCC TGCAGGGCCG GGAGCAGCCG GCGCGGGTGC TGGGCTGGGA GTTCAACGGT CGCCGGGCCC TGTACAAGGG CCAATGGGCC GCGCAGATGC AGAAGCCGCC TTATGGCAGC GGCCGCTGGG AGTTGTATGA CCTGGTGCGC GATCCGGCCT TCAACCGCGA TCTCTCCGCG AGCGAGCCGG GCAAGGTCGC CGAGCTTGCC GCTGACTGGA ATGACTACGC CAGGGACAAT GGCGTGGTGC CGGCGCCGAT CCGCTACAAA TACGGGCAGA TGACCTGCCT GTACAGCCAC TGCATCCAGT AG
|
Protein sequence | MSAISRKALG LLFPLLCCVA QSQAAAPSRP NVLLIVADDL GFSDTSPFGG EISTPSLQGL ADEGVRLTNF YAGPTCSVSR SMLLTGVDNH QAGLGTMAEY LQPEQKGKPG YEGQLNRRVA TLAELLRDVG YNTFMSGKWH LGATPQSNAA ARGFERSYTL MPGGASHMDK TQMFPGNYKA RYLEDGKDVS IPDDFYSSDF YTDRLLGYLE RDRQPGKPFF AYLAFTAPHW PLQAPDEYLD RYRGNYADGY EALRRKRLAR MIELGILPAG TRPNDPLADK LPTWEQLTEA QRQEQTRIMQ IYAAMVDNMD HNIGRVLEHL RKRGELDNTF VLFMSDNGPE SASPESLGTT ADRNGIREWV DATFDNSPRN MGRKGSYVTL GPGWAQVGAT PFPYFKSFTA KGGIQVPAIV RYPAATPKGA ISGEVLHVKD FVPTILALAG ANYPARYRGE ALLPLQGRSM LPALQGREQP ARVLGWEFNG RRALYKGQWA AQMQKPPYGS GRWELYDLVR DPAFNRDLSA SEPGKVAELA ADWNDYARDN GVVPAPIRYK YGQMTCLYSH CIQ
|
| |