Gene Information |
Locus tag | PA14_02310 |
Symbol | atsA |
ID | 4383583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | - |
Start bp | 206784 |
End bp | 208394 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639322740 |
Product | arylsulfatase |
Protein accession | YP_788341 |
Protein GI | 116053904 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
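The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 206784, 208394
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (1611 bp) and Protein Length (536 aa).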
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAC GCCCCAACTT CCTGGTGATC GTCGCCGACG ACCTGGGCTT CTCCGATATC GGCGCCTTCG GCGGCGAGAT CGCCACGCCG AACCTCGACG CCCTGGCCAT CGCCGGCCTG CGCCTGACCG ACTTCCACAC CGCCTCGACC TGCTCGCCGA CCCGCTCGAT GCTGCTCACC GGCACCGACC ACCACATCGC CGGGATCGGC ACCATGGCCG AGGCGCTGAC CCCGGAACTG GCAGGCAAGC CGGGTTACGA AGGGCATCTC AACGAGCGCG TGGTGGCGCT GCCGGAGCTG CTCCGCGAGG CCGGCTACCA GACCCTCATG GCCGGCAAGT GGCACCTCGG TCTGAAGCCG GAACAGACGC CCCATGCACG CGGTTTCGAG CGTTCCTTCT CGCTGCTGCC GGGCGCCGCC AACCACTACG GCTTCGAGCC TCCCTACGAC GAAAGCACTC CGCGCATCCT CAAGGGCACG CCGGCGCTCT ACGTGGAAGA CGAGCGCTAC CTCGACACGC TGCCGGAGGG TTTCTATTCC TCCGATGCCT TCGGCGACAA GCTGCTGCAC TACCTCAAGG AGCGCGACCA GAGCCGGCCG TTCTTCGCCT ACCTGCCGTT CTCCGCGCCG CACTGGCCGC TACAGGCGCC GCAGGAGATC GTCGAGAAGT ACCGCGGCCG CTATGACGCC GGCCCAGAGG CGCTGCGCCA GGAACGCCTG GCCCGGCTCA AGGAGCTGGG CCTGGTGGAG GCGGACGTCG AAGCCCATCC GGTGCTCGCC CTGAGCCGCG AGTGGGAAGC CCTGGACGAC GAGGAACGGG CGAAGTCGGC GCGGGCGATG GAGGTCTACG CGGCGATGGT CGAGCGCATG GACTGGAACA TCGGCAGGGT CGTGGACTAC CTGCGTCGGC AGGGCGAGCT GGACAACACC TTCGTCCTGT TCATGTCCGA CAACGGCGCC GAAGGCGCCC TGCTGGAGGC CTTTCCGAAA TTCGGCCCGG ATTTGCTGGG CTTTCTCGAC CGGCACTACG ACAACAGCCT GGAGAACATC GGCCGCGCCA ATTCCTACGT CTGGTATGGC CCGCGCTGGG CCCAGGCGGC CACCGCGCCG TCGCGCCTGT ACAAGGCGTT CACCACCCAG GGCGGCATTC GCGTGCCAGC GCTGGTGCGC TACCCGCGGC TAAGCCGGCA GGGCGCGATC AGCCATGCCT TCGCCACGGT GATGGACGTC ACCCCGACCC TCCTCGACCT CGCCGGTGTC CGCCACCCAG GCAAGCGCTG GCGCGGCCGC GAGATCGCCG AACCGCGCGG CAGGTCGTGG CTGGGCTGGC TTTCCGGCGA GACCGAGGCG GCCCACGACG AGAACACCGT GACCGGCTGG GAGCTGTTCG GCATGCGTGC GATCCGCCAG GGCGACTGGA AGGCGGTGTA CCTGCCGGCC CCGGTGGGCC CGGCCACCTG GCAGCTCTAC GACCTGGCCC GCGACCCGGG CGAGATCCAC GACCTCGCTG ACAGCCAGCC GGGCAAGCTG GCGGAGCTGA TCGAGCATTG GAAGCGCTAC GTCAGCGAGA CCGGTGTCGT AGAGGGGGCT TCGCCTTTCC TGGTGCGATA A
Protein sequence | MSKRPNFLVI VADDLGFSDI GAFGGEIATP NLDALAIAGL RLTDFHTAST CSPTRSMLLT GTDHHIAGIG TMAEALTPEL AGKPGYEGHL NERVVALPEL LREAGYQTLM AGKWHLGLKP EQTPHARGFE RSFSLLPGAA NHYGFEPPYD ESTPRILKGT PALYVEDERY LDTLPEGFYS SDAFGDKLLH YLKERDQSRP FFAYLPFSAP HWPLQAPQEI VEKYRGRYDA GPEALRQERL ARLKELGLVE ADVEAHPVLA LSREWEALDD EERAKSARAM EVYAAMVERM DWNIGRVVDY LRRQGELDNT FVLFMSDNGA EGALLEAFPK FGPDLLGFLD RHYDNSLENI GRANSYVWYG PRWAQAATAP SRLYKAFTTQ GGIRVPALVR YPRLSRQGAI SHAFATVMDV TPTLLDLAGV RHPGKRWRGR EIAEPRGRSW LGWLSGETEA AHDENTVTGW ELFGMRAIRQ GDWKAVYLPA PVGPATWQLY DLARDPGEIH DLADSQPGKL AELIEHWKRY VSETGVVEGA SPFLVR