Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPA7_2624 |
Symbol | |
ID | 5357304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa PA7 |
Kingdom | Bacteria |
Replicon accession | NC_009656 |
Strand | - |
Start bp | 2644259 |
End bp | 2645884 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640811675 |
Product | arylsulfatase |
Protein accession | YP_001347986 |
Protein GI | 152988895 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.344429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGTTC TGCTCGCCGG CGCGATCATG TCGGCGTCCG TCCAGGCGGG GGCCCGGCAG CCGAACGTAC TGGTCATAGT CGCCGACGAC CTGGCCTTTT CCGACCTTGG CGCGTTCGGC GGCGAGATCG ACACGCCGAA CCTGGACGCC CTCGCCAGGG AGGGCGTCCG CTTCACCGGC TTCCACACCG CTCCCACCTG CTCGCCCTCG CGCGCCATGT TGCTGACCGG CACCGACAGC CACCGCGCCG GCTTCGGCAC CATGGCCGAA CGGGTGGCGC CCAACCAGAA AGGCAGACCA GGCTACGAAG GCTATCTGCG GCCGGACGTG GTCACCCTGG CGGAGCGCTT CGCCGCCGCC GGCTACCGCA CCATGATGGC CGGCAAGTGG CACATGGGCC TCGCCCCGCA ACAGGATCCG CACGCCCGCG GCTTCCAGCA CAGCTTCGCG CTGCTGCAGG GCTCCCACAA CCACTACGGC CTGGACTTCG CCGCGGACCT GAACAAGGCG CCGGAGCAGA ACACCGGCGG TGCGGTCTAT ACCGAGGATG GCGTACGGCT CGCGCAACTG CCGCGGGGTT TCTATTCCTC CGACTTCTTC ACCAGCAAGC TGCTGGAATT TCTCGGCCGG CGTGACGAAG GGGCCAAGCC ATTCTTCGCC TACCTGGCTT TCTCCGCTCC CCATTGGCCA TTGCAGGCGC CGAGGGAAGT CATCGAGAAA TACAAGGGAC GCTACGACCA GGGCTACGAC CGGCTCCGCG AGAGCCGTCT CGCCCGGCAG GCGGGACTCG GCCTGCTGGC TCCGAGCACG CAGGTCCGGC CAATGGTCAT GACCCGGCGC TGGGACGAGC TGGATCCGGA ACGGCAACGA CTCGCCGCCC GCGACATGGA AGTCTATGCC GCGATGGTGG ACCGCCTGGA CCAGAACGTC GGCCGTGTCG TCAGCGCACT CAAGCGCAGC GGCGAGCTGG ACGACACACT CATCCTGTTC CTTTCCGACA ACGGTCCCGA AGGCCGCGAT ACCGCCACCA CCAAGGAAAT GCCGGCCAGT GCCGACAACC GCCTGGAGAA TCGCGGCAAC GCCACGTCCT ACTTCGGCTA CGGCCCCGGC TGGGCCCAGG CTGGCAGCGC GCCGTCCTGG CTGATCAAGT ACTATGCCAC CGAGGGCGGT ACCCGCAATG CCGCGTTTCT TCGGTACCCC GGCCTGGGCA GGTCCGGCGC CGTCTCGACG GCCTTCCTCT CGATAATGGA CGTCACCCCG ACTCTCCTTG AATTCGCTGG CATTCCGCTG ACCGACGGTA GTTTCCAGGG CCGCGCGATC GAGCCGCTAC GCGGGCGGAG CTGGGTGCCC TACCTGAAGG GTCGGCGCGA GTACGTCTAC GGACCGGACG ACGCGATCGG CACCGAAGTC TTCGGCTCGC GCTCCCTCCG CCAGGGCGAC TGGAAGATCA CCGATACCGG CAATGGCGCC TGGCGCCTGT TCGATGTGGC CCGGGACCCG GGTGAAACCC ATGATCTCTC CGCCGAGATG CCGGACCGGC TGAAGCAGCT GGAAGCCGCC TGGGGAAGCT ATGCCAGGGA CGTGGGTGTC GTCGCGCCGC CCATGGCGGT CCTTCCGCAA CCCTGA
|
Protein sequence | MAVLLAGAIM SASVQAGARQ PNVLVIVADD LAFSDLGAFG GEIDTPNLDA LAREGVRFTG FHTAPTCSPS RAMLLTGTDS HRAGFGTMAE RVAPNQKGRP GYEGYLRPDV VTLAERFAAA GYRTMMAGKW HMGLAPQQDP HARGFQHSFA LLQGSHNHYG LDFAADLNKA PEQNTGGAVY TEDGVRLAQL PRGFYSSDFF TSKLLEFLGR RDEGAKPFFA YLAFSAPHWP LQAPREVIEK YKGRYDQGYD RLRESRLARQ AGLGLLAPST QVRPMVMTRR WDELDPERQR LAARDMEVYA AMVDRLDQNV GRVVSALKRS GELDDTLILF LSDNGPEGRD TATTKEMPAS ADNRLENRGN ATSYFGYGPG WAQAGSAPSW LIKYYATEGG TRNAAFLRYP GLGRSGAVST AFLSIMDVTP TLLEFAGIPL TDGSFQGRAI EPLRGRSWVP YLKGRREYVY GPDDAIGTEV FGSRSLRQGD WKITDTGNGA WRLFDVARDP GETHDLSAEM PDRLKQLEAA WGSYARDVGV VAPPMAVLPQ P
|
| |