Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3137 |
Symbol | |
ID | 5454824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 3349259 |
End bp | 3350788 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640878727 |
Product | sulfatase |
Protein accession | YP_001414401 |
Protein GI | 154253577 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.738751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATG CGGAAGACGG AACCGGGGAT CAACCCAGGA ATGCGGTCGT CATACTTCTC GATAGTCTCA ACCGGCATAT GATTGGCGCC TATGGTGGAC GGGAATTCGC AACGCCGAAT CTCGATCGCT TCGCCGCCCG CTCCACCCGA TTCACGAGGC ATTTCACGGG TTCGCTTCCC TGCATGCCCG CGCGCCACGA CATCCTGTGC GGCGCGCTTG ACTTTCTCTG GCGGCCCTGG GGCTCGGTCG AACTTTGGGA AGACGCGATT ACCTACGAGC TGCGAAAAAA GGGCGTGGTG ACGCAGCTCA TTTCCGATCA CCCGCATCTC TTTGAAACGG GCGGTGAAAA TTATCACGTC GATTTCACGG CCTGGGACTA TCAGCGTGGT CATGAAGGTG ACCCATGGAA GACGCGGCCG GACCCGAGCT GGGCCGGGGC GCCGAACTTC ATGCGCAAAC ACATGCCGTA TGATGACTCG CGCGGCTATT TCCGCGGAGA GGAGGATTTT CCCGGCCCCC GCACGATGGG TGCGGCAGCA CGCTGGCTGA ACGAGAATGC TGGCCACCAC GGCCGCTTCA TGCTGTTCGT GGACGAGTTC GATCCGCACG AGCCCTTCGA CACCCCCGAG CCCTATGCTT CAATGTACGA CCCGGATTGG GAAGGTGCTC ATCTCATATG GCCGCCTTAT GTGAATGGCG GTATCGAGAA GAGCGTCATC ACCGAGCGTC AGGCCCGCCA GATTCGGGCT TCCTATGGCG GCAAACTCAC CATGATTGAC AAGTGGTTCG GTAAAATTCT GGATGAGCTC GATGCCAAGG ATCTCTGGAA AGACACGCTT GTCATTCTTT GTACGGATCA TGGCCACTAT CTGGGTGAAA AGGATATATG GGGGAAGCCG GGCGTGCCCG TCTATGAACC CCTCGGGCAT ATTCCACTGA TGATCGCGCA TCCAGACGTC GCTCCCGGCA CATGCGATGC CCTCACCACA AGCGTGGATC TCTTTGCGAC GCTGGCTGAG TTGTTTGGTG TGGAAGCGCG CCAGCGTACA CATGGCCGCT CTCTGCTGCC GCTGATGAGG AAGGAGAAGC CGGGTATCCG CGATTGGCTG CTTACCGGCG TATGGGGCCG CGAGGTCCAC TACATCGACA ATCGCTTTAA ATATGCCCGC GGGCCCGCTG GCGACAACGC GCCGCTCACC ATGATGTCGA ACCGCTGGTC GACCATGCCG ACGCATTTTC TGACGCGGGA GCAGGAATTG CCATTGCCGG ATGACCGCGC TTTTCTGGAC AGAATGCCGG GCAGTGGCGT TCCGGTCATT CACCAGCAAT GGGACAGGGA TGATCCAGTG CCATTCTGGG CGCGAACACG CTTTGCAGGC CATCATCTTT ATGACCTGAC CGAGGACCCC GCCGAAGAGC GCAATTTGGC AGGAACGTCA GCCGAAGCGG ATTTAGCGGA ACGGCTGCGG GCCGCACTCG TCGAAATCGA GGCGCCCAAA AGCCAGTTGG AACGGCTAGG GCTCAACTGA
|
Protein sequence | MTNAEDGTGD QPRNAVVILL DSLNRHMIGA YGGREFATPN LDRFAARSTR FTRHFTGSLP CMPARHDILC GALDFLWRPW GSVELWEDAI TYELRKKGVV TQLISDHPHL FETGGENYHV DFTAWDYQRG HEGDPWKTRP DPSWAGAPNF MRKHMPYDDS RGYFRGEEDF PGPRTMGAAA RWLNENAGHH GRFMLFVDEF DPHEPFDTPE PYASMYDPDW EGAHLIWPPY VNGGIEKSVI TERQARQIRA SYGGKLTMID KWFGKILDEL DAKDLWKDTL VILCTDHGHY LGEKDIWGKP GVPVYEPLGH IPLMIAHPDV APGTCDALTT SVDLFATLAE LFGVEARQRT HGRSLLPLMR KEKPGIRDWL LTGVWGREVH YIDNRFKYAR GPAGDNAPLT MMSNRWSTMP THFLTREQEL PLPDDRAFLD RMPGSGVPVI HQQWDRDDPV PFWARTRFAG HHLYDLTEDP AEERNLAGTS AEADLAERLR AALVEIEAPK SQLERLGLN
|
| |