Gene Plav_3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3137 
Symbol 
ID5454824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3349259 
End bp3350788 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content59% 
IMG OID640878727 
Productsulfatase 
Protein accessionYP_001414401 
Protein GI154253577 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.738751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATG CGGAAGACGG AACCGGGGAT CAACCCAGGA ATGCGGTCGT CATACTTCTC 
GATAGTCTCA ACCGGCATAT GATTGGCGCC TATGGTGGAC GGGAATTCGC AACGCCGAAT
CTCGATCGCT TCGCCGCCCG CTCCACCCGA TTCACGAGGC ATTTCACGGG TTCGCTTCCC
TGCATGCCCG CGCGCCACGA CATCCTGTGC GGCGCGCTTG ACTTTCTCTG GCGGCCCTGG
GGCTCGGTCG AACTTTGGGA AGACGCGATT ACCTACGAGC TGCGAAAAAA GGGCGTGGTG
ACGCAGCTCA TTTCCGATCA CCCGCATCTC TTTGAAACGG GCGGTGAAAA TTATCACGTC
GATTTCACGG CCTGGGACTA TCAGCGTGGT CATGAAGGTG ACCCATGGAA GACGCGGCCG
GACCCGAGCT GGGCCGGGGC GCCGAACTTC ATGCGCAAAC ACATGCCGTA TGATGACTCG
CGCGGCTATT TCCGCGGAGA GGAGGATTTT CCCGGCCCCC GCACGATGGG TGCGGCAGCA
CGCTGGCTGA ACGAGAATGC TGGCCACCAC GGCCGCTTCA TGCTGTTCGT GGACGAGTTC
GATCCGCACG AGCCCTTCGA CACCCCCGAG CCCTATGCTT CAATGTACGA CCCGGATTGG
GAAGGTGCTC ATCTCATATG GCCGCCTTAT GTGAATGGCG GTATCGAGAA GAGCGTCATC
ACCGAGCGTC AGGCCCGCCA GATTCGGGCT TCCTATGGCG GCAAACTCAC CATGATTGAC
AAGTGGTTCG GTAAAATTCT GGATGAGCTC GATGCCAAGG ATCTCTGGAA AGACACGCTT
GTCATTCTTT GTACGGATCA TGGCCACTAT CTGGGTGAAA AGGATATATG GGGGAAGCCG
GGCGTGCCCG TCTATGAACC CCTCGGGCAT ATTCCACTGA TGATCGCGCA TCCAGACGTC
GCTCCCGGCA CATGCGATGC CCTCACCACA AGCGTGGATC TCTTTGCGAC GCTGGCTGAG
TTGTTTGGTG TGGAAGCGCG CCAGCGTACA CATGGCCGCT CTCTGCTGCC GCTGATGAGG
AAGGAGAAGC CGGGTATCCG CGATTGGCTG CTTACCGGCG TATGGGGCCG CGAGGTCCAC
TACATCGACA ATCGCTTTAA ATATGCCCGC GGGCCCGCTG GCGACAACGC GCCGCTCACC
ATGATGTCGA ACCGCTGGTC GACCATGCCG ACGCATTTTC TGACGCGGGA GCAGGAATTG
CCATTGCCGG ATGACCGCGC TTTTCTGGAC AGAATGCCGG GCAGTGGCGT TCCGGTCATT
CACCAGCAAT GGGACAGGGA TGATCCAGTG CCATTCTGGG CGCGAACACG CTTTGCAGGC
CATCATCTTT ATGACCTGAC CGAGGACCCC GCCGAAGAGC GCAATTTGGC AGGAACGTCA
GCCGAAGCGG ATTTAGCGGA ACGGCTGCGG GCCGCACTCG TCGAAATCGA GGCGCCCAAA
AGCCAGTTGG AACGGCTAGG GCTCAACTGA
 
Protein sequence
MTNAEDGTGD QPRNAVVILL DSLNRHMIGA YGGREFATPN LDRFAARSTR FTRHFTGSLP 
CMPARHDILC GALDFLWRPW GSVELWEDAI TYELRKKGVV TQLISDHPHL FETGGENYHV
DFTAWDYQRG HEGDPWKTRP DPSWAGAPNF MRKHMPYDDS RGYFRGEEDF PGPRTMGAAA
RWLNENAGHH GRFMLFVDEF DPHEPFDTPE PYASMYDPDW EGAHLIWPPY VNGGIEKSVI
TERQARQIRA SYGGKLTMID KWFGKILDEL DAKDLWKDTL VILCTDHGHY LGEKDIWGKP
GVPVYEPLGH IPLMIAHPDV APGTCDALTT SVDLFATLAE LFGVEARQRT HGRSLLPLMR
KEKPGIRDWL LTGVWGREVH YIDNRFKYAR GPAGDNAPLT MMSNRWSTMP THFLTREQEL
PLPDDRAFLD RMPGSGVPVI HQQWDRDDPV PFWARTRFAG HHLYDLTEDP AEERNLAGTS
AEADLAERLR AALVEIEAPK SQLERLGLN