Gene Plav_1131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1131 
Symbol 
ID5455287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1246514 
End bp1248169 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content59% 
IMG OID640876701 
Productsulfatase 
Protein accessionYP_001412409 
Protein GI154251585 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG GAAAGCGCGG GGCGAGTATT CTCGCCGCGC TCGTCGTCAT GCTCGTTGTC 
GGCGCGGTGC TGCTCAGCCG CTACTGGATC TACATTCCGG GCCTCCTGAT GGAATTGCGC
GATCCGGTGC AGCCCAACCG GGAAGTCACT TGGGAGCAGG GCCCTGCCGA AGCCACGGCA
TCCCCCACCG AGCGGGCGCC AAACGTCGTC TTCATCCTCG TGGACGACCT CGGCTTCAAC
GATTTGAGCT TCGCAGGCGG CGGCATGGGC GGCGGCACCG TCCGTACACC CCATATCGAC
AGCATTGCCC ATGAAGGTGT CTTGTTCGCT AACGGTTATT CTGGAAATGC CACCTGCGCA
CCCTCCCGTG CTGCGATCAT GACAGGCCGC TATGCCACGC GCTTCGGCTT CGAGTTCACG
CCTGCGCCCA AGGCTTTCCA GAAAGGCATC GCCACCTTCA ACAAGGATGC CGGCGCGCAA
TACTTTACCG AGCGAGAGAA AGACGTTCCC GAAGTCGATG CCATGAGCTT GCCCACCAGC
GAAATCACCA TCGCGGAAAT GCTGAAGCAG CAGGGCTATC ACAATGTCAT GCTCGGCAAG
TGGCATCTCG GCGGCACGGA TACATCTCGT CCCGAGAAGC GCGGCTTCGA CGAATTTCTC
GGGTTCATTC CCGGCGCCTC GATGTTCCTG CCGCGCAACA GTCCAGACGT CGTGAACTCG
ATCCAGGATT TCGATCCAAT CGACCGCTTT CTCTGGGCCA ATCTGCCTTT CGCGGTCCAG
TTCAACGGCG GTGACAGGTT CGAGCCTTCC GAATACATGA CGGACTATCT CACCAATGAG
GCCGTCAAGG CGATCGGGGC AAATCGCAAT CGCCCTTTCT TCATGTATGT TGCCTACAAC
GCGCCGCATA CGCCGCTTCA GGCGCTGAAA TCGGATTATG ACGCGCTGGC CCATATCGAA
AATCACACCG AGCGCGTCTA TGCCGCGATG GTCGTCGCGC TCGACCGTGG CGTCGGCAAG
ATCAAGCAAG CCCTTCGCGA CAATGGTCTC GAAGAAAACA CGATCATCAT TTTTACCAGC
GACAATGGCG GCGCGGGTTA TGTCGGCCTG CCGGATCTGA ACAAGCCTTA TCGCGGTTGG
AAGGCCTCCT TCTTCGAGGG CGGCATCCAC GTTCCCTTCT TCATGAAGTG GCCTGCACGC
ATAGCACCCG GCACCGTCTA TGAATATCCG GTCGCCCATG TCGATATCTT CAGCACCGTC
GCTGCGGCAG CAGGTGCGAC ACCGCCGGCG GATCGCGTCA TCGATGGCGT TGATTTGACG
GCACAGGTGA CGGGCAACAC CGATCCTTCG CGCACGCTTT TCTGGCGGTC GGGACATTAC
AAGGTTCTGC TTTCCGAAGG CTGGAAACTC CAGACGTCCG AACGCCCGGA GAAGAAGTGG
CTCTTCAATC TCGCTGCCGA TCCCACGGAG CAGAAAAATC TGGCGGACGC CGAACCGGAA
AAGCTCTCCG AGATGATGGA AATGCTCGCA AAAGTTGACG GCGAACAATC GGCGCCGGTC
TGGCCGGCGC TCATCGAAGC GCCGATCATG ATAGACCGTC CTCTCGGCGG TGCGCCGCGC
GGGCCGGAAG ATGAATTCGT CTTCTGGGCA AATTGA
 
Protein sequence
MKIGKRGASI LAALVVMLVV GAVLLSRYWI YIPGLLMELR DPVQPNREVT WEQGPAEATA 
SPTERAPNVV FILVDDLGFN DLSFAGGGMG GGTVRTPHID SIAHEGVLFA NGYSGNATCA
PSRAAIMTGR YATRFGFEFT PAPKAFQKGI ATFNKDAGAQ YFTEREKDVP EVDAMSLPTS
EITIAEMLKQ QGYHNVMLGK WHLGGTDTSR PEKRGFDEFL GFIPGASMFL PRNSPDVVNS
IQDFDPIDRF LWANLPFAVQ FNGGDRFEPS EYMTDYLTNE AVKAIGANRN RPFFMYVAYN
APHTPLQALK SDYDALAHIE NHTERVYAAM VVALDRGVGK IKQALRDNGL EENTIIIFTS
DNGGAGYVGL PDLNKPYRGW KASFFEGGIH VPFFMKWPAR IAPGTVYEYP VAHVDIFSTV
AAAAGATPPA DRVIDGVDLT AQVTGNTDPS RTLFWRSGHY KVLLSEGWKL QTSERPEKKW
LFNLAADPTE QKNLADAEPE KLSEMMEMLA KVDGEQSAPV WPALIEAPIM IDRPLGGAPR
GPEDEFVFWA N