Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_1131 |
Symbol | |
ID | 5455287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 1246514 |
End bp | 1248169 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640876701 |
Product | sulfatase |
Protein accession | YP_001412409 |
Protein GI | 154251585 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCG GAAAGCGCGG GGCGAGTATT CTCGCCGCGC TCGTCGTCAT GCTCGTTGTC GGCGCGGTGC TGCTCAGCCG CTACTGGATC TACATTCCGG GCCTCCTGAT GGAATTGCGC GATCCGGTGC AGCCCAACCG GGAAGTCACT TGGGAGCAGG GCCCTGCCGA AGCCACGGCA TCCCCCACCG AGCGGGCGCC AAACGTCGTC TTCATCCTCG TGGACGACCT CGGCTTCAAC GATTTGAGCT TCGCAGGCGG CGGCATGGGC GGCGGCACCG TCCGTACACC CCATATCGAC AGCATTGCCC ATGAAGGTGT CTTGTTCGCT AACGGTTATT CTGGAAATGC CACCTGCGCA CCCTCCCGTG CTGCGATCAT GACAGGCCGC TATGCCACGC GCTTCGGCTT CGAGTTCACG CCTGCGCCCA AGGCTTTCCA GAAAGGCATC GCCACCTTCA ACAAGGATGC CGGCGCGCAA TACTTTACCG AGCGAGAGAA AGACGTTCCC GAAGTCGATG CCATGAGCTT GCCCACCAGC GAAATCACCA TCGCGGAAAT GCTGAAGCAG CAGGGCTATC ACAATGTCAT GCTCGGCAAG TGGCATCTCG GCGGCACGGA TACATCTCGT CCCGAGAAGC GCGGCTTCGA CGAATTTCTC GGGTTCATTC CCGGCGCCTC GATGTTCCTG CCGCGCAACA GTCCAGACGT CGTGAACTCG ATCCAGGATT TCGATCCAAT CGACCGCTTT CTCTGGGCCA ATCTGCCTTT CGCGGTCCAG TTCAACGGCG GTGACAGGTT CGAGCCTTCC GAATACATGA CGGACTATCT CACCAATGAG GCCGTCAAGG CGATCGGGGC AAATCGCAAT CGCCCTTTCT TCATGTATGT TGCCTACAAC GCGCCGCATA CGCCGCTTCA GGCGCTGAAA TCGGATTATG ACGCGCTGGC CCATATCGAA AATCACACCG AGCGCGTCTA TGCCGCGATG GTCGTCGCGC TCGACCGTGG CGTCGGCAAG ATCAAGCAAG CCCTTCGCGA CAATGGTCTC GAAGAAAACA CGATCATCAT TTTTACCAGC GACAATGGCG GCGCGGGTTA TGTCGGCCTG CCGGATCTGA ACAAGCCTTA TCGCGGTTGG AAGGCCTCCT TCTTCGAGGG CGGCATCCAC GTTCCCTTCT TCATGAAGTG GCCTGCACGC ATAGCACCCG GCACCGTCTA TGAATATCCG GTCGCCCATG TCGATATCTT CAGCACCGTC GCTGCGGCAG CAGGTGCGAC ACCGCCGGCG GATCGCGTCA TCGATGGCGT TGATTTGACG GCACAGGTGA CGGGCAACAC CGATCCTTCG CGCACGCTTT TCTGGCGGTC GGGACATTAC AAGGTTCTGC TTTCCGAAGG CTGGAAACTC CAGACGTCCG AACGCCCGGA GAAGAAGTGG CTCTTCAATC TCGCTGCCGA TCCCACGGAG CAGAAAAATC TGGCGGACGC CGAACCGGAA AAGCTCTCCG AGATGATGGA AATGCTCGCA AAAGTTGACG GCGAACAATC GGCGCCGGTC TGGCCGGCGC TCATCGAAGC GCCGATCATG ATAGACCGTC CTCTCGGCGG TGCGCCGCGC GGGCCGGAAG ATGAATTCGT CTTCTGGGCA AATTGA
|
Protein sequence | MKIGKRGASI LAALVVMLVV GAVLLSRYWI YIPGLLMELR DPVQPNREVT WEQGPAEATA SPTERAPNVV FILVDDLGFN DLSFAGGGMG GGTVRTPHID SIAHEGVLFA NGYSGNATCA PSRAAIMTGR YATRFGFEFT PAPKAFQKGI ATFNKDAGAQ YFTEREKDVP EVDAMSLPTS EITIAEMLKQ QGYHNVMLGK WHLGGTDTSR PEKRGFDEFL GFIPGASMFL PRNSPDVVNS IQDFDPIDRF LWANLPFAVQ FNGGDRFEPS EYMTDYLTNE AVKAIGANRN RPFFMYVAYN APHTPLQALK SDYDALAHIE NHTERVYAAM VVALDRGVGK IKQALRDNGL EENTIIIFTS DNGGAGYVGL PDLNKPYRGW KASFFEGGIH VPFFMKWPAR IAPGTVYEYP VAHVDIFSTV AAAAGATPPA DRVIDGVDLT AQVTGNTDPS RTLFWRSGHY KVLLSEGWKL QTSERPEKKW LFNLAADPTE QKNLADAEPE KLSEMMEMLA KVDGEQSAPV WPALIEAPIM IDRPLGGAPR GPEDEFVFWA N
|
| |