Gene Information |
Locus tag | Plav_3641 |
Symbol | |
ID | 5454446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 3897688 |
End bp | 3899148 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640879225 |
Product | sulfatase |
Protein accession | YP_001414896 |
Protein GI | 154254072 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
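
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function name `check_lengths` and its result keys are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon that is not counted in the protein length.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length (both are assumptions).
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = span // 3                # whole codons, stop codon included
    return {
        "span_bp": span,
        "span_matches": span == gene_len_bp,
        "multiple_of_three": span % 3 == 0,
        "expected_protein_aa": codons - 1,   # drop the stop codon
        "protein_matches": codons - 1 == protein_len_aa,
    }


# Values copied from the Plav_3641 record above.
result = check_lengths(3897688, 3899148, 1461, 486)
print(result)
```

For this record the inclusive span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.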
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.00000013502 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGTTCA TCTATATCGA CATCGACACG CTGCGCGCCG ACCATCTCGG CTGTTACGGC TATCACCGCA ACACGAGCCC GCATATCGAC CGGCTGGCGG CGCGGGGCCT GCGCTTCGAA AATGTCCACG CCTCCGACAC GCCCTGCCTG CCGAGCCGGA CGGCGCTGCT GACAGGCCGC TTCGGCATTC ACAATGGCGT CGTCAATCAT GGCGGGGCCG ATGCCGATCC CGTGATCGGC GGCGCGGACA GGGCCTTCTG GTCGCAGTTT CATCTGCACA GTTTTCCCGC GCAACTGAAA CGCGCCGGGC TCAAAACCGT CAGCGTCAGT TCCTTCGCGC ATCGCCATTC GGCCTTTCAC TGGCATGCGG GCTTCGACGA AACCTACAAT GTCGGCAAGT TCGGGCTGGA GACGGCGGAT GAAGTGTTCG CCATCGCATC GCGGTGGCTG GAAGCGAATG GACGGAACGG CGACTGGTTT CTGCATGTGC ATATGTGGGA CCCGCATACG CCCTATCGCG CCGCGCCGGA CTATGGCGAG CCTTTCGCGG AGACGCCGCT GCCCGGCTGG CTGACCGAGG AGAAGCGGGC GCGGGACTGG CAGGGCTGCG GACCGCACAG CGCGCAGGAA TGTTCGGGCT TTGCGCCGAA CCCGAAAGCG GCGCAGGCCT TTCCGCGCCA GCCGCAACAG ATTCCCGACA TGGCGGCGGT GCGCGCGATG TTCGACGGCT ACGACACAGG CGTTCTCGTT GCCGACGAAT ATGTCGGGCG CATCGTTGCG CTGCTGGGCG AACTCGGCAT CGAAGAGGAA GTCGCGATCA TGATTTCGTC GGATCATGGC GAGACGCTGG GGGAACTCAA TGTCTATGGC GACCACCAGA CCGTCGACCA GCACACCACC CGCGTGCCGC TTGTTCTCGT CTGGCCGGGA CTTGAGGGCG GCAAGACCCT TTCCGCCTTC CACTACCAGA TCGACGTGAC GGCGACGCTG CTCGAGCTTC TCGGCCGCAA GGTGCCGGAG AGCTGGGACG GCGTTTCATT CGCGGGCAGC CTGAGGGCGG GCGAGGACAA GGGCCGGGAT CACCTCATCG TTTCGCAAGG CGCGTGGACC TGCCAGCGCG GCGTACGCTT CGACAAGTGG ATCCTGATTT CGACGATGCA TGACGGCTAT CATCTCTATG ACGAGGCGAT GCTGTTCAAC CTCGAAGACG ACCCGCATGA GGAGAGGAAC CTGGCGGAGG CGCAGCCCGA AATCGCGGCG CGCGGATTTC ACCTGCTCTC CGCCTGGCAC GAGGACATGA TGAAAGATGC GGCGCGGGGG CGCGACCCGC TGGCGAATGT GGTGGCCGAG GGCGGGCCCT ATCATGTGCG CGGCGCGCTG CCCGCCTATC TCGAACGGCT GCGCGCGACC GGCCGCGCGG CAATGGCCGA GAGGCTCAGC GCGAAATATC CGGCGGGCTG A
|
Protein sequence | MKFIYIDIDT LRADHLGCYG YHRNTSPHID RLAARGLRFE NVHASDTPCL PSRTALLTGR FGIHNGVVNH GGADADPVIG GADRAFWSQF HLHSFPAQLK RAGLKTVSVS SFAHRHSAFH WHAGFDETYN VGKFGLETAD EVFAIASRWL EANGRNGDWF LHVHMWDPHT PYRAAPDYGE PFAETPLPGW LTEEKRARDW QGCGPHSAQE CSGFAPNPKA AQAFPRQPQQ IPDMAAVRAM FDGYDTGVLV ADEYVGRIVA LLGELGIEEE VAIMISSDHG ETLGELNVYG DHQTVDQHTT RVPLVLVWPG LEGGKTLSAF HYQIDVTATL LELLGRKVPE SWDGVSFAGS LRAGEDKGRD HLIVSQGAWT CQRGVRFDKW ILISTMHDGY HLYDEAMLFN LEDDPHEERN LAEAQPEIAA RGFHLLSAWH EDMMKDAARG RDPLANVVAE GGPYHVRGAL PAYLERLRAT GRAAMAERLS AKYPAG
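
The sequences above are printed in space-separated blocks of ten residues, so they need to be normalized before any computation such as a GC-content check. The following is a rough sketch under that assumption; the helper names `normalize_sequence` and `gc_fraction` are hypothetical and do not come from the source database.

```python
def normalize_sequence(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence above, used only as a short demo;
# pass the full "Gene sequence" field to compare against the reported GC content.
demo = normalize_sequence("ATGAAGTTCA TCTATATCGA CATCGACACG")
print(len(demo), demo.startswith("ATG"), gc_fraction(demo))
```

Run on the complete gene sequence, the same helpers would let a reader compare the result with the GC content field in the Gene Information table.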
|
| |