Gene Information |
Locus tag | Plav_2621 |
Symbol | |
ID | 5454260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 2827161 |
End bp | 2828699 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640878198 |
Product | sulfatase |
Protein accession | YP_001413886 |
Protein GI | 154253062 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
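The Start bp, End bp, Gene Length, and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the CDS is assumed to end in a single stop codon. Neither reading is stated in the record, and the helper names below are hypothetical. A minimal Python sketch of the arithmetic:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for Plav_2621.
length = gene_length_bp(2827161, 2828699)
print(length)                     # 1539
print(protein_length_aa(length))  # 512
```

Because the gene is on the minus strand, Start bp and End bp are still given in ascending replicon coordinates, so the same subtraction applies.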
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0161574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.118013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCGGA AAATTCTTTT CATCACCACC GATCAGATGC GCTTCGATGC CATCGGCGCG AATGGTCAGA AGGTCGCGCG CACACCCGCC ATCGACGCGC TGGCAAAAGC CGGCATCAAC TACACCCGCG CGCATAATCA GAACGTCGTC TGCATGCCCG CCCGCTCCAC CATGATCACC GGGCAATATG TGTCGACGCA TGGCGTCTGG ATGAACGGCG TGCCGCTTCC CGTCGATGCG CCCTCCGTCG CGCAATATCT CAACGAAAAA GGCGGCTACA AGACGGCGCT GATCGGCAAG GCGCATTTCG AGCCCTTCCT CGATCTCCAT CAGCAATTCT ACGAAAGCCA GATGGCGCGG CGAGGCGAAA ACGGTCCGCA TCGCGGCTTC GACTACATGG AGCTCGCCAC GCATTCGCCG CTCATCCTTC ACTACAATGA ATGGATGAAG AAGAACGAAC CCGAGGCGCT CAATTATTTC TACCAGAACC TCAACGACAA GTTTCAGGTG AACGCTGCCG GCGGCGGCGA GACCGGCGGC TGCCAGCTCC ATTTCAACAA GATCGCGCGC GAGCACTACC ACACCGACTG GGTCGCCGAC CGCACCATCG ACTGGCTCGC CTCCGTCGGC GCAGGCGACG ACTGGTTCTG CTGGATGAGC TTCCCCGATC CGCACCACCC GTGGGACCCG CCGCAATCCG AACTTCACCG TCATCCCTGG CGCGATACGC CGCTGCCGGA ATTCTATCCG GGCTCGAAGG AAAAGATCGA AGCCGTCCTC GCGGACAAGC CGCGCCACTG GATGGAATGG TACACCGGCG AGCGCGTGAC GAACTTCGAA GCCCCGCCCG AATTCCGCGC GCAGGACATG ACCGCCGATC AGGTGCAGGA GATCAACGCC TTCACCCATG TCGAAAACGA ATTGATCGAC GAAGCCATCG CGAAAGTCAT GGCCTATGTC GAAAAGCGCG GCTGGGGCGA TGATGTCGAT GTCGTCTTCA CCACCGACCA CGGCGAATTC CAGGGCGAAT TCGGCCTGCT CTTCAAGGGC CCCTATCACG TCGATGCGCT GATGCGCCTC CCCATGATCT GGCGCCCCGC GAAATCCGCG AAGGTCGCGC CCGCCGCCGT CGAAAAACCC GTCGGCCAGG TCGACCTCGC GCCCACCTTC TGCGAAATCG CCGGCCTCCC CGTGCCCGAA TGGATGCAGG GAAAGCCGAT GCCGAAAACC GATGCCGAAG GCGACGCCCA GGGCCGCGAG CGCGTCTTCA CCGAATGGGA CTGCAAACAT GTCGACGGCA CCACCGTCGG CCTCCGCACC ATCTATCGCG ACGGCTACAC CATCACCGCC TATCTCCCCG GCACCATCTA CGACGGCAGC GAAGGCGAGC TTTACGACCA CGCCAACGAT CCGCGGCAGT TCCGCAACCT CTGGAACGAC CCGGCCTACG CCAAGCTGAA ATCCGATCTT CTCGCCGATC TGAAAGACAA CCTCCCCCCC GTCCGCGACC CCCAGCTCGA ATACGTCGCC CCTGTTTAA
|
Protein sequence | MGRKILFITT DQMRFDAIGA NGQKVARTPA IDALAKAGIN YTRAHNQNVV CMPARSTMIT GQYVSTHGVW MNGVPLPVDA PSVAQYLNEK GGYKTALIGK AHFEPFLDLH QQFYESQMAR RGENGPHRGF DYMELATHSP LILHYNEWMK KNEPEALNYF YQNLNDKFQV NAAGGGETGG CQLHFNKIAR EHYHTDWVAD RTIDWLASVG AGDDWFCWMS FPDPHHPWDP PQSELHRHPW RDTPLPEFYP GSKEKIEAVL ADKPRHWMEW YTGERVTNFE APPEFRAQDM TADQVQEINA FTHVENELID EAIAKVMAYV EKRGWGDDVD VVFTTDHGEF QGEFGLLFKG PYHVDALMRL PMIWRPAKSA KVAPAAVEKP VGQVDLAPTF CEIAGLPVPE WMQGKPMPKT DAEGDAQGRE RVFTEWDCKH VDGTTVGLRT IYRDGYTITA YLPGTIYDGS EGELYDHAND PRQFRNLWND PAYAKLKSDL LADLKDNLPP VRDPQLEYVA PV
|
| |
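The relationship between the Gene sequence and Protein sequence fields can be sketched in Python. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs mainly in which codons may act as start codons, so a standard codon table is enough for a CDS that starts with ATG, as this one does. The gene sequence appears to be listed already in coding orientation (it begins with ATG and ends with TAA), so the sketch assumes no reverse complement is needed despite the minus strand. The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record can be
    pasted in directly. Alternative start codons are not special-cased.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate_cds("ATGGGCCGGA AAATTCTTTT CATCACCACC"))  # MGRKILFITT
print(translate_cds("CCTGTTTAA"))                         # PV
```

The two outputs match the first ten and last two residues of the listed protein sequence. Passing the full Gene sequence field to the same function is a simple way to check the rest.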