Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_3441 |
Symbol | |
ID | 4013899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | + |
Start bp | 3641671 |
End bp | 3643206 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637943104 |
Product | sulfatase |
Protein accession | YP_550248 |
Protein GI | 91789296 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0813077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAC CCAACATACT GCTCATCACC ACCGACCAGC ACCGGGGTGA CTGCCTGGGC TTTGCAGGCC GCAAGGTCAA GACCCCGCAC ATCGACGAAA TGGCCAGGAC GGGCACGCAC TTCACCTCGT GCATCACGCC GAACATCGTG TGCCAGCCCT CTCGCGCCTC CATCCTGACC GGGTTGTTGC CGCTGACGCA CGGCGTATGC GACAACGGCA TTGACCTGGA TGAGGCGAGA GGCGAAGCGG GCTTTGCCGG CACACTGGCA AGCAGCGGTT ATTCGACAGG CTTTATCGGC AAGGCGCATT TCTCGACCCA CCACACGTTT GCAAAAACCG GCCGCCCCGA ATGCCAGTTC AGCGAGGCCG ACTACGGCCC CGCGTGGTAC GGCCCATACA TGGGCTTTGA ACATGTGGAG CTTGCCGTGG AAGGGCACAA CTACTGGTTG CCCACCCCGC TGCCGGGCGG GCTGCACCAT TCGCGCTGGT ACTACGGCGA TGGTCTGGGC GAGATGCGCA ACAGGCTTTA CCAGCAAGAC ATGGGGCCAC CCAGTGGCGC GCCGCAAACC TTCAATTCCG CCCTGCCCAG CGCGTGGCAC AACTCCACCT GGATAGGCGA CCGGACGATC GAGTTCATGC GCAAACATGC AGGCGAGGCC GCAAAACGCT TCTGCCTGTG GGCCTCGTTT CCAGATCCGC ATCACCCCTT TGATTGCCCG GAGCCATGGT CACGGCTTCA CCACCCGGAT GAGGTCGACC TGCCGGCGCA CCGGACCACC GACTTCGAGC GCCGGCCCTG GTGGCACAAG GCCAGCATGG ACAGCAAGCC CGTCGGCGAT GCGGCCGTGC AGGCCCTGCG GCAAAACTTC TCGCGCATGC CTACACCGGC CGAGCAGCAA CTGCGCAACA TCACCGCTAA CTACTACGGC ATGATTTCGC TGGTGGACCA CCAGGTGGGC CGCATCCAGA CCGCGTTGCA GCAACTGGGC CTGGACGGCA ACACCCTCGT GATCTTCACC TCTGATCACG GCGAGTGGCT GGGTGACCAC GGGCTGATGC TCAAGGGCCC GATCCCTTAC GAAGGTGTCC TGCGCGTGGG CATGGTTGTC AACGGCCCGC AGGTCCAGGC CGGCCAGGTG CGGCATGAGC CGGTATCAAC GCTCGACCTG GCCGCCACCT TTGCGGACTA TGCAACGGCC ACCGCGCTGG CGCCCCTGCA CGGCCAGAGC CTGCGGCCTT TGTTGGAAGG CGGGCAACAG ACACGCGACT TCGCATTGAG CGAATGGAAC GTGGCCGCAT CGCGCTGCGG TTTGGAACTG CAACTGCGAA CCGTGCGCAC CGAAAACTGG AAACTCACCC TCGAGCAAAA CTCCGGCGCA GGCGAGATGT ACTGCCTGTC CGAAGATCCC AATGAGATGG ACAACCTGTT CGACGACCCG GGCTATACGG CAAAGCGCAA GGAGCTCAGT GACATGATCG CATCGCGCCC CCGCGACCAG TTGGCCCAAG CGCCCGCGCC TTCGGGCATT GCATAG
|
Protein sequence | MKRPNILLIT TDQHRGDCLG FAGRKVKTPH IDEMARTGTH FTSCITPNIV CQPSRASILT GLLPLTHGVC DNGIDLDEAR GEAGFAGTLA SSGYSTGFIG KAHFSTHHTF AKTGRPECQF SEADYGPAWY GPYMGFEHVE LAVEGHNYWL PTPLPGGLHH SRWYYGDGLG EMRNRLYQQD MGPPSGAPQT FNSALPSAWH NSTWIGDRTI EFMRKHAGEA AKRFCLWASF PDPHHPFDCP EPWSRLHHPD EVDLPAHRTT DFERRPWWHK ASMDSKPVGD AAVQALRQNF SRMPTPAEQQ LRNITANYYG MISLVDHQVG RIQTALQQLG LDGNTLVIFT SDHGEWLGDH GLMLKGPIPY EGVLRVGMVV NGPQVQAGQV RHEPVSTLDL AATFADYATA TALAPLHGQS LRPLLEGGQQ TRDFALSEWN VAASRCGLEL QLRTVRTENW KLTLEQNSGA GEMYCLSEDP NEMDNLFDDP GYTAKRKELS DMIASRPRDQ LAQAPAPSGI A
|
| |