Gene Bpro_3441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_3441 
Symbol 
ID4013899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp3641671 
End bp3643206 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content64% 
IMG OID637943104 
Productsulfatase 
Protein accessionYP_550248 
Protein GI91789296 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0813077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC CCAACATACT GCTCATCACC ACCGACCAGC ACCGGGGTGA CTGCCTGGGC 
TTTGCAGGCC GCAAGGTCAA GACCCCGCAC ATCGACGAAA TGGCCAGGAC GGGCACGCAC
TTCACCTCGT GCATCACGCC GAACATCGTG TGCCAGCCCT CTCGCGCCTC CATCCTGACC
GGGTTGTTGC CGCTGACGCA CGGCGTATGC GACAACGGCA TTGACCTGGA TGAGGCGAGA
GGCGAAGCGG GCTTTGCCGG CACACTGGCA AGCAGCGGTT ATTCGACAGG CTTTATCGGC
AAGGCGCATT TCTCGACCCA CCACACGTTT GCAAAAACCG GCCGCCCCGA ATGCCAGTTC
AGCGAGGCCG ACTACGGCCC CGCGTGGTAC GGCCCATACA TGGGCTTTGA ACATGTGGAG
CTTGCCGTGG AAGGGCACAA CTACTGGTTG CCCACCCCGC TGCCGGGCGG GCTGCACCAT
TCGCGCTGGT ACTACGGCGA TGGTCTGGGC GAGATGCGCA ACAGGCTTTA CCAGCAAGAC
ATGGGGCCAC CCAGTGGCGC GCCGCAAACC TTCAATTCCG CCCTGCCCAG CGCGTGGCAC
AACTCCACCT GGATAGGCGA CCGGACGATC GAGTTCATGC GCAAACATGC AGGCGAGGCC
GCAAAACGCT TCTGCCTGTG GGCCTCGTTT CCAGATCCGC ATCACCCCTT TGATTGCCCG
GAGCCATGGT CACGGCTTCA CCACCCGGAT GAGGTCGACC TGCCGGCGCA CCGGACCACC
GACTTCGAGC GCCGGCCCTG GTGGCACAAG GCCAGCATGG ACAGCAAGCC CGTCGGCGAT
GCGGCCGTGC AGGCCCTGCG GCAAAACTTC TCGCGCATGC CTACACCGGC CGAGCAGCAA
CTGCGCAACA TCACCGCTAA CTACTACGGC ATGATTTCGC TGGTGGACCA CCAGGTGGGC
CGCATCCAGA CCGCGTTGCA GCAACTGGGC CTGGACGGCA ACACCCTCGT GATCTTCACC
TCTGATCACG GCGAGTGGCT GGGTGACCAC GGGCTGATGC TCAAGGGCCC GATCCCTTAC
GAAGGTGTCC TGCGCGTGGG CATGGTTGTC AACGGCCCGC AGGTCCAGGC CGGCCAGGTG
CGGCATGAGC CGGTATCAAC GCTCGACCTG GCCGCCACCT TTGCGGACTA TGCAACGGCC
ACCGCGCTGG CGCCCCTGCA CGGCCAGAGC CTGCGGCCTT TGTTGGAAGG CGGGCAACAG
ACACGCGACT TCGCATTGAG CGAATGGAAC GTGGCCGCAT CGCGCTGCGG TTTGGAACTG
CAACTGCGAA CCGTGCGCAC CGAAAACTGG AAACTCACCC TCGAGCAAAA CTCCGGCGCA
GGCGAGATGT ACTGCCTGTC CGAAGATCCC AATGAGATGG ACAACCTGTT CGACGACCCG
GGCTATACGG CAAAGCGCAA GGAGCTCAGT GACATGATCG CATCGCGCCC CCGCGACCAG
TTGGCCCAAG CGCCCGCGCC TTCGGGCATT GCATAG
 
Protein sequence
MKRPNILLIT TDQHRGDCLG FAGRKVKTPH IDEMARTGTH FTSCITPNIV CQPSRASILT 
GLLPLTHGVC DNGIDLDEAR GEAGFAGTLA SSGYSTGFIG KAHFSTHHTF AKTGRPECQF
SEADYGPAWY GPYMGFEHVE LAVEGHNYWL PTPLPGGLHH SRWYYGDGLG EMRNRLYQQD
MGPPSGAPQT FNSALPSAWH NSTWIGDRTI EFMRKHAGEA AKRFCLWASF PDPHHPFDCP
EPWSRLHHPD EVDLPAHRTT DFERRPWWHK ASMDSKPVGD AAVQALRQNF SRMPTPAEQQ
LRNITANYYG MISLVDHQVG RIQTALQQLG LDGNTLVIFT SDHGEWLGDH GLMLKGPIPY
EGVLRVGMVV NGPQVQAGQV RHEPVSTLDL AATFADYATA TALAPLHGQS LRPLLEGGQQ
TRDFALSEWN VAASRCGLEL QLRTVRTENW KLTLEQNSGA GEMYCLSEDP NEMDNLFDDP
GYTAKRKELS DMIASRPRDQ LAQAPAPSGI A