Gene Bphyt_6647 details

Gene Information

Locus tag: Bphyt_6647
Symbol:
ID: 6278274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010676
Strand:
Start bp: 2960051
End bp: 2961481
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 63%
IMG OID: 642617679
Product: sulfatase
Protein accession: YP_001890316
Protein GI: 187921284
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR03417] choline-sulfatase
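
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(2960051, 2961481)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Both results agree with the Gene Length and Protein Length fields above.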


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.707956
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCA CGAACGTCCT CTTCATCCTG TCTGACGAGC ACCAGCACAA CCTGATGGGC 
TGCGCCGGGC ACCCGGTCAT CAAGACGCCC TCGCTCGACG CGCTCGCGCA GCGCGGCACG
CGCTTCGAGA ACGCCTACAC GCCGTCGCCG ATCTGCGTGC CGGCGCGGGC AAGCCTCGCG
ACGGGGCGCT ATGTCCACGA CATCCGCTGC TGGGACAACG CGATCGCCTA CGACGGCAGC
ACGCCGGGCT GGGCGCAGCA CCTGTCGGCG AGCGGCGTGC TGACGGAATC GATCGGCAAG
CTGCACTACA AGTCGGACGC GTCGCCCGTC GGATTTCGCC GGCAGCAACA CGCGGTGCAT
ATCCTCGATG GGATTGGCCA GGTGTGGGGG TCGGTGCGCA ATCCGATGCC CGAAACCATG
GGCCGCTCGC CGCTATACGA CAAGATCGGC CCCGGCACGT CCGACTACAA CCGCTTCGAC
ATGCGCGTCG CCGATACGGC ATGTGGCTGG CTCGGTGAGC ATGCCGCCGA CGACAAGCCC
TGGGTGCTGT TCGTCGGGCT CGTCGCACCG CACTTTCCGC TCGTCGTGCC GCAGGATTTT
CTCGATCTCT ACGATCCACG CGAAATCGAC CTGCCGCTAC TGCATCCGTC GACGGGTTAT
GTACGGCATC CGTGGGTGGA GCGTCAGGCG CGGCATATGG ATCACGATGC GGCGATCGGC
AGCGACGAAC GCAGGCGTCT TGCCGTCGCA TGCTATTACG CGCTGGTGTC GTTCCTCGAC
GCGCAAGTCG GCAAAGTGCT TGCCGCACTG CGGGCAAGCG GACTCGACGA TTCGACGACG
ATCATCTACA GCAGCGATCA CGGCGATAAT CTCGGCAAGC GCGGCATGTG GAACAAGTGT
CTGATGTACC GCGAATCGAC AGGTGTGCCG ATGATCGTCG CGGGTCCTGG CATTCCGGCG
AGCAAGGTGA GTGAAACGCC TGTGTCGCTG ATCGATATCC AGAACACGCT GCTCGAATGC
ACGGGCTGCG AAGCAGCGCT GATCGATGGT CCAGGAAAGT CTCTCGTCGA ACTCGCGTGT
GCGGAAGACG ACGTCGGGCG TCTCGCGTTC AGCGAATATC ACGCTGTCGG ATCGGAAAGC
GCAGCGTATA TGCTCGCGGA TAGCCACTAC AAGTATCACC ATTATCTCGG CATGAAGCCG
GAACTGTTCG ATGTGAAGAA CGACCCGGAA GAGATGCGCG ATCTTGCGTC GCTGCCCGAA
TACGCCGACG TGCTCGCGCA TTTCGAACGA CAGCTTCGCG CGCTGCTCGA TCCCGAAACC
ACCGATGCTG CCGCGAAAGC CGATCAGGAC AGACTGGTCC AAGCATTCGG CGGCAGGGAA
GCGGCGCTGC GAACGGGCAC ACCCGCAGCG ACGCCCGTGC CGGTTGAATA G
 
Protein sequence
MKPTNVLFIL SDEHQHNLMG CAGHPVIKTP SLDALAQRGT RFENAYTPSP ICVPARASLA 
TGRYVHDIRC WDNAIAYDGS TPGWAQHLSA SGVLTESIGK LHYKSDASPV GFRRQQHAVH
ILDGIGQVWG SVRNPMPETM GRSPLYDKIG PGTSDYNRFD MRVADTACGW LGEHAADDKP
WVLFVGLVAP HFPLVVPQDF LDLYDPREID LPLLHPSTGY VRHPWVERQA RHMDHDAAIG
SDERRRLAVA CYYALVSFLD AQVGKVLAAL RASGLDDSTT IIYSSDHGDN LGKRGMWNKC
LMYRESTGVP MIVAGPGIPA SKVSETPVSL IDIQNTLLEC TGCEAALIDG PGKSLVELAC
AEDDVGRLAF SEYHAVGSES AAYMLADSHY KYHHYLGMKP ELFDVKNDPE EMRDLASLPE
YADVLAHFER QLRALLDPET TDAAAKADQD RLVQAFGGRE AALRTGTPAA TPVPVE
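
The gene and protein sequences above are printed in blocks of 10 characters. As a minimal sketch of how the two relate, the following removes the block spacing, computes a GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11 (which match the standard code for non-start codons). The helper names are hypothetical, and the sketch does not handle alternative start codons such as GTG or TTG; this gene starts with ATG, so that does not matter here.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG x TCAG x TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGAAACCCA CGAACGTCCT CTTCATCCTG TCTGACGAGC ACCAGCACAA CCTGATGGGC"
dna = clean_sequence(first_line)
print(translate(dna))  # MKPTNVLFILSDEHQHNLMG
```

The output matches the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.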