Gene Bphyt_6681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_6681 
Symbol 
ID6280298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp2995296 
End bp2996981 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content57% 
IMG OID642617713 
Productsulfatase 
Protein accessionYP_001890350 
Protein GI187921318 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTG AAACGTCCTT CCCGGCACCT GCCAGGCCCT GCAAACGCGC GCCTTTCGCT 
GTGGTTGCGG CGGCATTGCT GAGCGTTGCG ATAAGCGGCC AGGCACAAAC GGATTCTCAA
CAGCCCATGC GGCCTAACAT CGTGTTGATT GTGGGCGACG ACGTGGGTTG GGGCGATCTC
GGAGCCTATG GCGGCGGTGA GGGGCGCGGC ATTCCCACTC CGAACCTTGA CAGGCTGGCC
GACGAAGGCA TGACATTTTT CGACTTCTAC GGCCAGCCGA GTTGTACGCC AGGCCGGGCG
GCTCTTCAGA CCGGACGCAA TCCGAACCGC AGCGGTATGA CGACCGTAGC GTTCCAGGGG
CAAGGCGGCG GCCTGCCTCA CGCGGAATGG ACGCTGGCGT CCATGCTGAA ACTCGCGCAC
TACAACACAT ATTTCACCGG CAAGTGGCAC CTCGGCGAAG CGGACTACGC CCTACCGAAC
GCACAGGGCT ATGACGACAT GAAGTACGTT GGCCTGTATC ACCTGAATGC ATACACGTAC
TCCGATCCGA AATGGTTTCC CGACATGGAC CAGCAAACTC GGGACCTGTT CACCAAGGTA
ACGAGAGGCA TGCTCAGCGG CAAGGCCGGC GAAAAGGCTC ATGAAGATTT CAAGCTGAAC
GGTCAGTATC AAAACGAGCC CGAAAACCGG ATCGTCGGCA TTCCGTTTGT CGATCGTTAC
ATCGAGAAAG CTGCGCTCGA CGATATCGAC GATGCATCCC AACGCGGCCA GCCATTCTTC
ATCAACGTAA ACTTCATGAA GGTTCATCAG CCGAACATGC CCGATCCCGA CTACATCGGA
AAGTCTCTGT CGAAGTCCAA ATACGCAGAC TCACTGGTGG AACTGGATGC ACGTGTCGGC
CACATCATGG ACAAGCTTCG CGAGAAAGGA CTCGACAAGA ACACGCTTGT GTTCTTCACG
ACGGACAACG GCGCGTGGCA GGATGTTTAT CCAGATGCGG GCTATACGCC CTTCCGCGGT
ACCAAGGGAA CCGACCGCGA GGGCGGAGCG CGTGTTCCCG CCATCGCGTG GTGGCCGGGC
AAGATCAAGC CTCACACGAG GAACTTCGAC ATTCTGGGTG GCCTCGACTG TATGGCAACC
TTCGCGGCAT TGGCAGGCGT GGATTTACCC AAAAACGATC GCGAAGGCAA ACCCATCGTT
TTCGACAGCT ACGATATGTC TCCCGTGCTG CTCGGCACGG GCAAGAGCAA GCGCAATGCC
TGGTTCTATT TCACCGAAAA CGAGCTGACG CCGGGTGCCG TGCGGGTCGG GCAGTTCAAG
GCGGTGTTCA ACCTGCGCGG CGACGCAGGC AAGGATACGG GCGGTCTAGC GGTGGACACG
AATCTCGGCT GGAAAGGACC TGAGAGCTAC GTCGCAACCG TTCCGCAGGT CTTCGATCTG
TATCAGGATC CTCAGGAGCG CTACGACATC TTCATGAACA ACTATACGGA ACACACTTGG
ACACTTGTGG CGTTTAACGC TGCTGTCAAG GACCAGATGC AAACGTATGT GAAGTATCCG
CCGCGCAAGT TGCAAAGCGA AGGATACGCC GGACCGATTA CCCTCACCCA GTATCAACGA
TTCAAGTACA TCCGGGATCA ACTGCAGGAG AACGGTTTCA ACATTCCGAT GCCAACCGGG
AACTGA
 
Protein sequence
MKPETSFPAP ARPCKRAPFA VVAAALLSVA ISGQAQTDSQ QPMRPNIVLI VGDDVGWGDL 
GAYGGGEGRG IPTPNLDRLA DEGMTFFDFY GQPSCTPGRA ALQTGRNPNR SGMTTVAFQG
QGGGLPHAEW TLASMLKLAH YNTYFTGKWH LGEADYALPN AQGYDDMKYV GLYHLNAYTY
SDPKWFPDMD QQTRDLFTKV TRGMLSGKAG EKAHEDFKLN GQYQNEPENR IVGIPFVDRY
IEKAALDDID DASQRGQPFF INVNFMKVHQ PNMPDPDYIG KSLSKSKYAD SLVELDARVG
HIMDKLREKG LDKNTLVFFT TDNGAWQDVY PDAGYTPFRG TKGTDREGGA RVPAIAWWPG
KIKPHTRNFD ILGGLDCMAT FAALAGVDLP KNDREGKPIV FDSYDMSPVL LGTGKSKRNA
WFYFTENELT PGAVRVGQFK AVFNLRGDAG KDTGGLAVDT NLGWKGPESY VATVPQVFDL
YQDPQERYDI FMNNYTEHTW TLVAFNAAVK DQMQTYVKYP PRKLQSEGYA GPITLTQYQR
FKYIRDQLQE NGFNIPMPTG N