Gene Bphyt_4474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_4474 
Symbol 
ID6279540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp542836 
End bp544782 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content63% 
IMG OID642615567 
Productsulfatase 
Protein accessionYP_001888220 
Protein GI187919189 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAC CACGATCGCA GCGCCCTTCT CTTCGTCTTG GCATGATCGG CGCCGCCGTC 
GCCAGTCTCG TCGCTCTCGC CTCCTGCGGC GACGACAATC TGCCCGGCCC GACTCCCACG
CCCGTAGTCG CCCAGAAACG CCCGAACATT CTGTACATCA TGGCCGACGA CCTCGGCTAT
TCCGACATCC ACGCGTTCGG CGGCGAAATC AACACGCCCA ACCTCGACGC GCTGGTGCAA
TCGGGCCGCA TCCTGACGAA TCACCATACG GGCACCGTCT GCGCGATCAC GCGCTCCATG
TTGATCTCCG GCACCGATCA CCATCTGGTC GGCGAAGGCA CCATGGGTGT GCCGACCGAC
GAGCGCAAAG GACTGCCGGG TTATGAAGGC TATCTGAACG ACCGCGCGCT CTCCGTTGCG
CAACTGCTGA AGGACGGCGG TTATCACACG TATATGGCCG GCAAATGGCA TATCGGCTCG
GGCATTGTCG GCAGTACGAC AGGCGGCGGA CAGACGCCCG ACCAATGGGG CTTCGAACAT
AGCTATGCCT TGCTCGGCGG CGCCGCGACC AATCACTTCG CGCACGAACC GGCGAACTCG
CACAACTACA CCGAAGACGG CAAATACGTG CAGCCCGGTC AGCCGGGACA ACCGGGTGGC
GCGGGCGGCA GCCCCGCAGT GTTCTATTCG ACCGACTTTT ATACGCAGCG CCTGATCTCG
TACATCGATT CGAACAAGGG CGACGGCAAA CCGTTCTTCG CTTACGCGGC CTACACGTCG
CCGCATTGGC CGCTGCAGGT ACCCGAACCT TATCTGCACA ACTACGCCGG CAAATACGAC
GCCGGTTACG ACGCCATCCG CAACGCGCGC ATTGCGCGGC AGAAAGCGCT CGGCATCATT
CCGAACGACT TCGTGCCGTA CGGCGGCGCA TCGGAAACGC TCGTCGCCAC CGCGGCCACG
GCGAACAACG GCACGGTGAA CGCGAAGTAC GTGAGCGCCG TGCATAGCGC GGCGCAAGGC
TATACCGACT ACGGCCCGGG CACGGTGAAC AAGACGTGGG CGAGTCTCTC GCCCGCCGAG
AAGAAAGCCC AGGCGCGTTA TATGGAAATC TACGCGGGCA TGGTCGAAAA CCTCGATCAC
AACATCGGGT TGCTGATCCA GCATCTGAAG GATATCGGCG AGTACGACAA TACCTTCATC
ATGTTCCAGT CGGACAACGG CGCGGAAGGC TGGCCGATCG ACTCGGGCGC CGACCCGACC
GCGACCGACA CCGCCAATGC CGCCGACCCG GTTTATTCGG CACTCGGCAC CGACAACGGC
AAGCAGAACG CGCAACGTCT GCAATACGGT TTGCGCTGGG CCGAAGTCAG CGCCACGCCG
TTCCGTCTCA CCAAGGGCTA TTCGGGCGAG GGCGGCGTCT CCACGCCGTT GATCGTCCAT
CTGCCCGGTC AGAGCACGCA GAAACCGACA CTGCGCGACT TCACGCACGT GACCGACAAT
ACCGCGACGT TCCTTGCTGT CGCACAGATT TCGCCGCCCA CGCAGGCCGC GCCGCCGCTG
ATCAATTCTC TGACCGGCGT CGATCAGAAC AAGGGCAAGG TGGTCTACAA CAATCGCTAC
GTCTATCCGG TCACCGGGCA GTCGCTGCTG CCGCTGTTGA ACGATCAGGC GACGAGCGCG
GTGCACAGCG CATCGTTCGG CGACGAAGCC TATGGCCGCG GCTATCTGCG CAGCGCCGAC
GGCCGCTGGA AAGCGTTGTG GACGGAACCG CCGCTCGGTC CCGTGGACGG TCACTGGCAA
CTGTTCGACA TGAGCGCCGA CCGTGGCGAA ACGCAGGACG TGTCGACGCA GAATCCCTCG
GTGATCGACG GTCTCGTGCA GCAGTGGAAC AACTATATGA GCAGCGTCGG CGGCGTCGAA
CCGTTGCGTC CGCGCGGTTA CTACTGA
 
Protein sequence
MTTPRSQRPS LRLGMIGAAV ASLVALASCG DDNLPGPTPT PVVAQKRPNI LYIMADDLGY 
SDIHAFGGEI NTPNLDALVQ SGRILTNHHT GTVCAITRSM LISGTDHHLV GEGTMGVPTD
ERKGLPGYEG YLNDRALSVA QLLKDGGYHT YMAGKWHIGS GIVGSTTGGG QTPDQWGFEH
SYALLGGAAT NHFAHEPANS HNYTEDGKYV QPGQPGQPGG AGGSPAVFYS TDFYTQRLIS
YIDSNKGDGK PFFAYAAYTS PHWPLQVPEP YLHNYAGKYD AGYDAIRNAR IARQKALGII
PNDFVPYGGA SETLVATAAT ANNGTVNAKY VSAVHSAAQG YTDYGPGTVN KTWASLSPAE
KKAQARYMEI YAGMVENLDH NIGLLIQHLK DIGEYDNTFI MFQSDNGAEG WPIDSGADPT
ATDTANAADP VYSALGTDNG KQNAQRLQYG LRWAEVSATP FRLTKGYSGE GGVSTPLIVH
LPGQSTQKPT LRDFTHVTDN TATFLAVAQI SPPTQAAPPL INSLTGVDQN KGKVVYNNRY
VYPVTGQSLL PLLNDQATSA VHSASFGDEA YGRGYLRSAD GRWKALWTEP PLGPVDGHWQ
LFDMSADRGE TQDVSTQNPS VIDGLVQQWN NYMSSVGGVE PLRPRGYY