Gene Bphyt_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_2053 
Symbol 
ID6282260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010681 
Strand
Start bp2319705 
End bp2321276 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content64% 
IMG OID642621618 
Productsulfatase 
Protein accessionYP_001895684 
Protein GI187924042 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000240603 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGCCA TCCGAAACGT CCTTTTCATC ATGTGTGACC AGTTGCGGCG GGATCACCTC 
GCCTGCTACG GCCACCCATA CATGCGCACA CGCAACATCG ACACGCTCGC CGCGCGCGGC
GTGCGCTTCG ACAACGCCTA TGTCAGCTCC GGCGTCTGCG GGCCGTCGCG CATGTCGTAT
TACACCGGCC GCACGATGAC GAGCCACGGC GCGAACTGGA ATCGCGTGCC GATGTCCATC
GGCGAGATCA CGCTTGGGGA GTATCTGCTC CGCCATGGAC GCTCGCTCGC CCTCGCCGGA
AAGACCCATG TTCAGGCGGA CGCGTCGGGC ATGGAGCGGC TGGAAATCGG TAGGGACAGC
CCGCTTGGGC TGCGCCTTTC TGCGGGCTCG TTCGTCGAAC TGGACCGTTA CGACGGGCAT
CACGAACCGG GCCGCGAAAG CGGTTATCCC GCGTATCTGC GCGCACACGG GTACGCGAGC
GACCGGCCGT GGAGCGACTA TGTGATCAGC GTCGAAGACG ACCGCGGCGA GGTCCGCTCG
GGCTGGCAGA TGCGCAATGT GCGCTGGCCG TCGCGCGTCG CGGAGCCGCA CTCCGAAACA
GCGTATATGA CCGATCAGGC GATCCGCCAT ATCGAACAAC AAGGCGACGA GCCCTGGGCG
TTACACCTGT CGTACGTGAA GCCGCATTGG CCGTATGTCG CGCCGCATCC CTATCACGAC
ATGTACTCGC TCGATCAGTG CCTGCCGCTC GTGCGCCACA CCGCGGAGCT GGACAATGCC
CACCCGGTGA CGGCCGCTTA TCGCCAGCAT GAAGAGAGCG TCAATTTTTC GCGCGACGAA
GTGTCCAATA CGGTGCGGCC CATTTACCAG GGATTGATCC AGCAGATCGA CGATCATCTC
GGGCGTCTCT GGGACGTGCT CGACCGGCTG AATCGCTGGG ACGACACGCT GATCGTCCTG
ACCGCCGATC ACGGCGATTT CCTCGGCGAT CACTGGCTCG GCGAGAAAGA GCAGTTCTAC
GATACCGTGC AGCGCGTGCC GATGGTCGTG TACGATCCAT CGCCCGAAGC GAACGCCACG
CGCGGTACGG CCGAGGCGCG CTTCACGTCC TGCATCGACG TGGTGCCGAC CGTGCTCGAA
GCGCTCGGCC TGCCTTCGTG TGAGGAGCGG ATCGAAGGAA AATCGTTGCT GCCGTTGCTG
CGCGACACAC TCACGCAGGA AAACGGATGG CGTGACTATG TGGTCTCCGA ACTCGACTAC
AGTTTTCGCG GAGCGCGCCT GACGCTCGGC CGCGCGCCGC ACGAATGCCG CGGCTGGATG
GTGCGCGACG CGCGCTGGAA ATACGTGCAC TGGCTCGGCT ACCGGCCGCA ACTGTTCGAC
CTCGAAGCGG ACCCGAACGA GTTTATCGAT CTCGGCGGCG AGCGCACGCA TGAAGCGACG
CGTACGCAAA TGCATGCCAA GCTGGTCGAC TGGCACGCGA CGCTCAAGCA GCGCGTGACG
ATCGACGATG CGGGCGTCGC GAGCCGCACA AATACGCACA GGGACTGGGG CGTGTTCTTC
GGCGAGTGGT AA
 
Protein sequence
MTAIRNVLFI MCDQLRRDHL ACYGHPYMRT RNIDTLAARG VRFDNAYVSS GVCGPSRMSY 
YTGRTMTSHG ANWNRVPMSI GEITLGEYLL RHGRSLALAG KTHVQADASG MERLEIGRDS
PLGLRLSAGS FVELDRYDGH HEPGRESGYP AYLRAHGYAS DRPWSDYVIS VEDDRGEVRS
GWQMRNVRWP SRVAEPHSET AYMTDQAIRH IEQQGDEPWA LHLSYVKPHW PYVAPHPYHD
MYSLDQCLPL VRHTAELDNA HPVTAAYRQH EESVNFSRDE VSNTVRPIYQ GLIQQIDDHL
GRLWDVLDRL NRWDDTLIVL TADHGDFLGD HWLGEKEQFY DTVQRVPMVV YDPSPEANAT
RGTAEARFTS CIDVVPTVLE ALGLPSCEER IEGKSLLPLL RDTLTQENGW RDYVVSELDY
SFRGARLTLG RAPHECRGWM VRDARWKYVH WLGYRPQLFD LEADPNEFID LGGERTHEAT
RTQMHAKLVD WHATLKQRVT IDDAGVASRT NTHRDWGVFF GEW