Gene Bphyt_5075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_5075 
Symbol 
ID6279038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp1236569 
End bp1238107 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content60% 
IMG OID642616166 
Productcholine-sulfatase 
Protein accessionYP_001888809 
Protein GI187919778 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.287023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0038506 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTGACA GCAAAAAGAA TATTCTCATC CTGATGGCCG ACCAGATGAC GCCGTTCGCG 
CTGGCCGCAT ACGGGCACCC TCTGACGAAG ACGCCCAATC TCGATCGCCT CGCGAAACAG
GGCGTGGTGT TCGAATCGGC CTATTGCGCG AGTCCGCTGT GTGCGCCGTC GCGATTCTCT
TTTTTATCGG GCAAGCTGCC GTCCGCAATC GGTGCCTATG ACAATGCAGC GGAATTTCCG
TCGCAAACGC TGACCTTCGC GCATTATCTC CGCGCTGAAG GTTATCGAAC CATTCTGTCC
GGCAAGATGC ATTTCTGCGG CGCCGATCAG TTGCATGGCT TCGAAGAGCG GCTCACCACC
GACATCTATC CCGCCGACTT CGGCTGGACA CCCGACTGGG ACAATTTCGA AGCACGTCCC
ACGTGGTATC ACAACATGAG TTCGGTAATC GACGCGGGTC CGTGCGTGCG CACCAATCAA
CTCGACTTCG ACGACGAGGT CACGTTCACC GCTCGCCAGA AACTCTTCGA CATTGCGCGC
GAACGTCATG CGGGTAAAGA TGCGCGGCCG TTTTGCATGG TCGCCTCGCT GACGCACCCA
CACGACCCGT ACGCGATTCC GCAAAAGTAC TGGGACATGT ATCGCGACGA AGACATCGAC
ATGCCCGCGT TTCGCGATTC ATTCGAAGAC GCTGACCCGC ACTCAAAGCG CCTGCGCCAT
GTCTGCGAAA CCGATCGCAC GCCGCCTACC GATCAGCAGA TCCGCAACGC GCGGCGCGCT
TATTACGGCG CGATCTCCTA CGTCGACGAC CAGTTCGGCG CGATCCTCGA AGCGCTCGAT
CAGGCCGGCC TCGCGCAAGA CACGGTGATC GTGGTGACGT CCGATCACGG CGAGATGCTC
GGCGAACGTG GACTCTGGTA CAAGATGACT TTCTTCGAAG GCGGTTGCCG CGTGCCCTTG
ATCGTGCACG CGCCGCAGCA ATTCGACGCG CATCGCGTGA AGGACTCGGT CTCGCATCTC
GATCTCGTGC CGACGCTGGT CGAACTGGCG CGCGGCGAAC AGCCGGCCGT GTGGCCCGAT
TCGCTGGATG GACAAAGCCT CGTGCCGCAT CTGTTCGGTA AGCAAGGTGG TCATGACGAA
GCGATCGGCG AATATCTGGC CGAGGGTGCG ATTGCGCCGA TTGTGATGCT GCGGCGTGGG
CGCTTCAAGT TTATTCACAC ACCTGCCGAT CCCGATCAAC TCTACGACGT CGCAGCCGAT
CCGCTTGAAC GAGAAAACCT TGCCGCGCGC AGCGAATATG CATCACAGGT CGCGGCATTT
CGGGAGGAAG TCGCGCAACG CTGGAATCTC GCCGCGCTGC ACAATGAAGT GCTGCAAAGC
CAGCGGCGCC GCCATTTTCA CTTCGCGTCA ACGACACAAG GCACGGTCGC ATCGTGGGAT
TGGCAACCGC TCGTCGATGC GAGCCAGCGT TATATGCGCA ACCACATCGA TCTCGATACG
CTCGAAGCGA TGGCGCGCTT TCCCGCCGTT GCCCGTTAA
 
Protein sequence
MLDSKKNILI LMADQMTPFA LAAYGHPLTK TPNLDRLAKQ GVVFESAYCA SPLCAPSRFS 
FLSGKLPSAI GAYDNAAEFP SQTLTFAHYL RAEGYRTILS GKMHFCGADQ LHGFEERLTT
DIYPADFGWT PDWDNFEARP TWYHNMSSVI DAGPCVRTNQ LDFDDEVTFT ARQKLFDIAR
ERHAGKDARP FCMVASLTHP HDPYAIPQKY WDMYRDEDID MPAFRDSFED ADPHSKRLRH
VCETDRTPPT DQQIRNARRA YYGAISYVDD QFGAILEALD QAGLAQDTVI VVTSDHGEML
GERGLWYKMT FFEGGCRVPL IVHAPQQFDA HRVKDSVSHL DLVPTLVELA RGEQPAVWPD
SLDGQSLVPH LFGKQGGHDE AIGEYLAEGA IAPIVMLRRG RFKFIHTPAD PDQLYDVAAD
PLERENLAAR SEYASQVAAF REEVAQRWNL AALHNEVLQS QRRRHFHFAS TTQGTVASWD
WQPLVDASQR YMRNHIDLDT LEAMARFPAV AR