Gene BURPS1106A_A0753 details


Gene Information

Locus tag: BURPS1106A_A0753
Symbol:
ID: 4905957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 745448
End bp: 747001
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 68%
IMG OID: 640143859
Product: sulfatase family protein
Protein accession: YP_001074789
Protein GI: 126457339
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR03417] choline-sulfatase
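
The start, end, and length fields above are related by simple arithmetic. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(745448, 747001)
print(length)                     # 1554
print(protein_length_aa(length))  # 517
```

Both values agree with the Gene Length and Protein Length fields in the record.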


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.1985
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGATA CCGCCGAACC CACCGATATC CAGCCGAACA TTCTCGTCCT GATGGCCGAC 
CAGCTCACGC CCTTCGCGTT GCGCGCGTAC GGCCATCGCG CGACGCGTAC GCCGACGATC
GACCGGCTCG CCGCCGAGGG CGTCGTCTTC GACGCCGCTT ATTGCGCGAG CCCGCTCTGC
GCGCCGTCGC GCTTCGCGCT GATGGCGGGC AAGCTGCCGT CGGCGCTCGG CGCTTACGAT
AACGCCGCCG AATTGCCGGC GCAAACGCTG ACGTTCGCGC ACTACCTGCG CGCGGCCGGT
TACCGGACGA TGCTGTCGGG CAAGATGCAC TTCTGCGGGC CCGATCAGTT GCACGGCTTC
GAGGAACGGC TCACGACCGA CATCTATCCG GCCGATTTCG GCTGGGTGCC GGACTGGACG
CGTCCCGCCG AGCGGCCGAG CTGGTATCAC AACATGAGCT CGGTGCTCGA CGCCGGGCCT
TGCGTGCGGA CCAACCAGCT CGATTTCGAC GACGATGCGA CGTTCGCCGC GCGCCAGAAG
ATCTTCGACG TCGCGCGCGA GCGCGCGGCC GGCCGGGACA CGCGGCCGTT CTGCATGGTC
GTGTCGCTCA CGCATCCGCA TGATCCGTAT GCGATCACGC GCGAATACTG GGATCTGTAC
CGGGACGAGG ACATCGATCT GCCCGCCGTG CAGATGGATT TCGACGCGAG CGACCCGCAT
TCGCGGCGGC TGCGCGCCGT ATGCGAGGTC GATCGCACGC CGCCGGAGGA CCTGCAGATC
CGGCGCGCGC GGCGCGCGTA CTACGGCGCG ACGTCCTATG TCGACGCGCA GTTCGGCGCG
CTGCTCGCGA CGCTCGAGCA ATGCGGGCTC GCCGACGACA CGATCGTGAT CGTCACCGCC
GATCACGGCG ACATGCTCGG CGAGCGCGGC CTCTGGTACA AGATGACGTT CTTCGAAGGC
GCATGCCGCG TGCCGCTCAT CGTGCACGCG CCGCGCCGGT TTCCGGCCGC GCGCGTGCCG
GCGGCCGTGT CGCACGTCGA TCTGCTGCCG ACGCTCGTCG AGCTCGCGAC GGGCGAGCGC
CGCGCCGACT GGCCCGACGC CGTCGACGGC CGCAGCCTCG TTCCCCATCT GCGCGGCGAA
GGCGGCCATG ACGAGGCGTT CGGCGAATAT CTGGCCGAAG GCGCGATCGC GCCGATCGTG
ATGATGCGCC GCGGCAGCCA CAAGTACATC CATTCGCCCG CGGATCCGGA TCAGCTCTTC
GATCTGAGGA ATGATCCGCG CGAGCTCGAC AATCTCGCGA ACACGCCCGC CGCGGCAAAG
CACGTCGCCG CGTTTCGCAT GGAGCGCGTC GCGCGCTGGG ATCTCGATGC GCTGCATCAG
CAGGTGCTCG CGAGCCAGCG CAGGCGGCGC TTCCATTTCG AGGCGACGAC CCAGGGGCGA
ATCCGGTCGT GGGACTGGCA GCCGTTCCAG GATGCGAGCC AGCGTTACAT GCGCAATCAC
CTCGAACTCG ACGCGCTCGA GGCAGCCGCG CGTTTTCCTC GTCCGCACGC ATGA
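
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked, multi-line sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the listing above, used only as sample input.
first_line = "ATGCCCGATA CCGCCGAACC CACCGATATC CAGCCGAACA TTCTCGTCCT GATGGCCGAC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(f"GC fraction of this line: {gc_fraction(dna):.2f}")
```

Applying the same two functions to the full listing would give the total length and GC fraction for comparison with the reported 1554 bp and 68%.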
 
Protein sequence
MPDTAEPTDI QPNILVLMAD QLTPFALRAY GHRATRTPTI DRLAAEGVVF DAAYCASPLC 
APSRFALMAG KLPSALGAYD NAAELPAQTL TFAHYLRAAG YRTMLSGKMH FCGPDQLHGF
EERLTTDIYP ADFGWVPDWT RPAERPSWYH NMSSVLDAGP CVRTNQLDFD DDATFAARQK
IFDVARERAA GRDTRPFCMV VSLTHPHDPY AITREYWDLY RDEDIDLPAV QMDFDASDPH
SRRLRAVCEV DRTPPEDLQI RRARRAYYGA TSYVDAQFGA LLATLEQCGL ADDTIVIVTA
DHGDMLGERG LWYKMTFFEG ACRVPLIVHA PRRFPAARVP AAVSHVDLLP TLVELATGER
RADWPDAVDG RSLVPHLRGE GGHDEAFGEY LAEGAIAPIV MMRRGSHKYI HSPADPDQLF
DLRNDPRELD NLANTPAAAK HVAAFRMERV ARWDLDALHQ QVLASQRRRR FHFEATTQGR
IRSWDWQPFQ DASQRYMRNH LELDALEAAA RFPRPHA
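
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal translator for checking that correspondence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons). The compact 64-letter string follows the usual T, C, A, G base ordering. The function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence and the last two codons.
print(translate(clean_sequence("ATGCCCGATA CCGCCGAACC CACCGATATC")))  # MPDTAEPTDI
print(translate("GCATGA"))  # A
```

The first call reproduces the first ten residues of the protein sequence, and the second shows the final alanine followed by the TGA stop codon.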