Gene BURPS1106A_1810 details

Gene Information

Locus tag: BURPS1106A_1810
Symbol:
ID: 4901077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1769785
End bp: 1771497
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 71%
IMG OID: 640135040
Product: sulfate permease family inorganic anion transporter
Protein accession: YP_001066079
Protein GI: 126451659
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00815] high affinity sulphate transporter 1
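
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(1769785, 1771497)
print(length)                     # 1713
print(protein_length_aa(length))  # 570
```

Both results match the Gene Length and Protein Length fields listed for this locus.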


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.251362
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGATCTCC GCTCGAAGCA CATCGACGCG CCGCCGCAGC ATGCCGCGCA TTTCGCGCCG 
CTCGACGAGC CGCCCGCGCC GCGCGCCCGC CGCACCGCGC TCGACGCGCT CGCCGGGCTG
TCGATCGCGG GCCTTCTGAT ACCCGAGGCG GTCGCCTACG CGGGGCTCGC GAACCTGCCG
CCGCAGGCGG GGCTCATCGC GCTCGTCGTG GGGCTCGTCG TCTACGCGAT CGCGGGCAGC
AGCCGCTTCG CGATCGTGTC GGCGACGTCG TCGTCGGCCG TCGTGCTCGC GGCGACCGTG
ATGTCCGAGG CCGGACCGGG CGCCGCCGCG CAGTTGATGC TCGCCGCGGC GCTCGTCGCG
ACGACGGGCA TCCTGTTCAT CCTCGCGGGC GCCGCGCGTC TCGGCGGCAT GTCGGATTTC
ATCGCGCGGC CGGTGCTGCG CGGCTTCACG TTCGGCCTCG CACTCACGAT CGTGATCAAG
CAGTTGCCGA AGATGCTCGA CGTGCCGATC CATCACGGCG ATACGCTGCG CGTCGCGCTC
GACCTGCTGC TCGGCATTGC GGGCTGCAAT GTCCGCAGCG CGGCGCTCGG CGCGACCGCG
CTGGCGATCC TGTTCGCGCT CGGCAGGCGC ACGCGCGTGC CGGCGACGCT CGTCGTGATC
GTGCTCGGCA TCGCGGCCGG CTACTGGATC GACTGGCATC GCTACGGCAT CGCCGTCGTC
GGTACGATCG ATCTGCAGAA TCTCGCGTTC GGCATGCCGG TGCTCGGCCG CTCCGGCTGG
ATGCAGACGG CCGAGTTCGG CTTCGCGCTG ATGCTGATCC TGTACGCGGA ATCGTACGGG
TCGATTCGCA ACTTCGCGCT CAAGCACGGC GACACGGTCT CGCCGAACCG CGATCTCGTC
GCGCTCGGCT GCGCGAACCT CGTATCGGGG CTGCTGCATG GGATGCCCGT CGGCGCGGGC
TATTCGGCGA CCTCGGCGAA CGAGGCGGCG GGCGCGCAAA CGCGTATGGC GGGCCTGTTC
GCGGCCGCCG TGATCGCGCT GATCGCCTGG CTGCTGCTGC CGCAGCTCGC GCGCATTCCC
GAGCCCGTGC TCGCGGCGAT CGTGATCTTC GCGGTCAGCC ATTCGCTGCA TCCGGAGGTG
TTCCGGCCGT ACTGGACCTG GCATCGGGAC CGGATCGTCG TGATCGCCGC GCTCGCGGCG
GTGATCGTGC TCGGCGTGCT GCACGGCCTG CTCGCCGCGA TCGGCGTGAG CCTGCTGCTC
ACGCTGCGGC AATTGTCCGA GCCGAACGTG AGCGTGCTGG GCCGGCTGCG CGGGAGCCAC
GATTTCGTCG ACGTGTCGAT GCACGAGGAT GCGAAGCCGA TCCCCGGCGT GCTGATCGTG
CGGCCGGAAG CCCAGCTCTT CTTCGCGAAC GCGGAGCGCG TGCTGACCAT GGCGAGGCGC
CTCGCGCGCG ACGCGCAGCC GCCCGTGCAC ACGGTGATGC TGAGTCTCGA GGAATCGCCC
GACGTCGACG GCACGACGAT CGAGGCGCTG AAGACGTTCG GCGCCGAATG CGATGCGCGC
GGCTGGCGCC TCGCGCTCGT GCGCCTGAAG CCGAACGTGC TGCGCGTGCT GCAACGCGCG
GCGGACGGCG GGCTGCGCGC GGATGCGCTG TCGGAGCTGA GCGTCGACGA GAGCCTGCAA
TCGCTGACGG CGGGCGAGTT GCCGCGCGCG TGA
 
Protein sequence
MDLRSKHIDA PPQHAAHFAP LDEPPAPRAR RTALDALAGL SIAGLLIPEA VAYAGLANLP 
PQAGLIALVV GLVVYAIAGS SRFAIVSATS SSAVVLAATV MSEAGPGAAA QLMLAAALVA
TTGILFILAG AARLGGMSDF IARPVLRGFT FGLALTIVIK QLPKMLDVPI HHGDTLRVAL
DLLLGIAGCN VRSAALGATA LAILFALGRR TRVPATLVVI VLGIAAGYWI DWHRYGIAVV
GTIDLQNLAF GMPVLGRSGW MQTAEFGFAL MLILYAESYG SIRNFALKHG DTVSPNRDLV
ALGCANLVSG LLHGMPVGAG YSATSANEAA GAQTRMAGLF AAAVIALIAW LLLPQLARIP
EPVLAAIVIF AVSHSLHPEV FRPYWTWHRD RIVVIAALAA VIVLGVLHGL LAAIGVSLLL
TLRQLSEPNV SVLGRLRGSH DFVDVSMHED AKPIPGVLIV RPEAQLFFAN AERVLTMARR
LARDAQPPVH TVMLSLEESP DVDGTTIEAL KTFGAECDAR GWRLALVRLK PNVLRVLQRA
ADGGLRADAL SELSVDESLQ SLTAGELPRA
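
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, assuming the sequence text has been pasted into a Python string, of how one might strip that layout and recompute a GC percentage to compare against the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence, copied from this record.
first_line = clean_sequence(
    "ATGGATCTCC GCTCGAAGCA CATCGACGCG CCGCCGCAGC ATGCCGCGCA TTTCGCGCCG"
)
print(len(first_line))  # 60
print(round(gc_percent(first_line), 1))
```

Applying the same two calls to the full gene sequence gives a value that can be compared with the reported GC content, bearing in mind that the record shows a rounded percentage.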