Gene BURPS1106A_3258 details


Gene Information

Locus tag: BURPS1106A_3258
Symbol:
ID: 4901274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3165752
End bp: 3167992
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 67%
IMG OID: 640136484
Product: exopolysaccharide transport family protein
Protein accession: YP_001067495
Protein GI: 126453607
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01005] exopolysaccharide transport protein family
            [TIGR01007] capsular exopolysaccharide family
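
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3165752, 3167992)
print(length)                  # 2241
print(protein_length(length))  # 746
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.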


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCCA ATCCGACCGG CACATCGACG CCCGCCGACA GCGAAGGCGA TACCGACTTC 
ATCGCCGTTC TCGACATCCT GATCGAAGGC CGCTGGCTGA TCGCCGCGAT CGCGCTCGGT
GTTTTCATCG TCGGCGTCGC GTATGCGGTG TTCAGCAAGC CCGTCTATCA GGCCGACATC
CTGATCCAGG TGGAGGACAG CCCCGATACG TCCGCCGCGA AGAGCCTGCT CGGCGACGTG
TCCTCGCTGT TCGACGTGAA GTCGTCGGCG GCCGCCGAAA CGCAGATCCT CGCGTCGCGG
CTCGTCGTGT CGCGCGCCGT CGACAATCTG AAACTCTTCA TCGACGCGAA GCCGAAGCGT
TTCCCGGTGA TCGGCCACTG GCTCGCGCGC CGCAGCGAAG GGCTGTCCGA TCCGGGGCTC
GCGGGGTTCG GCGGCTACGC GTGGGGCCAG GAGCGCATCG ATGTCGCGAC GTTCGACGTG
CCGCGCGCGA TGGAGGGCGA CACGTTCGAG CTGACGATGC TCGATGCGCG CCGCTACCGC
CTCGACGGCG GCGATCTCGA GCGCGGCGCC GTGGGCGTCG TCGGCCGGCT CGAACGCATC
GCGGCGAAGG GCGGGCCGAT CGCGCTGCGC GTCGACGCGT TCGCCGCGAA GCCGGGCGCG
ACGTTCGTGC TCGTGCGCCA TTCCCGCCTG CGTACGATCG AGGCGTTGCA GGACAACCTC
GACGTGCAGG AGCGCGTCAA GCAGTCCGAC GTCGTCGTCG CGAGCCTGCG CGACACCGAT
CCCGATCTCG TGAGCCGCGC GCTCAATGAA ATCGGGCGGC AATACATCGC GCAGAACATT
CAGCGAAAGT CGGCGGAAGC CGCGCAATCG CTCGAGTTCC TGAATGGGCA ACTGCCCACG
CTCAAGCAGC AGTTGACCGA TTCGGAAGCG CGACTGACGA AACTGCGCGA CGAGCACGGC
AGCGTCGATC TGACCGAGGA GGCGAAGCTC GTGCTCGCGC AATCGGCCGA CGCGAAGACG
CGTCTGCTCG AATTGCAGCA AAAGCGGCAG GAGCTGCTGT CGCGCTTCAT GCCGAAGCAC
CCGAGCGTCG TCGCGATCGA TCAGCAGATC GCCGCGCTCG ACGGCTATCG CGGCACGGCC
GAGCAGCAGA TCAAGCGGCT GCCGGAGCTG CAGCAACAGT TCGTGCGGCT GATGCTCGAC
GTGAAGGTCA ATACCGATCT CTATACGGCG TTGCTCAACA ACATGCAGCA GTTGCAGCTC
GTGCGCGCGG GCAAGGTCGG CAACGTGCGG CTCGTCGATA CGGCGGCGGT GCCGGAAGTC
CCCGTCAAGC CGAAGAAAGC GCTCGTCGCG CTCGCGTCGC TGCTGCTCGG CGTGCTCGCC
GGCTGCGGCA CGGCGGTTGG CCGCTCGATG CTGTTCCACG GCATCTCCGA TCCGAACGAG
ATCGAGCGGC GCCTCGGCTT GAGCGTCTAT GCGACCGTGC CCCGCAGCGA TCGGCAACGG
GCGCTGACCG AGCGCGCGAA GCACAAGGCG CGCGCGCTGT CGCTGCTGTG CGTCGCGCAT
CCGGACGAGC CGGCCGTCGA GAGCCTGCGC AGCCTGCGCA CCGCGCTGCA ATTCGCGATG
CTCGATGCAA AAAACAACGT CGTCGTGATC GCGGGTCCCG CGCCGGGCGT GGGCAAGTCG
TTCGTGTCGG CCAATCTCGC CGCGGTGCTG ACGATGGCGG GCAAGCGCGT GCTGCTGATC
GACGGCGACA TTCGCAAAGG ACATCTGAAC GATTATCTCG GCCTCGCGCG CGGCAAGGGC
TTTTCCGAAC TGATCGCGGG CTCGGCGCAG CCGGACGACG TGCTGCATCG CGACGTGATC
GCCGGGCTCG ATTTCATCTC GACGGGCGCG ATGCCGAAGA ATCCGGCCGA GTTGCTGCTC
AATGCGCGCG TGTCGACGTT GATCGACACT TTCTCGCAGC GTTACGACGC CGTCGTGATC
GATTCGCCGC CGGTGCTGGC GGTGGCCGAC ACGGGCATTC TCGCCGCGAC GGCGGGTACG
GCGTTCCTCG TCACGCTCGC CGGCTCGACG AAGCTCGGCG AGATCGCCGA ATCGGCGAAG
CGGCTCGCGC AGAACGGCGT GCGCCTGAGC GGCGTGGTGT TCAACGGCAT CAATCCGCGC
CTCGGGCAGT ACGGCTACGG CTCGAAGTAC GGCGGCTATC GCTACGTCGC ATACGAATAC
GGAGCGAAGC GCGATGCCTG A
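
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of that clean-up plus a GC-content calculation. It assumes GC content is simply (G + C) / total length; the record does not say how its 67% figure was derived, and `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input in the same blocked layout as the record.
toy = clean_sequence("ATGC GGCC\nAT")
print(toy)              # ATGCGGCCAT
print(gc_content(toy))  # 0.6
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_content` would give a value to compare with the GC content field.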
 
Protein sequence
MNPNPTGTST PADSEGDTDF IAVLDILIEG RWLIAAIALG VFIVGVAYAV FSKPVYQADI 
LIQVEDSPDT SAAKSLLGDV SSLFDVKSSA AAETQILASR LVVSRAVDNL KLFIDAKPKR
FPVIGHWLAR RSEGLSDPGL AGFGGYAWGQ ERIDVATFDV PRAMEGDTFE LTMLDARRYR
LDGGDLERGA VGVVGRLERI AAKGGPIALR VDAFAAKPGA TFVLVRHSRL RTIEALQDNL
DVQERVKQSD VVVASLRDTD PDLVSRALNE IGRQYIAQNI QRKSAEAAQS LEFLNGQLPT
LKQQLTDSEA RLTKLRDEHG SVDLTEEAKL VLAQSADAKT RLLELQQKRQ ELLSRFMPKH
PSVVAIDQQI AALDGYRGTA EQQIKRLPEL QQQFVRLMLD VKVNTDLYTA LLNNMQQLQL
VRAGKVGNVR LVDTAAVPEV PVKPKKALVA LASLLLGVLA GCGTAVGRSM LFHGISDPNE
IERRLGLSVY ATVPRSDRQR ALTERAKHKA RALSLLCVAH PDEPAVESLR SLRTALQFAM
LDAKNNVVVI AGPAPGVGKS FVSANLAAVL TMAGKRVLLI DGDIRKGHLN DYLGLARGKG
FSELIAGSAQ PDDVLHRDVI AGLDFISTGA MPKNPAELLL NARVSTLIDT FSQRYDAVVI
DSPPVLAVAD TGILAATAGT AFLVTLAGST KLGEIAESAK RLAQNGVRLS GVVFNGINPR
LGQYGYGSKY GGYRYVAYEY GAKRDA
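
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the amino-acid assignments of translation table 11, which are the same as the standard code for sense codons; it ignores alternative start codons and ambiguous bases, and `translate` is an illustrative helper, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGAATCCCA ATCCGACCGG CACATCGACG"))  # MNPNPTGTST
```

The output matches the first ten residues of the protein sequence listed above.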