Gene BURPS668_3220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3220 
Symbol 
ID4882571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3149445 
End bp3151685 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content67% 
IMG OID640129148 
Productexopolysaccharide transport family protein 
Protein accessionYP_001060231 
Protein GI126439124 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCCA ATCCGACCGG CACATCGACG CCCGCCGACA GCGAAGGCGA TACCGACTTC 
ATCGCCGTTC TCGACATCCT GATCGAAGGC CGCTGGCTGA TCGCCGCGAT CGCGCTCGGT
GTTTTCATCG TCGGCGTCGC GTATGCGGTG TTCAGCAAGC CCGTCTATCA GGCCGACATC
CTGATCCAGG TGGAGGACAG CCCCGATACG TCCGCCGCGA AGAGCCTGCT CGGCGACGTG
TCCTCGCTGT TCGACGTGAA GTCGTCGGCG GCCGCCGAAA CGCAGATCCT CGCGTCGCGG
CTCGTCGTGT CGCGCGCCGT CGACAATCTG AAACTCTTCA TCGACGCGAA GCCGAAGCGT
TTCCCGGTGA TTGGCCACTG GCTCGCGCGC CGCAGCGAAG GGCTGTCCGA TCCGGGGCTC
GCGGGGTTCG GCGGCTACGC GTGGGGCCAG GAGCGCATCG ATGTCGCGAC GTTCGACGTG
CCGCGCGCGA TGGAGGGCGA CACGTTCGAG CTGACGATGC TCGATGCGCG CCGCTACCGC
CTCGACGGCG GCGATCTCGA GCGCGGCGCC GTGGGCGTCG TCGGCCGGCT CGAACGCATC
GCGGCGAAGG GCGGGCCGAT CGCGCTGCGC GTCGACGCGT TCGCCGCGAA GCCGGGCGCG
ACGTTCGTGC TCGTGCGCCA TTCCCGCCTG CGTACGATCG AGGCGTTGCA GGACAACCTC
GACGTGCAGG AGCGCGTCAA GCAGTCCGAC GTCGTCGTCG CGAGCCTGCG CGACACCGAT
CCCGATCTCG TGAGCCGCGC GCTCAATGAA ATCGGGCGGC AATACATCGC GCAGAACATT
CAGCGAAAGT CGGCGGAAGC CGCGCAATCG CTCGAGTTCC TGAATGGGCA ACTGCCCACG
CTCAAGCAGC AGTTGACCGA TTCGGAAGCG CGACTGACGA AACTGCGCGA CGAGCACGGC
AGCGTCGATC TGACCGAGGA GGCGAAGCTC GTGCTCGCGC AATCGGCCGA CGCGAAGACG
CGTCTGCTCG AATTGCAGCA AAAGCGGCAG GAGCTGCTGT CGCGCTTCAT GCCGAAGCAC
CCGAGCGTCG TCGCGATCGA TCAGCAGATC GCCGCGCTCG ACGGCTATCG CGGCACGGCC
GAGCAGCAGA TCAAGCGGCT GCCGGCGCTG CAGCAACAGT TCGTGCGGCT GATGCTCGAC
GTGAAGGTCA ATACCGATCT CTATACGGCG TTGCTCAACA ACATGCAGCA GTTGCAGCTC
GTGCGCGCGG GCAAGGTCGG CAACGTGCGG CTCGTCGATA CGGCGGCGGT GCCGGAAGTC
CCCGTCAAGC CGAAGAAAGC GCTCGTCGCG CTCGCGTCGC TGCTGCTCGG CGTGCTCGCC
GGCTGCGGCA CGGCGGTTGG CCGCTCGATG CTGTTCCACG GCATCTCCGA TCCGAACGAG
ATCGAGCGGC GCCTCGGCTT GAGCGTCTAT GCGACCGTGC CCCGCAGCGA TCGGCAACGG
GCGCTGACCG AGCGCGCGAA GCACAAGGCG CGCGCGCTGT CGCTGCTGTG CGTCGCGCAT
CCGGACGAGC CGGCCGTCGA GAGCCTGCGC AGCCTGCGCA CCGCGCTGCA ATTCGCGATG
CTCGATGCAA AAAACAACGT CGTCGTGATC GCGGGTCCCG CGCCGGGCGT GGGCAAGTCG
TTCGTGTCGG CCAATCTCGC CGCGGTGCTG ACGATGGCGG GCAAGCGCGT GCTGCTGATC
GACGGCGACA TTCGCAAAGG ACATCTGAAC GATTATCTCG GCCTCGCGCG CGGCAAGGGC
TTTTCCGAAC TGATCGCGGG CTCGGCGCAG CCGGACGACG TGCTGCATCG CGACGTGATC
GCCGGGCTCG ATTTCATCTC GACGGGCGCG ATGCCGAAGA ATCCGGCCGA GTTGCTGCTC
AATGCGCGCG TGTCGACGTT GATCGACACT TTCTCGCAGC GTTACGACGC CGTCGTGATC
GATTCGCCGC CGGTGCTGGC GGTGGCCGAC ACGGGCATTC TCGCCGCGAC GGCGGGCACG
GCGTTCCTCG TCACGCTCGC CGGCTCGACG AAGCTCGGCG AGATCGCCGA ATCGGCGAAG
CGGCTCGCGC AGAACGGCGT GCGCCTGAGC GGCGTGGTGT TCAACGGCAT CAATCCGCGC
CTCGGGCAGT ACGGCTACGG CTCGAAGTAC GGCGGCTATC GCTACGTCGC ATACGAATAC
GGAGCGAAGC GCGATGCCTG A
 
Protein sequence
MNPNPTGTST PADSEGDTDF IAVLDILIEG RWLIAAIALG VFIVGVAYAV FSKPVYQADI 
LIQVEDSPDT SAAKSLLGDV SSLFDVKSSA AAETQILASR LVVSRAVDNL KLFIDAKPKR
FPVIGHWLAR RSEGLSDPGL AGFGGYAWGQ ERIDVATFDV PRAMEGDTFE LTMLDARRYR
LDGGDLERGA VGVVGRLERI AAKGGPIALR VDAFAAKPGA TFVLVRHSRL RTIEALQDNL
DVQERVKQSD VVVASLRDTD PDLVSRALNE IGRQYIAQNI QRKSAEAAQS LEFLNGQLPT
LKQQLTDSEA RLTKLRDEHG SVDLTEEAKL VLAQSADAKT RLLELQQKRQ ELLSRFMPKH
PSVVAIDQQI AALDGYRGTA EQQIKRLPAL QQQFVRLMLD VKVNTDLYTA LLNNMQQLQL
VRAGKVGNVR LVDTAAVPEV PVKPKKALVA LASLLLGVLA GCGTAVGRSM LFHGISDPNE
IERRLGLSVY ATVPRSDRQR ALTERAKHKA RALSLLCVAH PDEPAVESLR SLRTALQFAM
LDAKNNVVVI AGPAPGVGKS FVSANLAAVL TMAGKRVLLI DGDIRKGHLN DYLGLARGKG
FSELIAGSAQ PDDVLHRDVI AGLDFISTGA MPKNPAELLL NARVSTLIDT FSQRYDAVVI
DSPPVLAVAD TGILAATAGT AFLVTLAGST KLGEIAESAK RLAQNGVRLS GVVFNGINPR
LGQYGYGSKY GGYRYVAYEY GAKRDA