Gene BURPS668_3222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3222 
Symbol 
ID4883216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3152158 
End bp3153360 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content71% 
IMG OID640129150 
Productpolysaccharide biosynthesis/export protein 
Protein accessionYP_001060233 
Protein GI126441359 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAGTC TGGGGGAAGC AGGCGGCATC GCGCCGCCGA AACGCACGTG CGGCGCGCGC 
GTGCGTCGGC TGATGAAGCG GGTGACGCTG TGCGCGGCGC TTGCGGCATT GAGCGCGTGC
GGCGTCGCGC CCGGCATGCG GATGAAACAG CCGGCGAACG TGCCGGTGTC GAGCGCGGCA
GCGGACGCGC CGGCCGAGGC CGGACGCAAG CCGCGCGGCG AGCAGCTGCC GGTGCCGATC
ACCGACATCG ATCTGAGCCT GATCCGGACG CTGCGCGATG CGCAACAGGC GCCGCGCCGC
GCGGCCGATC TCGTATCGCC GGCGTCGGGC TATACGATCG GGCGCGGCGA CGTGCTGCAG
ATCACGGTCT GGGACCACCC CGAGCTCGCG GCGGCGCTCG GCACGCAGCA GCAGACGGCG
GCGCGCGCGG CCGATGCGCC GGCGGGCTTC GTCGTCGATC AGGACGGCAC GCTCCAGTAC
CCGTACGTCG GGCGCATCGC GGTGGCGGGC CTGAAGCCGG AACAGGTTCA GGCGCGGCTC
GCGCGCCAGC TCGCGCAGAC GTTCCGCGAT CCGCAGGTGA CGGTGCGCAT CGCATCGTTT
CGCGCGAAGC AGGTCTACAT CGAAGGCGAG GTGCATACGC CCGGTTCGCA GGCGCTCAAC
GACATCCCGA TGACGCTGTA CGACGCGGTG AGCCGCGCGG GCGGCTTCTC GGCGAGCGCG
GACCAGCGGC GCGTGACGCT CGTGCGCGAC GGCGTCGAAC GAAGGATCGA TCTGTCGGGC
GCTGCACAGG GCGTCAATCC GTCACGGATC GTGCTGCGCG ACGGCGATTT GCTGCGCATT
CCGCCGCGCG ACGAAAGCGG CGTGTTCGTG ATGGGCGAGG TCAACCGGCC CGTCACCGCG
CTGCCGATGC GCAACGGCCG CCTGACGCTG AGCGAGGCGC TGTCGCAGGC CGGCAGCCTG
AACGCGACGA CGGCCGACGC CGCGCAACTG TATGTGATTC GCGGCTCGCT CGACGCGAAG
CCGCACGTGT ATCGGCTCGA CGCGAGCTCG CCCGTCGCGA TGGTGCTCGC GAACCAGTTC
GAGCTGGAGC CGAAGGACAT CGTCTATGTC GACGGCAACG GCCTCGTGCG CTTCAGCCGC
GTGCTCAGCC TGTTGCTGCC GGCCGTCAAC GCCGGCCTGA CCGCGGCGGT CGTGACCAAA
TGA
 
Protein sequence
MGSLGEAGGI APPKRTCGAR VRRLMKRVTL CAALAALSAC GVAPGMRMKQ PANVPVSSAA 
ADAPAEAGRK PRGEQLPVPI TDIDLSLIRT LRDAQQAPRR AADLVSPASG YTIGRGDVLQ
ITVWDHPELA AALGTQQQTA ARAADAPAGF VVDQDGTLQY PYVGRIAVAG LKPEQVQARL
ARQLAQTFRD PQVTVRIASF RAKQVYIEGE VHTPGSQALN DIPMTLYDAV SRAGGFSASA
DQRRVTLVRD GVERRIDLSG AAQGVNPSRI VLRDGDLLRI PPRDESGVFV MGEVNRPVTA
LPMRNGRLTL SEALSQAGSL NATTADAAQL YVIRGSLDAK PHVYRLDASS PVAMVLANQF
ELEPKDIVYV DGNGLVRFSR VLSLLLPAVN AGLTAAVVTK