Gene BURPS668_3090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3090 
Symbol 
ID4883592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3031195 
End bp3033108 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content65% 
IMG OID640129018 
Productcapsular polysaccharide biosynthesis protein 
Protein accessionYP_001060102 
Protein GI126440288 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGAG CATTTTGCCC GAAGGCTTCG TGGCTTTCCC TGGGCGCCTT TCTGTTCGAT 
CTGATGGCGG TCGTATGCGC GTGGCTGATC GCGTATGTGG TCCGGTTCAA TGGTGCGATG
CCGCCTGAAT TCCTGAAAGG CGCCTTCGTC GCGCTCGCAT GGGTGGTGCC GTTGTATGGC
CTGCTGTTTC ATGTTTTCGG GTTGTATCGC GGTCTCTGGG TATTTGCGAG CCTGCCGGAT
CTGCTGCGTA TCTCGAAGGC CGTGGTGGGC GGCGGCTTGA TCGTGATGAT CGGCGCGGTG
ATGCTGCAGC CGGTTCCGAT CATTCCGCGT TCGGTTCTCG TGCTGTCGCC GATGCTGCTG
TTTCTTGCGA TGGGCGGTGC ACGTGCGCTC TATCGGGCGA CGAAGGAGTT CTATCTGTAC
GGCGGACTGA TCGGGCAGGG CAAGCCCGTG CTGATTCTCG GCGCCGGCAG TGCCGGCGCG
AGCCTCGCCC GTGAACTGTC GCGCTCCGGT GAGTGGCGTC TTGCCGGTCT GCTCGATGAC
GATCCCGCGA AACACGGGCG CGAAGTCTAC GGCTATAAGG TACTGGGTTC GATCGGCGAA
GTCGCAGACT GGGCCGAGGC CGCGAAAGCC GAATACGCGA TCATCGCGAT TCCATCCGCA
TCGGTCGAAG CGCAACGTCG CTTCGCCACC CTCTGCGTGC GCGCCGGCGT CAGGGCCATG
GTGCTGCCGT CGCTGACCGC ATTGATGCCG GGCCAGGGCA TTCTGTCGCA GATCCGCCAG
ATCGATCTCG AAGACCTGCT CGGCCGCGAG GCCGTGACGA TCGACACGCC GCACGTCGAG
GCGCTGCTGC GCGGCCGCGT CGTGATGGTG ACGGGCGCGG GCGGCTCGAT CGGCTCGGAG
CTGTGCCGCC AGATCCTGAA GTTCCAGCCC GCGCAACTGA TCGCGTTCGA TCTGTCCGAA
TATGCGATGT ACCGGCTCAC GGAAGAGTTG CGCGAGCGCT TCCCCGATCT GCCCGTCGTG
CCGATCATCG GCGATGCGAA GGATTCGCTG CTGCTCGATC AGGTGATGTC GCGCTATGCG
CCGCACATCG TGTTCCATGC GGCCGCGTAC AAGCACGTGC CGTTGATGGA GGAGCTCAAC
GCATGGCAGG CGCTGCGCAA CAACGTGCTC GGCACGTATC GCGTCGCGCG CGCGGCGATT
CGCCACGACG TCAGGCACTT CGTGCTGATC TCGACCGACA AGGCCGTGAA CCCGACGAAC
GTGATGGGCG CGAGCAAGCG CCTGGCCGAG ATGGCTTGCC AGGCGCTGCA GCAGACGAGC
GCGCGCACGC AGTTCGAGAC GGTGCGCTTT GGCAACGTGC TCGGCAGCGC CGGCAGCGTG
ATTCCGAAAT TTCAGCAGCA GATCGCCAAG GGCGGCCCGG TGACCGTCAC GCATCCGGAA
ATCACGCGCT TCTTCATGAC GATTCCCGAG GCGTCGCAAC TCGTGCTGCA GGCGTCGAGC
ATGGGGCAGG GCGGCGAGAT CTTCATTCTC GACATGGGCG AGCCGGTGAA GATCGTCGAT
CTCGCGCGCG ATCTGATTCG CCTGTACGGC TTCACCGAGG AGCAGATCCG CATCGAGTTC
AGCGGGCTGC GCCCGGGCGA GAAGCTCTAT GAGGAACTGC TCGCCGACGA CGAGACCACC
ACCCGCACGC CACACCCGAA GCTGCGCACC GCGAAGGCGC GCGAGGTGCC CGATCATCTG
CTCGACGAAC TGCTGCCGTG GCTGATGCAG CACCGCGTGC TGAGCGACGA TGAAGTGCGG
CGCGACTTGC GGCGCTGGGT GCCCGAGTAT CAGCCGGCCG TCGGCCCGAC GCTGCAGAGC
GTGCCGACCG GCAACGGGCT CGTCGCCGGC ATCGGGCGCG AGGCGGGGCG CTGA
 
Protein sequence
MTRAFCPKAS WLSLGAFLFD LMAVVCAWLI AYVVRFNGAM PPEFLKGAFV ALAWVVPLYG 
LLFHVFGLYR GLWVFASLPD LLRISKAVVG GGLIVMIGAV MLQPVPIIPR SVLVLSPMLL
FLAMGGARAL YRATKEFYLY GGLIGQGKPV LILGAGSAGA SLARELSRSG EWRLAGLLDD
DPAKHGREVY GYKVLGSIGE VADWAEAAKA EYAIIAIPSA SVEAQRRFAT LCVRAGVRAM
VLPSLTALMP GQGILSQIRQ IDLEDLLGRE AVTIDTPHVE ALLRGRVVMV TGAGGSIGSE
LCRQILKFQP AQLIAFDLSE YAMYRLTEEL RERFPDLPVV PIIGDAKDSL LLDQVMSRYA
PHIVFHAAAY KHVPLMEELN AWQALRNNVL GTYRVARAAI RHDVRHFVLI STDKAVNPTN
VMGASKRLAE MACQALQQTS ARTQFETVRF GNVLGSAGSV IPKFQQQIAK GGPVTVTHPE
ITRFFMTIPE ASQLVLQASS MGQGGEIFIL DMGEPVKIVD LARDLIRLYG FTEEQIRIEF
SGLRPGEKLY EELLADDETT TRTPHPKLRT AKAREVPDHL LDELLPWLMQ HRVLSDDEVR
RDLRRWVPEY QPAVGPTLQS VPTGNGLVAG IGREAGR