Gene BURPS668_A2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2620 
Symbol 
ID4887766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2518628 
End bp2519803 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID640132557 
Productcapsular polysaccharide biosynthesis/export protein 
Protein accessionYP_001063613 
Protein GI126445515 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAATC GTTCGCTTAG ACCCCTGGCG CTCGCCGTCG CCGCCGCCAC GCTGCTGCAG 
GCGTGCGCGA CGGCGCCCGG CAACTACCTC GACACGTCGC GTCTCGACGA CAAGGACAGC
CAGTCCGCCG AGCATTACAA CGTGCAGCTC ATTACCGCGC AGCTCGTCGT TTCGCAGGCC
GACGCGCAGC GCAAGGCTGG GCCGTTGCCG CCGGCGCGCT TCGTCGATCC GATGCAGTAC
GTCTACCGGA TCGCGCCGCA GGACATTCTC GGCGTGACCG TCTGGGATCA TCCGGAGCTC
ACGACGCCGC AAGGCCAATC GTTCTCGAGC GGCGGCAACA CGACGCAGAC GGTCGCGGGC
GCGCTGCAGC AGCCGTATGC GAATGCATTG CCCGGCCAGG CCGATCCGTA CGGCCAGACG
GTGATGTCCG ACGGCACGAT CTACTTTCCG TTCGTCGGCC GCCTGCACGC GGCGGGCAAG
ACGGTCGGCC AGGTGCGCGA CGAACTCGCC GCGCGGCTGG CGCGTTACGT GAAGAATCCG
CAGGTCGACG TGCGCGTGCT GTCGTATCGC AGCCAGAAGG TGCAGGTGAC TGGCGAAGTG
AAGACGCCCG GCCCGCTTGC GATCACCGAT GTGCCGCTCA CGCTCGTGGA CGCGATCACG
CGCTCGGGCG GCTCGACGAA CGAGGCCGAC CTGCAGCGCG TGCGCCTCAC GCGCGACGGC
AAGTTCTACC AACTCGACGC GAACGGCATG CTCGATCGCG GCGACGTCAC GCAGAACGTG
ATGCTGCAGC CGGGCGACAT CGTCAACGTG CCGGACCGCG GCGACAGCCG CGTGTTCGTG
ATGGGCGAGG TGAAGACGCC CGCGACGGTG CCGATGCTCA AGGGGCGCTT GACGATCGCG
GACGCGCTCA CGGCGGGAGG CGGCATTCTC GATACCGATG CGAATCCGCG TCAGGTGTAC
GTGTTGCGCG ATCTGCAGGA CAAGCCGAAC ACACCGGACA TCTTCCGCCT CGACATGACG
CAGCCCGACG CGCTGATGCT GTCGAGCCGC TTCCAGTTGA AGCCGCTCGA CGTCGTGTAC
GTCGGCACGG CGGGATCGGT GCGCTTCAAC CGCCTGCTGC AGCAGATTTT CCCGACGATC
CAGTCGATTT ACTACATGAA GCAGATCACG CGCTGA
 
Protein sequence
MLNRSLRPLA LAVAAATLLQ ACATAPGNYL DTSRLDDKDS QSAEHYNVQL ITAQLVVSQA 
DAQRKAGPLP PARFVDPMQY VYRIAPQDIL GVTVWDHPEL TTPQGQSFSS GGNTTQTVAG
ALQQPYANAL PGQADPYGQT VMSDGTIYFP FVGRLHAAGK TVGQVRDELA ARLARYVKNP
QVDVRVLSYR SQKVQVTGEV KTPGPLAITD VPLTLVDAIT RSGGSTNEAD LQRVRLTRDG
KFYQLDANGM LDRGDVTQNV MLQPGDIVNV PDRGDSRVFV MGEVKTPATV PMLKGRLTIA
DALTAGGGIL DTDANPRQVY VLRDLQDKPN TPDIFRLDMT QPDALMLSSR FQLKPLDVVY
VGTAGSVRFN RLLQQIFPTI QSIYYMKQIT R