Gene BURPS1106A_3274 details

Gene Information

Locus tag: BURPS1106A_3274
Symbol:
ID: 4901135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3189624
End bp: 3190913
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 58%
IMG OID: 640136500
Product: capsule polysaccharide biosynthesis protein
Protein accession: YP_001067511
Protein GI: 126452213
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3562] Capsule polysaccharide export protein
TIGRFAM ID:
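
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3189624, 3190913)
print(length_bp)                  # 1290
print(protein_length(length_bp))  # 429
```

Both results agree with the Gene Length and Protein Length fields.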


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGGGCT GCAAGGCGGG TGTTGTCATT TTGTCGGCGA CTTGTTCGAG CGCGTCGGCC 
TTCATAACTC AATCAGATAA GAATCGTATG TCCCGCTTCT TCCTTGCCCT GCAAGGCACG
GCCTCTCCTT TTTTTGGTCG ACTCGCTGCC GGTCTCGGCC AGCGGGGCCA CCAGGTTCGG
CGTGTGAATT TTTGCGGCGG AGATCTCGCG TATCAAGGTT CGGAAAGCGC TTGGAACTAT
CGCGACGAAC CCGAAGGCCT GGTTGCGTGG TATCGCGATG CCATTGCGAC CAATGGAGTG
ACGGATGTGC TTCTGTTTGG CGACTGCCGT GCGATCCACC GGCCGATGCA TGAGATCGCT
CGCGCATCGG GGGTGCGTGT TCACGTATTC GAAGAGGGGT ATGTTCGACC GCACTGGATC
ACAATGGAAA GGCACGGCGT CAACGGCCGA TCGTTGCTGC CGCGCGACCC GGCTTACTAT
CTCGACGCAC GCCGGCATAT CCCGCCAGCG GTACCCGGGA AACCGACCGG CTACAACCTG
TACGAGCGCG CCTGCCACGA TATCAGGTAT CGCATGGCCA ACGCGTTGTA CGCGCATCGT
TTCCCGCATT ACAAGTCGCA CCGTCCGAGA AACGGCTTAC AGGAGTACGC GGGCCTCGCG
TATCGCGCCG TTCAGCAACA CGTGCGCGAT AGGGAGGCCG AGAACGTCAC CCGTGATCTG
CTGGAACGAA AACGCCGCTA CTATCTGTTT CCGCTGCAGC TCAATTCCGA CTCCCAGATC
GTCGATCATT CCCCTTTTGG CGGCATTTGC GACGCGATAG CGATTGTTTT ACACTCATTC
GCCGAAAATG CGCCCGACGA CAGTTGGCTT GTCATCAAGA ATCATCCGTT GGACACCGGT
CTGATCGGCT ACCGTCAATT TGCAACGGCA TTGGCCACTG AACTGGGTAT CGAGAAGAGA
ATGGCCTTCA TCGATGCGGG CCACTTGCCG ACGTTACTCG ATCAATGTCG TGGCGTGGTC
GTGATAAACA GCACGGTCGG TTTGTCCGCC GTCCACCATC GACGCCCGCT CGTTGCATTG
GGCACCGCGA TCTATTCGAT GCCGGGGCTG ACTTGGCAAG GCAGCCTGGC GGACTTTTGG
ACGGAGGCTG GTAGCCCGGA CATGAATCTC TATCAGGCTT TTCTCGACTA CGTGATGCAC
CATACGCAGA TCAACGGAGA TTTCTATACG CGCACCGGTA TAGAGATGAG CGTCGCCGGC
GCCGTGAGCC GGCTCGAGGC GGTGTCGTGA
 
Protein sequence
MTGCKAGVVI LSATCSSASA FITQSDKNRM SRFFLALQGT ASPFFGRLAA GLGQRGHQVR 
RVNFCGGDLA YQGSESAWNY RDEPEGLVAW YRDAIATNGV TDVLLFGDCR AIHRPMHEIA
RASGVRVHVF EEGYVRPHWI TMERHGVNGR SLLPRDPAYY LDARRHIPPA VPGKPTGYNL
YERACHDIRY RMANALYAHR FPHYKSHRPR NGLQEYAGLA YRAVQQHVRD REAENVTRDL
LERKRRYYLF PLQLNSDSQI VDHSPFGGIC DAIAIVLHSF AENAPDDSWL VIKNHPLDTG
LIGYRQFATA LATELGIEKR MAFIDAGHLP TLLDQCRGVV VINSTVGLSA VHHRRPLVAL
GTAIYSMPGL TWQGSLADFW TEAGSPDMNL YQAFLDYVMH HTQINGDFYT RTGIEMSVAG
AVSRLEAVS
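
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 shares the standard codon-to-amino-acid assignments and differs only in accepting alternative start codons (such as GTG, which begins this gene) that are read as M at the first position; the start-codon set below is an assumption to verify against the NCBI genetic code tables. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names, not part of the database.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed set of initiation codons for table 11; read as M at position 0 only.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons initiate with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGACGGGCT GCAAGGCGGG TGTTGTCATT"))  # MTGCKAGVVI
```

The output matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value to compare with the GC content field in the record.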