Gene BURPS668_3238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3238 
Symbol 
ID4883772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3173324 
End bp3174526 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content59% 
IMG OID640129166 
Productcapsule polysaccharide biosynthesis protein 
Protein accessionYP_001060249 
Protein GI126439773 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.333462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGCT TCTTCCTTGC CCTGCAAGGC ACGGCCTCTC CTTTTTTTGG TCGACTCGCT 
GCCGGTCTCG GTCAGCGGGG CCACCAGGTT CGGCGTGTGA ATTTTTGCGG CGGAGATCTC
GCGTATCAAG GTTCGGAAAG CGCTTGGAAC TATCGCGACG AACCCGAAGG CCTGGTTGCG
TGGTATCGCG ATGCCATTGC GACCAATGGA GTGACGGATG TGCTTCTGTT TGGCGACTGC
CGTGCGATCC ACCGGCCGAT GCATGAGATC GCTCGCGCAT CGGGGGTGCG TGTTCACGTA
TTCGAAGAGG GGTATGTTCG ACCGCACTGG ATCACAATGG AAAGACACGG CGTCAACGGC
CGATCGTTGC TGCCGCGCGA CCCGGCTTAC TATCTCGACG CACGCCGGCA TATCCCGCCA
GCGGTACCCG GGAAACCGAC CGGCTACAAC CTGTACGAGC GCGCCTGCCA CGATATCAGG
TATCGCATGG CCAACGCGTT GTACGCGCAT CGTTTCCCGC ATTACAAGTC GCACCGTCCG
AGAAACGGCT TACAGGAGTA CGCGGGCCTC GCGTATCGCG CCGTTCAGCA ACACGTGCGC
GATAGGGAGG CCGAGAACGT CACCCGTGAT CTGCTGGAAC GAAAACGCCG CTACTATCTG
TTTCCGCTGC AGCTCAATTC CGACTCCCAG ATCGTCGATC ATTCCCCGTT TGGCGGCATT
TGCGACGCGA TAGCGATTGT TTTACACTCA TTCGCCGAAA ATGCGCCCGA CGACAGTTGG
CTTGTCATCA AGAATCATCC GTTGGACACC GGTCTGATCG GCTACCGTCA ATTTGCAACG
GCATTGGCCA CTGAACTGGG TATCGAGAAG AGAATGGCCT TCATCGATGC GGGCCACTTG
CCGACGTTAC TCGATCAATG TCGTGGCGTG GTCGTGATAA ACAGCACGGT CGGTTTGTCC
GCCGTCCACC ATCGACGCCC GCTCGTTGCA TTGGGCACCG CGATCTATTC GATGCCGGGG
CTGACTTGGC AAGGCAGCCT GGCGGACTTT TGGACGGAGG CTGGTAGCCC GGACATGAAT
CTCTATCAGG CTTTTCTCGA CTACGTGATG CACCATACGC AGATCAACGG AGATTTCTAT
ACGCGCACCG GTATAGAGAT GAGCGTCGCC GGCGCCGTGA GCCGGCTCGA GGCGGTGTCG
TGA
 
Protein sequence
MSRFFLALQG TASPFFGRLA AGLGQRGHQV RRVNFCGGDL AYQGSESAWN YRDEPEGLVA 
WYRDAIATNG VTDVLLFGDC RAIHRPMHEI ARASGVRVHV FEEGYVRPHW ITMERHGVNG
RSLLPRDPAY YLDARRHIPP AVPGKPTGYN LYERACHDIR YRMANALYAH RFPHYKSHRP
RNGLQEYAGL AYRAVQQHVR DREAENVTRD LLERKRRYYL FPLQLNSDSQ IVDHSPFGGI
CDAIAIVLHS FAENAPDDSW LVIKNHPLDT GLIGYRQFAT ALATELGIEK RMAFIDAGHL
PTLLDQCRGV VVINSTVGLS AVHHRRPLVA LGTAIYSMPG LTWQGSLADF WTEAGSPDMN
LYQAFLDYVM HHTQINGDFY TRTGIEMSVA GAVSRLEAVS