Gene BURPS668_3249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3249 
Symbol 
ID4881721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3183243 
End bp3184526 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content56% 
IMG OID640129177 
Productputative capsular polysaccharide biosynthesis protein 
Protein accessionYP_001060260 
Protein GI126439225 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCAGTC GGGAGCGGCT GGCGCGCCAA ATTGTGTTCC ATCACATTCC CAAGACGGCG 
GGATCGTCGT TCAATCAGAT ACTTCGCACG CTATATCGCG ACGACGAAGT ATGCGACGCT
GCGTTGGATG ACGAACTCGA TGAAGTGATG GCCGACGAGA CGCGTCGTTA CGAGCTGTTT
GTCGGGCATT TCAGCTTCGA CGCGCTGCAT CGGCACTTCG GCGGCGCCAC GCGTTTGACT
TTTCTTCGCG ATCCGGTTCA GCGCTGTATT TCCCAGTATC ACAACTGGCA TGACGCTTCG
CGCTATTCGG ATGCGTGGAT CGGGCGCAGC GACACGAATC CGGACGTCAT CAAGGCGCTG
AAGATGACGT CCGAGATGTC GCTTTGTGAA TTTGTGAGTT CGGATAATCT CGTGATTTCC
GACAGCGCTC AAAACATGAT GACTCGCTAC CTCGCGCCGA GCGTCGAATG GAAGAAGGAG
CGTGGATACT ATGACGCCGA GCTTGTCGAG AAAGCCAAGC GCAATCTCGT CGAGTATTTT
CATTTTTTTG GCCTGACCGA GCAATTTGAT CGTTCACTAG TGCTTCTTGC GCATACCCTC
GGTATCCGCC CATGGGAACG GAGCGATGCA CTGCTAACTA ATCGAAATCC GAAGAAGGCT
TCGTTCGACA GTGTTTACAA TACCACGCCA GAAGAAGGCG GTGTTTTACG CGATTACAAC
TTGATGGATA TCGAGTTGTA CGAGTTCGCG GTAAAGGAAT TCAATCGCCG CTTCGACGCG
GGATACCAGA AGCTTGTGGA GTGCGCCTTT GAGTATCTCG CTGACAAGGA CACTCGCGAC
ATGGGTAATG CTGGCGATTT TTACGCGTTC GACATGACGA ACGCAGTCGG CGCCCGAGGT
TTGCATTTTC TGGAATCCAC CCGGTTGCCG TGTGGTGCGA ATGTTCTTGG ACGTTGGACA
GGGCTGGAGC CGCGAGCTGT ATGGGAGATT CCGCTTCGCG CGGGGCGCGA CAGCCATGTC
GTGATCGAAG TGGACTATAT CGATAGCGTG TCGCCGGAGG CCCTGGCGCC GGAGCATTTC
ACGTTAAACG GCATGCCGGC CAGGCAGCAT GCGTTCAGCG CGGAGGGCTC GATCCAGCGT
CTGCGCCTGG TCTTTTCCGC CGGCGCCGCG CTTGCCGGCA GAATGTTGCA CACGCTGAAA
TTGACTACTC CGCTTGTGCG TGCGGAAGAC GGAACGCGCG ACGTTGGAGT GCTTCTATTG
CGCTTGCAGT CTTACAGCGT TTAG
 
Protein sequence
MRSRERLARQ IVFHHIPKTA GSSFNQILRT LYRDDEVCDA ALDDELDEVM ADETRRYELF 
VGHFSFDALH RHFGGATRLT FLRDPVQRCI SQYHNWHDAS RYSDAWIGRS DTNPDVIKAL
KMTSEMSLCE FVSSDNLVIS DSAQNMMTRY LAPSVEWKKE RGYYDAELVE KAKRNLVEYF
HFFGLTEQFD RSLVLLAHTL GIRPWERSDA LLTNRNPKKA SFDSVYNTTP EEGGVLRDYN
LMDIELYEFA VKEFNRRFDA GYQKLVECAF EYLADKDTRD MGNAGDFYAF DMTNAVGARG
LHFLESTRLP CGANVLGRWT GLEPRAVWEI PLRAGRDSHV VIEVDYIDSV SPEALAPEHF
TLNGMPARQH AFSAEGSIQR LRLVFSAGAA LAGRMLHTLK LTTPLVRAED GTRDVGVLLL
RLQSYSV