Gene BURPS1106A_3284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3284 
Symbol 
ID4902594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3199543 
End bp3200826 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content56% 
IMG OID640136510 
Productputative capsular polysaccharide biosynthesis protein 
Protein accessionYP_001067521 
Protein GI126453898 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.32234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCAGTC GGGAGCGGCT GGCGCGCCAA ATTGTGTTCC ATCACATTCC CAAGACGGCG 
GGATCGTCGT TCAATCAGAT ACTTCGCACG CTATATCGCG ACGACGAAGT ATGCAACGCT
GCGTTGGATG ACGAACTCGA TGAAGTGATG GCCGACGAGA CGCGTCGTTA CGAGCTGTTT
GTCGGGCATT TCAGCTTCGA CGCGCTGCAT CGGCACTTCG GCGGCGCCAC GCGTTTGACT
TTTCTTCGCG ATCCGGTTCA GCGCTGTATT TCCCAGTATC ACAACTGGCA TGACGCCTCG
CGCTATTCGG ATGCGTGGAT CGGGCGCAGC GACACGAATC CGGACGTCAT CAAGGCGCTG
AAGATGACGT CCGAGATGTC GCTTGGTGAA TTTGTGAGTT CGGACAATCT CGTGATTTCC
GACAGCGCTC AAAACATGAT GACTCGCTAC CTCGCGCCGA GCGTCGAATG GAAGAAGGAG
CGTGGATACT ATGACGCCGA GCTTGTCGAG AAAGCCAAGC GCAATCTCGT CGAGTATTTT
CATTTTTTTG GCCTGACCGA GCAATTTGAT CGTTCACTAG TGCTTCTTGC GCATACCCTC
GGTATCCGCC CATGGGAACG GAGCGATGCA CTGCTAACTA ACCGCAATCC GAAGAAGGCT
TCGTTCGACA GTGTTTACAA TACCACGCCA GAAGAAGGCG GTGTTTTACG CGATTACAAC
TTGATGGATA TCGAGTTGTA CGAGTTCGCG GTAAAGGAAT TCAATCGCCG CTTCGACGCG
GGATACCAGA AGCTTGTCGA GTGCGCCTTT GAGTATCTCG CTGACAAGGA CACTCGCGAC
ATGGGTAATG CTGGCGATTT TTACACGTTC GACATGACGA ACGCGGTCGG CGCCCGAGGT
TTGCATTTTC TGGAATCCAC CCGGTTGCCG TGCGGTGCGA ATGTTCTTGG ACGTTGGACA
GGGCTGGAGC CGCGAGCTGT ATGGGAGATT CCGCTTCGCG CGGGGCGCGA CAGCCATGTC
GTGATCGAAG TGGACTATAT CGATAGCGTG TCGCAGGAGG CCCTGGCGCC GGAGCATTTC
ACGTTAAACG GCATGCCGGC CAGGCAGCAT GCGTTCAGCG CGGAGGGCTC GATCCAGCGT
CTGCGCCTGG TCTTTTCCGC CGGCGCCGCG CTTGCCGGCA GAATGTTGCA CACGCTGAAA
TTGACTACTC CGCTTGTGCG TGCGGAAGAC GGAACGCGCG ACGTTGGAGT GCTTCTATTG
CGCTTGCAGT CTTACAGCGT TTAG
 
Protein sequence
MRSRERLARQ IVFHHIPKTA GSSFNQILRT LYRDDEVCNA ALDDELDEVM ADETRRYELF 
VGHFSFDALH RHFGGATRLT FLRDPVQRCI SQYHNWHDAS RYSDAWIGRS DTNPDVIKAL
KMTSEMSLGE FVSSDNLVIS DSAQNMMTRY LAPSVEWKKE RGYYDAELVE KAKRNLVEYF
HFFGLTEQFD RSLVLLAHTL GIRPWERSDA LLTNRNPKKA SFDSVYNTTP EEGGVLRDYN
LMDIELYEFA VKEFNRRFDA GYQKLVECAF EYLADKDTRD MGNAGDFYTF DMTNAVGARG
LHFLESTRLP CGANVLGRWT GLEPRAVWEI PLRAGRDSHV VIEVDYIDSV SQEALAPEHF
TLNGMPARQH AFSAEGSIQR LRLVFSAGAA LAGRMLHTLK LTTPLVRAED GTRDVGVLLL
RLQSYSV