Gene BURPS1710b_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_0335 
Symbol 
ID3690743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp348361 
End bp349452 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content66% 
IMG OID637726791 
ProductPhage portal protein 
Protein accessionYP_331751 
Protein GI76812188 
COG category[R] General function prediction only 
COG ID[COG5518] Bacteriophage capsid portal protein 
TIGRFAM ID[TIGR01540] phage portal protein, PBSX family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.127245 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCGTCGT GGCGCCGCCG CGCGTCGACG ACGGCGTCTT CCGCGTGCTC GAACGTCACC 
AGTTCCGCGG CAACGACTTT GAGGAACAGG CCGCGCCAAA TCCGAGCGCC GGCGCGCGCC
GAGGTCTTCA CGTTCGACGA TCCGACGCCC GTCATGAACC GGGCCGAGAT TCTCGATTAC
GTCGAGTGCT GGTCGAACGG CGAATGGTTC GAGCCGCCGG TCAGCTTCGC CGGCTTGGCG
AAATCGTTTC GCGCGAGCAC GCACCATAGC TCGGCGCTGT TCTTCAAGGC GAACGTGCTG
GCGTCGACGT TTCGCCCGCA CCGCTGGCTG TCGCGGCACG CGTTCGAGCG GTGGGCGCTC
GATTTCCTGA CGTTCGGCAA CGGCTATCTG GAACGCCGCC GCAACATGGT CGGCGGCACG
CTGCGGCTCG AGCCCGCGCT CGCGAAGTAC GTACGGCGCA AGGCCGATTT CAGCGGCTTC
GTGTACGTGA ACGGCTGGCA GGAGCGGCAC GAGTTCGCGC CCGACAGCGT GTTCCAGCTC
GTGCGGCCGG ACATCAATCA GGAGGTCTAT GGCCTGCCCG AGTATCTGAG CTCGCTGCAC
TCGGCGTGGC TGAACGAATC GTCGACGCTG TTCCGGCGCA AGTATTACGA GAACGGCAGC
CACGCCGGCT TCATCCTGTA CATGACCGAC GCCGCGCAGA AGCAGGACGA CGTGGACAAC
ATGCGCGACG CGCTGAAGAA CGCGAAGGGG CCGGGCAACT TCCGCAACGT GTTCATGTAC
GCGCCGGGCG GGAAGAAGGA CGGCATCCAG CTCATTCCCG TGTCCGAGGT CGCCGCGAAG
GACGAGTTCT TCAACATCAA GAACGTGACG CGCGATGACC TGCTCGCCGC ACACCGCGTG
CCGCCGCAGT TGCTTGGCAT CGTGCCGAGC AATTCGGGCG GGTTCGGCAC GCCGGACACC
GCCGCGCGCG TGTTCGGGCG CAACGAAATC AGGCCGCTAC AGGCGCGCTT CGCCGAGCTG
AACGACTGGC TCGGCGACGA GGTCGTGAGG TTCGACGATT ACGAGATTCC GCCGGCGCCG
GTCGCGGCGT AG
 
Protein sequence
MSSWRRRAST TASSACSNVT SSAATTLRNR PRQIRAPARA EVFTFDDPTP VMNRAEILDY 
VECWSNGEWF EPPVSFAGLA KSFRASTHHS SALFFKANVL ASTFRPHRWL SRHAFERWAL
DFLTFGNGYL ERRRNMVGGT LRLEPALAKY VRRKADFSGF VYVNGWQERH EFAPDSVFQL
VRPDINQEVY GLPEYLSSLH SAWLNESSTL FRRKYYENGS HAGFILYMTD AAQKQDDVDN
MRDALKNAKG PGNFRNVFMY APGGKKDGIQ LIPVSEVAAK DEFFNIKNVT RDDLLAAHRV
PPQLLGIVPS NSGGFGTPDT AARVFGRNEI RPLQARFAEL NDWLGDEVVR FDDYEIPPAP
VAA