Gene BURPS1710b_A0216 details

Gene Information

Locus tag: BURPS1710b_A0216
Symbol:
ID: 3693219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 327192
End bp: 328469
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 60%
IMG OID: 637730470
Product: putative surface layer protein
Protein accession: YP_335375
Protein GI: 76817920
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02276] 40-residue YVTN family beta-propeller repeat
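
The coordinate and length fields above agree with each other. A minimal sketch of that arithmetic follows, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields above.
START_BP = 327192
END_BP = 328469


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1278 425
```

Both results match the Gene Length and Protein Length fields.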


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.585544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCATCTT CAGATTCCAA CCCTCCCGAA AGGAATGCCA TGAATTCGAC ATGGATGCGC 
AAATATGGAG AAAGAATCGG AAAGCCTTTT TTCCCAAAAG TCAGCGGCCT TATTCTCCCG
GCCGCCGTAT CTGCCGGCGG CTTGCCATTT TCCGTCGCCC TCGCCTTATT CTCGGCGTGC
CGTGCGATTA TCATATACGC GCTATCCTTT TTATCGATCC TTGCCGCCGT ACCGGCATAT
GGGGCCAGCC CTCCGGGCCC TGACGGCGAG TTATATGTCC CCGAAGCGCA TCACAACAAA
ATCAGCGTAA TCGATACCAA GACGTCCAAA ATAATTCGAA GCATTCCGAT CAACGATATT
CCCGTCCTTC CATTAGGTAG CCGGCCCACC GTACTGGTTG CGACACCCGA CGGGAACAAG
ATTTACAGCG ACAATTTCGG ACTGATTCCG CCGACGATCA CTGCCGTCGA TCGAAAGACC
GGCAGCGTTA AATCGATTCA ATTGTCGAGC GTCCCTCTCG GCGCATCGAT CTCGGAAGAC
GGCAAGGAAA TCTATCTGCC GCAAGGCACG TACAGCGTCG AGGTCGTCGC CACCGATAGC
GACAAGGTCG TACGCCGCCT CTCGTTTTCC GATGTCCCGG TCGCGGCGAT CAAAGGCCCC
GACGGCCATT TGTACGTCGG TTTCGCGAAC GGGGCGATCG GCTCGTTCGA CGTCCGGACG
GGAAAGGCCC TGCGCCAACC GATCGAAACG GGCGGAACAT TGCCGGCCTG GTTCACGTTC
ACGCGGGACG GCAAAAAGCT GTATGTCGAT GCCGTGAATG CAATCGGCGT CATCGATCTG
CGCAGTTGGA CGCTCGTCAA GCGGATCCCG ACGGGCGACG ACGCATCGCA GCGTTCGACG
GACCCGTGGC CGTTCACGTC GACGCTCGCG CCCGACGGCA AGAAACTCTA CGTTACGCTG
CTCGGCGAGT CCGGCGTGCT AGTGATCGAC GTGGCGACAG ACCGCGTCGT CGCGAAGATC
AAAACGGCAG GCTCCACCAC GGGCGTCGCA TTCAGCGCCG ACGGCTCGCG CGGCTACATC
ACCGACATGG GCCCCTCGTT GTCGTTCCTG AAGACCCCTG CGGGCGGCGC GCTGCTGGGC
AATGTCTGGA TCGGCTTCGG TCTGCTGGGC TCCGGGCAGG TGGTCGTGTT CGATCCGAAG
ACGGACGAAC CGGTCGGCGA GCCGATTCCG ACCACACCGG GGCCCGGCAT CCCGGTCTGG
GTTCCGCCGC GCGGCTAG
 
Protein sequence
MASSDSNPPE RNAMNSTWMR KYGERIGKPF FPKVSGLILP AAVSAGGLPF SVALALFSAC 
RAIIIYALSF LSILAAVPAY GASPPGPDGE LYVPEAHHNK ISVIDTKTSK IIRSIPINDI
PVLPLGSRPT VLVATPDGNK IYSDNFGLIP PTITAVDRKT GSVKSIQLSS VPLGASISED
GKEIYLPQGT YSVEVVATDS DKVVRRLSFS DVPVAAIKGP DGHLYVGFAN GAIGSFDVRT
GKALRQPIET GGTLPAWFTF TRDGKKLYVD AVNAIGVIDL RSWTLVKRIP TGDDASQRST
DPWPFTSTLA PDGKKLYVTL LGESGVLVID VATDRVVAKI KTAGSTTGVA FSADGSRGYI
TDMGPSLSFL KTPAGGALLG NVWIGFGLLG SGQVVVFDPK TDEPVGEPIP TTPGPGIPVW
VPPRG
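
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG is an accepted initiation codon. A minimal sketch of such a translation follows. It is an illustration only, not the pipeline that produced this record, and `translate_cds` is a hypothetical helper. It assumes the input is a complete CDS read in frame from its first base.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11),
# one 16-codon row per first base.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # T..
    "LLLLPPPPHHQQRRRR"   # C..
    "IIIMTTTTNNKKSSRR"   # A..
    "VVVVAAAADDEEGGGG"   # G..
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGCATCTT CAGATTCCAA CCCTCCCGAA"))  # MASSDSNPPE
```

The start-codon rule is applied to the first codon only, so the function should be given the full gene sequence rather than an internal fragment.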