Gene BURPS1710b_1717 details


Gene Information

Locus tag: BURPS1710b_1717
Symbol:
ID: 3691924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 1841754
End bp: 1843331
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 57%
IMG OID: 637728173
Product: prohead protease
Protein accession: YP_333118
Protein GI: 76810932
COG category:
COG ID:
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
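
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1841754
END_BP = 1843331
GENE_LENGTH_BP = 1578
PROTEIN_LENGTH_AA = 525


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count, assuming the last codon of the CDS is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1843331 - 1841754 + 1 gives 1578 bp, and 1578 / 3 gives 526 codons, of which 525 encode amino acids.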


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.155167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCAAAAAT CGTTCTCGTC AATCGAGATT AAATCTGTTC AAGAAGATCG GCGAGAGATA 
GAGGGTATCG CCTCAACTCC GACGCCTGAC AGGGTAAATG ACGTGGTAGA GCCTCTGGGC
CTCACGTTCC AGAAAGAAAC GCCCCTCCTA CTTAATCACA AGTCCGACCA GCCTGTAGGC
ACGGTGCAAT TCGGCACGCC TACCGCTAAG GGACTTCCCT TCAAGGCAAA GATTGCGAAG
GTGGACGAAG AAGGCGTTGT GAAGCAGCGC ACCGACGAAG CGTGGCATAG CGTCAAGACG
CGCCTTATCA GGGGCGTCTC TATTGGGTTC ATCGCCAGGG CAACCAGCCC GCTCCCGAAT
GGTGGGACCC GGTTCACAAA AGCGGAAGTC CATGAGTTGT CGCTTACCGC GATTCCCGCT
AATCCGGAAG CAAAGATTAC GGGGTTCAAG GCACTTCCGG AAGTGCCTGA CAGCGCGCGG
TCGACCCTCG ACCTATCCAT CCTTCCGCCT GAACTTGCTG CGATGTACAC GGCGGGGTTG
GCGCGACGCG AAGCTGAGCA GAAGGCCGCC GAAGCAGCAC GTAAAGAGCA AGAAACCATC
AACACGAAAG AGGAAAACGA TATGCAGAAG ACGAACAACA CTAACCACAT TTTCATTCGC
GGCGCTATCG CGAAGGCCGT GACGATGGAG GGCGGGGCAG AGGGCTACGC GTCGATGCGG
TGGGGGGCGG GTTCGAAAAC GGTGGAGTAC ATCAAGGCGA TTGCTAGCCC CATGACGGCC
GGCGTAGACG GAAGCGGTGC ATTGACCTCA GGTACTTTGA GTCGCCAGCA GTTCGTCCAA
GCTGTGTTTA GTCATTCGAT CCTCGGGCAG CTTCGGGGGG TGATTCGTGT ACCGGCGATG
ACGCGCGTCA ATGTGGAAAA TGAGCCAACA GCCGCTGCGT TCTTCGGCCC CGGCGTGCCT
TGTCCGACTG CACAAGGCAC GTTCGGGGTG CATATGGCCG ACAAGCGGAA GATCGGCGTC
ACAGAAGTGA TCTCGGAAGA ACTCGCCCGT GCTACCGATG AAGCGGCTGA GGTAACTATT
AGTGCGATTC TCCAACGTGC CCTGAGTCGA GGGTTGGATA ACGCATTCAT TGGAAGCCAA
ACACGGGGCG AGGTTTCCCC TGCTGGCCTT GGGACGGTTG CAGTAAAAGC CGCAAATTTT
GAGGCAGGCC TTGAAGTGTT TACAGGCGAC CTGACCATGG CAAGTGTGAT TGTCAATCCA
CGTACAGCAG TCGCTTTGCG CAGCCCGACC GAAACTCAGA TTACCGCGAC CGGGGGCATC
TACAAGGGGC TGCCCGCAAT CGCATCATGC GCCGTTCCTC TGGGCAAACT TCTAATTGTG
GATGGTAGTC GGGTGCTGGC TCATATCGGA GACGTGGAGA TTCTCGCACT TCGTCATGCT
GACGTATACA CATTGCATGG AGGTGCGTCC CCCTCGGTCC CGGTCAACAT GTTTCAGACC
AATCAAGTAG CCCTCCAGGC GGGCCAGTAC GCAGACTGGG ATTTCGTTGA CGGTGCTGCT
ATTGAGGTTG GGGTCTAA
 
Protein sequence
MQKSFSSIEI KSVQEDRREI EGIASTPTPD RVNDVVEPLG LTFQKETPLL LNHKSDQPVG 
TVQFGTPTAK GLPFKAKIAK VDEEGVVKQR TDEAWHSVKT RLIRGVSIGF IARATSPLPN
GGTRFTKAEV HELSLTAIPA NPEAKITGFK ALPEVPDSAR STLDLSILPP ELAAMYTAGL
ARREAEQKAA EAARKEQETI NTKEENDMQK TNNTNHIFIR GAIAKAVTME GGAEGYASMR
WGAGSKTVEY IKAIASPMTA GVDGSGALTS GTLSRQQFVQ AVFSHSILGQ LRGVIRVPAM
TRVNVENEPT AAAFFGPGVP CPTAQGTFGV HMADKRKIGV TEVISEELAR ATDEAAEVTI
SAILQRALSR GLDNAFIGSQ TRGEVSPAGL GTVAVKAANF EAGLEVFTGD LTMASVIVNP
RTAVALRSPT ETQITATGGI YKGLPAIASC AVPLGKLLIV DGSRVLAHIG DVEILALRHA
DVYTLHGGAS PSVPVNMFQT NQVALQAGQY ADWDFVDGAA IEVGV
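
The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below shows one way to translate the space-grouped gene sequence and to compute its GC percentage. It is an illustration, not the pipeline used to produce this record: the function names are hypothetical, and it assumes that an alternative start codon such as TTG is read as methionine at the first position.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; table 11 uses the
# same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(AMINO_ACIDS) == 64
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; the first codon becomes M, translation stops at '*'."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence in this record.
first_line = "TTGCAAAAAT CGTTCTCGTC AATCGAGATT AAATCTGTTC AAGAAGATCG GCGAGAGATA"
print(translate_cds(first_line))  # MQKSFSSIEIKSVQEDRREI
```

The printed peptide matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.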