Gene Information |
Locus tag | BURPS1710b_1717 |
Symbol | |
ID | 3691924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 1841754 |
End bp | 1843331 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637728173 |
Product | prohead protease |
Protein accession | YP_333118 |
Protein GI | 76810932 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family |
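
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon; the record does not state either convention, although the reported values fit both. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1841754, 1843331)
print(length, protein_length_aa(length))  # 1578 525
```

Both results match the Gene Length (1578 bp) and Protein Length (525 aa) reported in the table.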
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.155167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAAAAAT CGTTCTCGTC AATCGAGATT AAATCTGTTC AAGAAGATCG GCGAGAGATA GAGGGTATCG CCTCAACTCC GACGCCTGAC AGGGTAAATG ACGTGGTAGA GCCTCTGGGC CTCACGTTCC AGAAAGAAAC GCCCCTCCTA CTTAATCACA AGTCCGACCA GCCTGTAGGC ACGGTGCAAT TCGGCACGCC TACCGCTAAG GGACTTCCCT TCAAGGCAAA GATTGCGAAG GTGGACGAAG AAGGCGTTGT GAAGCAGCGC ACCGACGAAG CGTGGCATAG CGTCAAGACG CGCCTTATCA GGGGCGTCTC TATTGGGTTC ATCGCCAGGG CAACCAGCCC GCTCCCGAAT GGTGGGACCC GGTTCACAAA AGCGGAAGTC CATGAGTTGT CGCTTACCGC GATTCCCGCT AATCCGGAAG CAAAGATTAC GGGGTTCAAG GCACTTCCGG AAGTGCCTGA CAGCGCGCGG TCGACCCTCG ACCTATCCAT CCTTCCGCCT GAACTTGCTG CGATGTACAC GGCGGGGTTG GCGCGACGCG AAGCTGAGCA GAAGGCCGCC GAAGCAGCAC GTAAAGAGCA AGAAACCATC AACACGAAAG AGGAAAACGA TATGCAGAAG ACGAACAACA CTAACCACAT TTTCATTCGC GGCGCTATCG CGAAGGCCGT GACGATGGAG GGCGGGGCAG AGGGCTACGC GTCGATGCGG TGGGGGGCGG GTTCGAAAAC GGTGGAGTAC ATCAAGGCGA TTGCTAGCCC CATGACGGCC GGCGTAGACG GAAGCGGTGC ATTGACCTCA GGTACTTTGA GTCGCCAGCA GTTCGTCCAA GCTGTGTTTA GTCATTCGAT CCTCGGGCAG CTTCGGGGGG TGATTCGTGT ACCGGCGATG ACGCGCGTCA ATGTGGAAAA TGAGCCAACA GCCGCTGCGT TCTTCGGCCC CGGCGTGCCT TGTCCGACTG CACAAGGCAC GTTCGGGGTG CATATGGCCG ACAAGCGGAA GATCGGCGTC ACAGAAGTGA TCTCGGAAGA ACTCGCCCGT GCTACCGATG AAGCGGCTGA GGTAACTATT AGTGCGATTC TCCAACGTGC CCTGAGTCGA GGGTTGGATA ACGCATTCAT TGGAAGCCAA ACACGGGGCG AGGTTTCCCC TGCTGGCCTT GGGACGGTTG CAGTAAAAGC CGCAAATTTT GAGGCAGGCC TTGAAGTGTT TACAGGCGAC CTGACCATGG CAAGTGTGAT TGTCAATCCA CGTACAGCAG TCGCTTTGCG CAGCCCGACC GAAACTCAGA TTACCGCGAC CGGGGGCATC TACAAGGGGC TGCCCGCAAT CGCATCATGC GCCGTTCCTC TGGGCAAACT TCTAATTGTG GATGGTAGTC GGGTGCTGGC TCATATCGGA GACGTGGAGA TTCTCGCACT TCGTCATGCT GACGTATACA CATTGCATGG AGGTGCGTCC CCCTCGGTCC CGGTCAACAT GTTTCAGACC AATCAAGTAG CCCTCCAGGC GGGCCAGTAC GCAGACTGGG ATTTCGTTGA CGGTGCTGCT ATTGAGGTTG GGGTCTAA
|
Protein sequence | MQKSFSSIEI KSVQEDRREI EGIASTPTPD RVNDVVEPLG LTFQKETPLL LNHKSDQPVG TVQFGTPTAK GLPFKAKIAK VDEEGVVKQR TDEAWHSVKT RLIRGVSIGF IARATSPLPN GGTRFTKAEV HELSLTAIPA NPEAKITGFK ALPEVPDSAR STLDLSILPP ELAAMYTAGL ARREAEQKAA EAARKEQETI NTKEENDMQK TNNTNHIFIR GAIAKAVTME GGAEGYASMR WGAGSKTVEY IKAIASPMTA GVDGSGALTS GTLSRQQFVQ AVFSHSILGQ LRGVIRVPAM TRVNVENEPT AAAFFGPGVP CPTAQGTFGV HMADKRKIGV TEVISEELAR ATDEAAEVTI SAILQRALSR GLDNAFIGSQ TRGEVSPAGL GTVAVKAANF EAGLEVFTGD LTMASVIVNP RTAVALRSPT ETQITATGGI YKGLPAIASC AVPLGKLLIV DGSRVLAHIG DVEILALRHA DVYTLHGGAS PSVPVNMFQT NQVALQAGQY ADWDFVDGAA IEVGV
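
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11 (listed above), in which TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation using only the Python standard library. The codon table is built from the standard amino-acid assignments, and the start-codon set is the one listed for table 11 as best recalled here; treat both as assumptions to verify against the NCBI genetic code tables. `translate_cds` is a hypothetical helper name, not a function of any existing library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("TTGCAAAAAT CGTTCTCGTC AATCGAGATT"))  # MQKSFSSIEI
```

The output matches the first ten residues of the protein sequence in the record. Because the helper strips whitespace, the space-separated 10-base blocks can be pasted in directly.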