Gene Information |
Locus tag | BURPS1710b_1716 |
Symbol | |
ID | 3688963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 1838894 |
End bp | 1841668 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637728172 |
Product | putative phage HK97 tail length tape measure-related protein |
Protein accession | YP_333117 |
Protein GI | 76811076 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
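
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record; both are assumptions that happen to agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1838894
end_bp = 1841668
gene_length_bp = 2775
protein_length_aa = 924

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should consist of whole codons; the last codon is assumed to be the stop codon.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, remainder, residues)  # 2775 0 924
```

Under these assumptions the span matches the listed gene length (2775 bp) and the codon count minus the stop codon matches the listed protein length (924 aa).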
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.309525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCATCA GTAACAACAA TACTACTGTT CGGTACTCGG TCGATGCGTC TGGCGTACAG GCCGGTGTAA ATCAAATCCG GGCAGCAAAC GCCCAGCTTA ACACATCGCA GATCGAAGTA TTCCGCAGGC AGGAAGCCGT TACGCAGGCA ATGAGAGAAG CCGCCAGCAA CGGCTACAAC CTCACCGCAC GCGAGGCAAA GAAGCTCGTA GACCAGTACG ACCGACTCCA GGCCACAGCA GGCAAGACGC GCCTGGAAAT GCTCAACCAA CAAGCCGCCG CGCGAGGCGT TACTCAGGCA TTTGCAGCGC AGGCAGCGGC TATTGCGGAT GCATCGAAGA AAACGCACGA ACTCAACTTG AACAGTGTTG GTGCACGGCG CGAAATGATG GTCCTCGCTC ACGAAGCCGC AACAGGCAGT TGGAAGAACT TCGCCGGTTC CATGATGGTG ATGGCGGAGC AGGTCGACGC TATGAAGTAC GCCACGCACC CATTGGCTTT GGGGCTCGCG GCGGCCGGTA CGGCAGCTTA CGCGTTTTAC AAGGGCATTT CCAACGCCAA CGCACAGTAC AGGGCGTTTA ACGACGCGAT GAACTCCACG GGCGGCTACG CGCAACAGAC ACGGGAATCC ATCCAGGGGC TTGCGGAGGA CTTGTCCAAG CGGTTCGGTG TCGGTATCAG CGCGGCGACG AACGGGCTTA ATCAACTGGT GGCGACGGGT CGCGTAACTG CTGATATTTT CCCGCAGGTT GGGGCCGTGG CGTTGGCGAT GTCGAAGTCC TCGGGAGAGG CATTCGATAA GACCGTGGAA TCGCTCTTGA AGCAGCAGGA CGAGGTTAAG CGCGCGGCGG AGGAGTACCA ACGCACGCAC CACTCGATGT CAGACGCCAA CATGGCGCTT ATCGAGTCTC TTGAAAAGAC GGGCCAAAAG CATGAAGCGT TCAAGATTCT CATCCAGCAG CAGTTGGCCG ATATCGAGCG CGAAACCAAG GCCAGCACGG AGCATCAGGC AGGATTCTGG GATAACGTTA CAGCCTCAAT GCAACGCTAT TCGCGCGCGC TTGCTGGTAA GTCCACTGAT CTCGACATCC TCAACGACCT CAAGTCGAAG CAAGCCGCCC AACTTGGCGG GCGTATGCCG GGGGACTACA CGGATTACGG CCCGCTGATT GCCGCTCAAC AGAAGATCGT TGATGCGAAC AAGGCCAGTC AAGAGGCCGC CGCTAGAGAA GCCGCAGATA AGGCCGCATT GGCCGATTCG CTCCGGACGG TGACGGCCGA GTACGAGCGC ACGAAGTCAG CGCAGCAACA CCTTAGCGAT GCCGTAAAGC ACGATAACGC GGTCATTGAC ACGCGTATTG CGTTGCTGAC GAAGCAGGGG AAGATGACCG ATTCCGTGCG GGCGCAGCTT GAAGCCCAGC GAAAGCAGAT GATCGCCTTC GATACTGAAC ACATCACCCC GACTCGGAAA CACGGTAGCG GTTCCGCCAT CGCGGAGATG AACGCCGAGA CTCAGACGGG AATGGCAATT CGGCAATTGA TTGAGCAGCA GGCGGAGAAG CAGTTACAGG CGCAGCGCCA GCTAGGCGTG ATTGACGCCG AGACGTATTA CCGCAGGCTT ACCGACCTGC AAAAATCCGC TCTGAACGAC CAAATTGCAC TTGCGAGCAG ACGCGCCGAT GTACTTCGAT CTTCGTCAGA CAAGAGGGCC TACACGGAAG CTGCGGCGGC AGTTGAAAAA CTCCAGCTTC AGCAGAAGGG CTTGGATACG AGTCTGCAAG ATACGTTGAC GGGTCTGTCG 
CAGCGACGTG ACATCGATGT GCGCCGGTAC GTCATGGGCC TGGGCCAGAT GAATACGCAG CAGGAGGATG CGTACCAATA TCAGGACACA ACCCGCAACA TGACCGCACG GGCCAAGGCC GAATTCGACG CACGGTACGC GTTGCAACAG CAGTATCAGC AGCGCGTTCG GCAGTTGGTC GAACAATACG CGCTCGATCC TACCTCGGAT ATGAAGCAAT ACGCGGAAAA ACTCCGAGCG GAACAGGCGT ATCTCGCGGA GCGTAGCGCA GGAATGGAAC GATTCTTTGT CCGTGAGGAG GCGCGGCGTA ATAGCTTCGC GGCTCAGATG AAAGACGGTC TATTGTCTCT CGGCGGTGAC GCCATGACTA ACGCGGAGCT TGCCCGGACT GCCCTTGTAA CTGCGTGGCA GGATTCGCAG AGCGCTTTGG AGCAATTCAT TACGAGCGGC GAGGGAAATT TCAAGAAGTT CACGGCGAGC ATTCTGGGCG ACCTTGCGAA AATCGCGTTG CGCCAAGCCG AAGTGTTCGC GATCCAGAGC ATCAGCAGTT CGTTCGGCTC ATTCTTTAGC GAGGGCGGCC CGGTGCTGCA CCGCGCAGGC GGCGGCCCCA TCGCCGGCCC AGGCACAACG ACCAGTGACA GCATTCCCGC GATGCTTTCG AACGGGGAAT TCGTCATCAA TGCAGCGTCT ACGAGGAAAT ACCGCAGCCT GCTTGAGTCC ATCAACTCGG GCCACATGGC GCACTTTGCG ACCGGAGGCA TTGCAAGTTC TCTTGCACCA TCTCCGGCCC CTATGTCGGG TGGCGGTGAT AGGCCTCACT TTACCGTCAA TCTGAACGGA GGGCACGGCG GATTAACTGA GGCTGACGTG GCGTCCCTGG TTACGCAATT CCAGTCGATC GTGGATGTTC AGCTACACAA GCGGATGGCG GAGCAGGGCG GATATGCATA CAAGATGAGG TACGGCCTGC TGTAA
|
Protein sequence | MSISNNNTTV RYSVDASGVQ AGVNQIRAAN AQLNTSQIEV FRRQEAVTQA MREAASNGYN LTAREAKKLV DQYDRLQATA GKTRLEMLNQ QAAARGVTQA FAAQAAAIAD ASKKTHELNL NSVGARREMM VLAHEAATGS WKNFAGSMMV MAEQVDAMKY ATHPLALGLA AAGTAAYAFY KGISNANAQY RAFNDAMNST GGYAQQTRES IQGLAEDLSK RFGVGISAAT NGLNQLVATG RVTADIFPQV GAVALAMSKS SGEAFDKTVE SLLKQQDEVK RAAEEYQRTH HSMSDANMAL IESLEKTGQK HEAFKILIQQ QLADIERETK ASTEHQAGFW DNVTASMQRY SRALAGKSTD LDILNDLKSK QAAQLGGRMP GDYTDYGPLI AAQQKIVDAN KASQEAAARE AADKAALADS LRTVTAEYER TKSAQQHLSD AVKHDNAVID TRIALLTKQG KMTDSVRAQL EAQRKQMIAF DTEHITPTRK HGSGSAIAEM NAETQTGMAI RQLIEQQAEK QLQAQRQLGV IDAETYYRRL TDLQKSALND QIALASRRAD VLRSSSDKRA YTEAAAAVEK LQLQQKGLDT SLQDTLTGLS QRRDIDVRRY VMGLGQMNTQ QEDAYQYQDT TRNMTARAKA EFDARYALQQ QYQQRVRQLV EQYALDPTSD MKQYAEKLRA EQAYLAERSA GMERFFVREE ARRNSFAAQM KDGLLSLGGD AMTNAELART ALVTAWQDSQ SALEQFITSG EGNFKKFTAS ILGDLAKIAL RQAEVFAIQS ISSSFGSFFS EGGPVLHRAG GGPIAGPGTT TSDSIPAMLS NGEFVINAAS TRKYRSLLES INSGHMAHFA TGGIASSLAP SPAPMSGGGD RPHFTVNLNG GHGGLTEADV ASLVTQFQSI VDVQLHKRMA EQGGYAYKMR YGLL
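
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 (bacterial), where GTG can act as a start codon. The following is a minimal sketch of how the protein could be re-derived from the gene sequence and how a GC percentage could be computed. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical, the codon table and start codon set are written out by hand from the NCBI table 11 definition, and this is not the database's own pipeline.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Codons that table 11 allows as translation starts (all are read as M when initiating).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; the first codon becomes M, translation stops at the first stop."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate_cds("GTGAGCATCA GTAACAACAA TACTACTGTT"))  # MSISNNNTTV
```

Passing the full gene sequence to `translate_cds` should give a string that can be compared against the protein sequence once its spaces are removed, and `gc_percent` can be compared against the listed GC content.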