Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1517 |
Symbol | |
ID | 4884933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1480168 |
End bp | 1482942 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640127445 |
Product | putative phage HK97 tail length tape measure-related protein |
Protein accession | YP_001058558 |
Protein GI | 126438418 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.813512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCATCA GTAACAACAA TACCACTGTT CGGTACTCGG TCGATGCGTC TGGCGTACAG GCCGGTGTAA ATCAAATCCG GGCAGCAAAC GCCCAGCTTA ACGCATCGCA GATCGAAGTA TTCCGCAGGC AGGAAGCCGT TACGCAGGCA ATGAGAGAAG CCGCCAGCAA CGGCTACAAC CTCACCGCAC GCGAAGCAAA GAAGCTCGTA GACCAGTACG ACCGACTCCA GGCCACAGCA GGCAAGACGC GCCTGGAAAT GCTCAACCAA CAAGCCGCCG CGCGAGGCGT TACTCAGGCA TTTGCAGCGC AGGCAGCGGC TATTGCGGAT GCATCGAAGA AAACGCACGA ACTCAACTTG AACAGTGTTG GTGCACGGCG CGAAATGATG GTCCTCGCTC ACGAAGCCGC AACGGGCAGT TGGAAGAACT TCGCCGGTTC CATGATGGTG ATGGCGGAGC AGGTCGACGC TATGAAGTAC GCCACGCACC CATTGGCTTT GGGGCTCGCG GCGGCCGGTA CGGCAGCTTA CGCGTTTTAC AAGGGCATTT CCAACGCCAA CGCACAGTAC AAGGCGTTTA ACGACGCGAT GAACTCCACG GGCGGCTACG CGCAACAGAC ACGGGAATCC ATCCAGGGGC TTGCGGAGGA CTTGTCTAAG CGGTTCGGTG TCGGTATCAG CACGGCGACG AACGGGCTTA ATCAACTGGT GGCGACGGGG CGCGTAACTG CTGATATTTT CCCGCAGGTT GGGGCCGTGG CGTTGGCGAT GTCGAAGTCC TCGGGAGAGG CATTCGATAA GACCGTGGAA TCGCTCTTGA AGCAGCAGGA CGAGGTTAAG CGCGCGGCGG AGGAGTACCA ACGCACGCAC CACTCGATGT CAGACGCCAA CATGGCGCTT ATCGAGTCTC TTGAAAAGAC GGGCCAAAAG CATGAAGCGT TCAAGATTCT CATCCAGCAG CAGTTGGCCG ATATCGAGCG CGAAACCAAG GCCAGCACGG AGCATCAGGC AGGATTCTGG GATAACGTTA CAGCCTCAAT GCAACGCTAT TCGCGCGCGC TTGCTGGTAA GTCCACTGAT CTCGACATCC TCAACGACCT CAAGTCGAAG CAAGCCGCCC AACTTGGCGG GCGTATGCCG GGGGACTACA CGGATTACGG CCCGCTGATT GCCGCTCAAC AGAAGATCGT TGACGCGAAC AAGGCCAGTC AAGAGGCCGC CGCTAGAGAA GCTGCGGATA AGGCAGCATT GGCCGATTCG CTCCGGACCG TGACGGCAGA GTACGAACGC ACAAAGTCGG CGCAACAACG CCTTACGGAT GCGGTCAATC GTGATAACGC GATCATCGAC ACGCGCATTT CGTTGCTGAC GAAGCAGGGA AAGATGACGG AGTCTGTCCG GGCGCAGTTG GAGGCCCAGC GAAAGCAGAT GATCGCCTTC GATACCGAAC ACATCACTCC GACTCGGAAG CGCGGCGGCG GCTCCGCAAT CGCGGCATTG AATGCAGAGA CGCAGACAGG CTTGGCGATT CGGCAGATTA TCGAACAGCA GGCGGAGAAG CAGCTACAGG CCCAGCGACA GCTAGGCGTA ATCGACGCGG AGACATACTA CCGCAAGCTT ACCGACCTGC AAAAATCTGC GCTGGACGAC CAAATTGCAC TCGCGAACAG ACGTGCCGAT GCGCTTCGAT CTTCGTCAGA CAAGCGAGCC TACACGGAAG CGGCGGCGGC AGTTGAAAAA CTCCACCTTC AACGCAAGGG CTTGGATTCG AGTTTGCAAG ATACGTTGGC GGGTCTGTCG CAGCGACGTG ACATTGATGT GCGCCGCTAC GTCATGGGCC TAGACCAGAT GAATACGCAG CAGGAGGATG CGTACCAATA TCAGGACACA ACCCGCAACA TGACCGCACG GGCGAAGGCC GAATTCGACG CACGGTACGC GTTGCAACAA CAGTATCAGC AGCGTGTGCG GCAGTTGGTC GAACAATACG CGCTCGATCC TACCTCGGAT ATGAAGCAAT ACGCGGAAAA ACTCCGAGCG GAACAGGCAT ATCTCGCGGA GCGCAGCGCG GGAATGGAAC GATTCTTTGC CCGTGAGGAG GCGCGGCGTA ATAGCTTCGA GGCTCAGATG AAGGATGGTC TATCGTCACT CGGCGGCGAC GCCATGACGA ACGCGGAACT TGCCAAGACG GCCTTTGTTA CTGCGTGGCA GGATTCGCAG AGTGCCTTGG AGCAGTTCAT TACGAGCGGC GAGGGCAATT TCAAGAAATT CACGGCGAGC ATCCTTGCTG ACCTTGCGAA GATCGCGCTC CGTCAGGCTG AGGTATTCGC GATCCAGAGT ATCGGCAGTT CGTTCGGATT CTTCAGCGAA GGCGGCCCGG TTGGTCATTT CGCGTCGGGC GGTGCAATCA GCGGTCCCGG CACTGGTACA AGCGACAGCA TCCCCGCGAT GCTCTCTAAC GGCGAGTTCG TCGTCAATGC AGCGTCAACG AAGAAGTACC GTAGCCTGCT TGAGTCGATC AACTCCGGTC ATATGGCGCA CTTCGCGTCG GGCGGTATCG CTGCAACGCT CGCGCCGTCT CCTGTCGCCT CTATGTCGTC GGCGGGTGAG AGAACAAATC TCACGCTCAA TCTGAACGGA GGCGGTAACG TTCTAACAGC GGAGGATCTG AAATATTTGG CTTCGCAGAT TCAGGGGCTT ATCGACATCC AGGTTCACAA GCGGATTACC GAGCAGGGCG GATATGCATA CCAAATCCGT AACGGTCTGC TGTAA
|
Protein sequence | MSISNNNTTV RYSVDASGVQ AGVNQIRAAN AQLNASQIEV FRRQEAVTQA MREAASNGYN LTAREAKKLV DQYDRLQATA GKTRLEMLNQ QAAARGVTQA FAAQAAAIAD ASKKTHELNL NSVGARREMM VLAHEAATGS WKNFAGSMMV MAEQVDAMKY ATHPLALGLA AAGTAAYAFY KGISNANAQY KAFNDAMNST GGYAQQTRES IQGLAEDLSK RFGVGISTAT NGLNQLVATG RVTADIFPQV GAVALAMSKS SGEAFDKTVE SLLKQQDEVK RAAEEYQRTH HSMSDANMAL IESLEKTGQK HEAFKILIQQ QLADIERETK ASTEHQAGFW DNVTASMQRY SRALAGKSTD LDILNDLKSK QAAQLGGRMP GDYTDYGPLI AAQQKIVDAN KASQEAAARE AADKAALADS LRTVTAEYER TKSAQQRLTD AVNRDNAIID TRISLLTKQG KMTESVRAQL EAQRKQMIAF DTEHITPTRK RGGGSAIAAL NAETQTGLAI RQIIEQQAEK QLQAQRQLGV IDAETYYRKL TDLQKSALDD QIALANRRAD ALRSSSDKRA YTEAAAAVEK LHLQRKGLDS SLQDTLAGLS QRRDIDVRRY VMGLDQMNTQ QEDAYQYQDT TRNMTARAKA EFDARYALQQ QYQQRVRQLV EQYALDPTSD MKQYAEKLRA EQAYLAERSA GMERFFAREE ARRNSFEAQM KDGLSSLGGD AMTNAELAKT AFVTAWQDSQ SALEQFITSG EGNFKKFTAS ILADLAKIAL RQAEVFAIQS IGSSFGFFSE GGPVGHFASG GAISGPGTGT SDSIPAMLSN GEFVVNAAST KKYRSLLESI NSGHMAHFAS GGIAATLAPS PVASMSSAGE RTNLTLNLNG GGNVLTAEDL KYLASQIQGL IDIQVHKRIT EQGGYAYQIR NGLL
|
| |