Gene BURPS668_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1517 
Symbol 
ID4884933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1480168 
End bp1482942 
Gene Length2775 bp 
Protein Length924 aa 
Translation table11 
GC content58% 
IMG OID640127445 
Productputative phage HK97 tail length tape measure-related protein 
Protein accessionYP_001058558 
Protein GI126438418 
COG category[S] Function unknown 
COG ID[COG5281] Phage-related minor tail protein 
TIGRFAM ID[TIGR01541] phage tail tape measure protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.813512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCATCA GTAACAACAA TACCACTGTT CGGTACTCGG TCGATGCGTC TGGCGTACAG 
GCCGGTGTAA ATCAAATCCG GGCAGCAAAC GCCCAGCTTA ACGCATCGCA GATCGAAGTA
TTCCGCAGGC AGGAAGCCGT TACGCAGGCA ATGAGAGAAG CCGCCAGCAA CGGCTACAAC
CTCACCGCAC GCGAAGCAAA GAAGCTCGTA GACCAGTACG ACCGACTCCA GGCCACAGCA
GGCAAGACGC GCCTGGAAAT GCTCAACCAA CAAGCCGCCG CGCGAGGCGT TACTCAGGCA
TTTGCAGCGC AGGCAGCGGC TATTGCGGAT GCATCGAAGA AAACGCACGA ACTCAACTTG
AACAGTGTTG GTGCACGGCG CGAAATGATG GTCCTCGCTC ACGAAGCCGC AACGGGCAGT
TGGAAGAACT TCGCCGGTTC CATGATGGTG ATGGCGGAGC AGGTCGACGC TATGAAGTAC
GCCACGCACC CATTGGCTTT GGGGCTCGCG GCGGCCGGTA CGGCAGCTTA CGCGTTTTAC
AAGGGCATTT CCAACGCCAA CGCACAGTAC AAGGCGTTTA ACGACGCGAT GAACTCCACG
GGCGGCTACG CGCAACAGAC ACGGGAATCC ATCCAGGGGC TTGCGGAGGA CTTGTCTAAG
CGGTTCGGTG TCGGTATCAG CACGGCGACG AACGGGCTTA ATCAACTGGT GGCGACGGGG
CGCGTAACTG CTGATATTTT CCCGCAGGTT GGGGCCGTGG CGTTGGCGAT GTCGAAGTCC
TCGGGAGAGG CATTCGATAA GACCGTGGAA TCGCTCTTGA AGCAGCAGGA CGAGGTTAAG
CGCGCGGCGG AGGAGTACCA ACGCACGCAC CACTCGATGT CAGACGCCAA CATGGCGCTT
ATCGAGTCTC TTGAAAAGAC GGGCCAAAAG CATGAAGCGT TCAAGATTCT CATCCAGCAG
CAGTTGGCCG ATATCGAGCG CGAAACCAAG GCCAGCACGG AGCATCAGGC AGGATTCTGG
GATAACGTTA CAGCCTCAAT GCAACGCTAT TCGCGCGCGC TTGCTGGTAA GTCCACTGAT
CTCGACATCC TCAACGACCT CAAGTCGAAG CAAGCCGCCC AACTTGGCGG GCGTATGCCG
GGGGACTACA CGGATTACGG CCCGCTGATT GCCGCTCAAC AGAAGATCGT TGACGCGAAC
AAGGCCAGTC AAGAGGCCGC CGCTAGAGAA GCTGCGGATA AGGCAGCATT GGCCGATTCG
CTCCGGACCG TGACGGCAGA GTACGAACGC ACAAAGTCGG CGCAACAACG CCTTACGGAT
GCGGTCAATC GTGATAACGC GATCATCGAC ACGCGCATTT CGTTGCTGAC GAAGCAGGGA
AAGATGACGG AGTCTGTCCG GGCGCAGTTG GAGGCCCAGC GAAAGCAGAT GATCGCCTTC
GATACCGAAC ACATCACTCC GACTCGGAAG CGCGGCGGCG GCTCCGCAAT CGCGGCATTG
AATGCAGAGA CGCAGACAGG CTTGGCGATT CGGCAGATTA TCGAACAGCA GGCGGAGAAG
CAGCTACAGG CCCAGCGACA GCTAGGCGTA ATCGACGCGG AGACATACTA CCGCAAGCTT
ACCGACCTGC AAAAATCTGC GCTGGACGAC CAAATTGCAC TCGCGAACAG ACGTGCCGAT
GCGCTTCGAT CTTCGTCAGA CAAGCGAGCC TACACGGAAG CGGCGGCGGC AGTTGAAAAA
CTCCACCTTC AACGCAAGGG CTTGGATTCG AGTTTGCAAG ATACGTTGGC GGGTCTGTCG
CAGCGACGTG ACATTGATGT GCGCCGCTAC GTCATGGGCC TAGACCAGAT GAATACGCAG
CAGGAGGATG CGTACCAATA TCAGGACACA ACCCGCAACA TGACCGCACG GGCGAAGGCC
GAATTCGACG CACGGTACGC GTTGCAACAA CAGTATCAGC AGCGTGTGCG GCAGTTGGTC
GAACAATACG CGCTCGATCC TACCTCGGAT ATGAAGCAAT ACGCGGAAAA ACTCCGAGCG
GAACAGGCAT ATCTCGCGGA GCGCAGCGCG GGAATGGAAC GATTCTTTGC CCGTGAGGAG
GCGCGGCGTA ATAGCTTCGA GGCTCAGATG AAGGATGGTC TATCGTCACT CGGCGGCGAC
GCCATGACGA ACGCGGAACT TGCCAAGACG GCCTTTGTTA CTGCGTGGCA GGATTCGCAG
AGTGCCTTGG AGCAGTTCAT TACGAGCGGC GAGGGCAATT TCAAGAAATT CACGGCGAGC
ATCCTTGCTG ACCTTGCGAA GATCGCGCTC CGTCAGGCTG AGGTATTCGC GATCCAGAGT
ATCGGCAGTT CGTTCGGATT CTTCAGCGAA GGCGGCCCGG TTGGTCATTT CGCGTCGGGC
GGTGCAATCA GCGGTCCCGG CACTGGTACA AGCGACAGCA TCCCCGCGAT GCTCTCTAAC
GGCGAGTTCG TCGTCAATGC AGCGTCAACG AAGAAGTACC GTAGCCTGCT TGAGTCGATC
AACTCCGGTC ATATGGCGCA CTTCGCGTCG GGCGGTATCG CTGCAACGCT CGCGCCGTCT
CCTGTCGCCT CTATGTCGTC GGCGGGTGAG AGAACAAATC TCACGCTCAA TCTGAACGGA
GGCGGTAACG TTCTAACAGC GGAGGATCTG AAATATTTGG CTTCGCAGAT TCAGGGGCTT
ATCGACATCC AGGTTCACAA GCGGATTACC GAGCAGGGCG GATATGCATA CCAAATCCGT
AACGGTCTGC TGTAA
 
Protein sequence
MSISNNNTTV RYSVDASGVQ AGVNQIRAAN AQLNASQIEV FRRQEAVTQA MREAASNGYN 
LTAREAKKLV DQYDRLQATA GKTRLEMLNQ QAAARGVTQA FAAQAAAIAD ASKKTHELNL
NSVGARREMM VLAHEAATGS WKNFAGSMMV MAEQVDAMKY ATHPLALGLA AAGTAAYAFY
KGISNANAQY KAFNDAMNST GGYAQQTRES IQGLAEDLSK RFGVGISTAT NGLNQLVATG
RVTADIFPQV GAVALAMSKS SGEAFDKTVE SLLKQQDEVK RAAEEYQRTH HSMSDANMAL
IESLEKTGQK HEAFKILIQQ QLADIERETK ASTEHQAGFW DNVTASMQRY SRALAGKSTD
LDILNDLKSK QAAQLGGRMP GDYTDYGPLI AAQQKIVDAN KASQEAAARE AADKAALADS
LRTVTAEYER TKSAQQRLTD AVNRDNAIID TRISLLTKQG KMTESVRAQL EAQRKQMIAF
DTEHITPTRK RGGGSAIAAL NAETQTGLAI RQIIEQQAEK QLQAQRQLGV IDAETYYRKL
TDLQKSALDD QIALANRRAD ALRSSSDKRA YTEAAAAVEK LHLQRKGLDS SLQDTLAGLS
QRRDIDVRRY VMGLDQMNTQ QEDAYQYQDT TRNMTARAKA EFDARYALQQ QYQQRVRQLV
EQYALDPTSD MKQYAEKLRA EQAYLAERSA GMERFFAREE ARRNSFEAQM KDGLSSLGGD
AMTNAELAKT AFVTAWQDSQ SALEQFITSG EGNFKKFTAS ILADLAKIAL RQAEVFAIQS
IGSSFGFFSE GGPVGHFASG GAISGPGTGT SDSIPAMLSN GEFVVNAAST KKYRSLLESI
NSGHMAHFAS GGIAATLAPS PVASMSSAGE RTNLTLNLNG GGNVLTAEDL KYLASQIQGL
IDIQVHKRIT EQGGYAYQIR NGLL