Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0796 |
Symbol | |
ID | 4887710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 767451 |
End bp | 769742 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640130736 |
Product | Rhs element Vgr protein |
Protein accession | YP_001061795 |
Protein GI | 126442945 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCTGA TCGAACTGCG CAGTCCCCTG CTGGACCCGG ATGCGGTGGC GCTGAGCTTC GTGGTGCACG AGAGCTTGTC GCAGGAGCCG TCGTATCAAC TGGATCTGCT GAGCCACGAT CCGGATCTGG ACTTCGACGC GCTGCTCGGC TCGACGCTGT CGGCCGACAT CGACCTGGGC GAAGGCGACA TCCGGACGTT CAACACGCAC GTGTTCGGCG GCTACGACAC GGGGCAGATG AGCGGGCAAT ACACGTACAC GCTGGAGCTG CGAAGCTGGC TGTCGTTTCT CGCGGAGAAC CGCAACAGCC GGATCTTCCA GAACATGAGC GTGCCGCAGA TCGTCGAGCA GGTGTTCCAG GGCCATCAGC GCAACGGCTA CCGGTTCGAG CTCGAAGGCA CGTACGAGCC GCGCGAGTAC TGCGTGCAGT TTCAGGAAAC GGATCTGAAC TTCGTGAAGC GGCTGCTGGA GGACGAAGGG ATCTACTTCT GGGTGGAGCA CGAGCCGGAT CGTCATGTGG TGGTGATCTC GGACACGCAG CGGTTCGAGG ATCTGCCGCT GCCGAACGAC ACGCTGGAGT ATCTGCCGGA CGGCGAGGAG TCGCGCGCGA TCCAGGGGCG CGAAGGGGTG CAGCGGCTGC AGCGCACGCG GCGGATCAAG TCGAACAACG TCGCGCTGCG GGATTTCGAC TATCACGCGC CGTCGAACAA GCTCGACAGC GACGCGCAGC AGGTGTCGCC GCCGAACCTC GAAGGGATTC CGCTCGAGTA TTACGACTAT GCGGCCGGCT ACCGCGAGCC CGAGCAGGGC GAGCGTCTCG CGCGGCTGCG GCTCGAGGCG ATTCAGGCGG AATCACACAC GCTGGTGGGC GAGGCGAACG CGCGCGCGCT GGCCACGGGC CGGGCGTTCA CGTTGATCGG GCATCCGGCG TTGGGGCGCA ATCGTCGGTA CTACGTGACG AACAGCGAGC TGACGTTCAT CCAGGACGGA CCGGACAGCA CGTCGCAGGG GCGCAACGTC GCGGTGAAGT TCCGCGCGCT TGCCGACGAT CAGCCGTTTC GGCCGCTGTT GACGACGCCG CGTCCCGAAG TGCCGGGCAT CCAGAGCGCG ACGGTGGTGG GTCCGGAGAT GTCGGAAGTG CATACCGACA AGCTCGGGCG GATTCGCGTG CATTTCCATT GGGACCGCTA CAAGACGACC GAGGCGGATG CGTCGTGCTG GATACGCGTG TCGCAGGCAT GGGCGGGCAA GGGCTGGGGC GTGATCGCGA TGCCGCGGGT CGGGCAGGAA GTGCTCGTCA CGTATGTCGA CGGCGATCTC GATCGGCCGC TCGTGACGGG CATCGTCTAC AACGGCGAGA ACCCGACGCC TTATGACTTG CCGAAGGACA TCCGCTACAC GGGGCTGGTG TCGCGTTCGA TCAAGCGTGC GGGCGGTTAT CAGAACGCGA GTCAGATCAC GTTCGATGAC CAGCGCGGCG CGGAGCGCGT GATGATCCAC GCGGAGCGCG ACATGCAGCA GACGGTCGAG CGCAACAGCT CGACGTCGAT CGCGCAGGAT CTGAACCTGT CGGTGAAGGG CACGTCGACG TCGGTCGTCG GCATCTCGAT CAGCTTCACG GGCATCTCGG TGTCGTACAC GGGGTTGTCG GTGAGCTTCA CCGGCGTGTC GGCGAGCTTC ACGGGCTTGA GCACGTCGTT TACCGGCGTG AGCACGAGTT TCACCGGCGT GTCGACGTCG TTTACCGGCG TCGACACCAG CTTCAAGGGC GTGTCGACGT CGTTTACCGG TGTCGATACC AGCTTCAAGG GGGTGTCGAC TTCGTTCACG GGCGTAAGCA CGAGCCTCAC CGGTTCGAGC AACTCGGTGA CGGGCGTATC GAACAGCATG ACGGGCATCT CGTCATCATG GACCGACGTC AGCATGTCGA CGACCGGCCA ATCCCAAAGC ATCACGGGGG TATCGCTGTC GTACACGGGC ACGTCGAACA GCATGACGGG CACGAGCACG TCGGTGACGG GCACCTCGAC GAGCATCACC GGCACGTCGA TGTCGAACAC CGGCAGCTCG ACGAGCATCA CGGGCACGTC GATGTCGACG ACGGGCAGCT CGACGAGCGT CACCGGCTCG AGCGTATCGA CGACCGGCAG CTCGGTGAGC ACGACGGGCT CGAGCGTGTC GACAACGGGC AGCAGCGTGT CGACGACAGG TTTCAGCTTC TCGTACACCG GCGTCAGCTA TTCGGACACC GGTATCGACC TGAAGAAGGT CGGCATGCAA GTGAAGAGCT GA
|
Protein sequence | MRLIELRSPL LDPDAVALSF VVHESLSQEP SYQLDLLSHD PDLDFDALLG STLSADIDLG EGDIRTFNTH VFGGYDTGQM SGQYTYTLEL RSWLSFLAEN RNSRIFQNMS VPQIVEQVFQ GHQRNGYRFE LEGTYEPREY CVQFQETDLN FVKRLLEDEG IYFWVEHEPD RHVVVISDTQ RFEDLPLPND TLEYLPDGEE SRAIQGREGV QRLQRTRRIK SNNVALRDFD YHAPSNKLDS DAQQVSPPNL EGIPLEYYDY AAGYREPEQG ERLARLRLEA IQAESHTLVG EANARALATG RAFTLIGHPA LGRNRRYYVT NSELTFIQDG PDSTSQGRNV AVKFRALADD QPFRPLLTTP RPEVPGIQSA TVVGPEMSEV HTDKLGRIRV HFHWDRYKTT EADASCWIRV SQAWAGKGWG VIAMPRVGQE VLVTYVDGDL DRPLVTGIVY NGENPTPYDL PKDIRYTGLV SRSIKRAGGY QNASQITFDD QRGAERVMIH AERDMQQTVE RNSSTSIAQD LNLSVKGTST SVVGISISFT GISVSYTGLS VSFTGVSASF TGLSTSFTGV STSFTGVSTS FTGVDTSFKG VSTSFTGVDT SFKGVSTSFT GVSTSLTGSS NSVTGVSNSM TGISSSWTDV SMSTTGQSQS ITGVSLSYTG TSNSMTGTST SVTGTSTSIT GTSMSNTGSS TSITGTSMST TGSSTSVTGS SVSTTGSSVS TTGSSVSTTG SSVSTTGFSF SYTGVSYSDT GIDLKKVGMQ VKS
|
| |