Gene Information |
Locus tag | BURPS668_2142 |
Symbol | |
ID | 4884723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2132026 |
End bp | 2134911 |
Gene Length | 2886 bp |
Protein Length | 961 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640128070 |
Product | Rhs element Vgr protein |
Protein accession | YP_001059177 |
Protein GI | 126441245 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria; [COG4253] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein; [TIGR03361] type VI secretion system Vgr family protein |
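
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2132026
end_bp = 2134911
gene_length_bp = 2886
protein_length_aa = 961

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("codons minus stop match Protein Length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the table: 2134911 - 2132026 + 1 = 2886 bp, and 2886 / 3 - 1 = 961 aa.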
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGTCGCT TCTCTGACGG TAGCTCGAAC CTGAGAGAAG GACGACAAGA AATGGTACTC AGACCACCCG CAATCGGGGA CCTGAAGAAT CGCGACCTGT ACGAGGCCGT TCATCGCGGT TTCTTGCAGC ACGACCGGCT GTTGATGCTG GATACGCCAC TCGGCAAGAA TGCGTTGACT CCGTTGCGGG CGCGCGGATC GTCCGGGATT GGCGGTGACT ACCATTGGAC AATCGACGTT GCGTCGCTGC GTGACGATAC GGCGCTGCTG TCGCTGATGC ATCAGCCCGT CACCCTGTGG ATTCAAGAGC GTACAGCGCT GTACGCAGAT TCGGTCTATC GACCCGTACA TGGCTTCGTG CATCAAGTTG GATATTTGGG CGGGGACGGT AGCGTATCCA CGTACCAGCT TGAATTTTCA TCCGCGCTGA TTTTCCTGAG CAAGACCCAC AACGATGAAG GGTGGCTCGA AAAGGACGCG CGCGAAATCA TTTCCGACGT ATTCAACCGC TACCCGCAGC TGCAGGGGCG GTTCCGGTTC GACCTGACGC GCGAACCTGC GGTGCGCTCA TGGTGCCGTC AGAGCGAATC TGATCTTCAT TTCGTGCACC GCCTTCTCGA AGATGAAGGC ATGTATTTCC GTTGGGTCCA CGAACAGACG AAGGAAGGCG AGCCGCCGAA AACGACGCTC GTTATCGTGG ACCGCGTATC GTCGCTGCCG GAAGCAAAAC CAGCCGAATA TTACCGTGGC AACACTGACC ACGAAGCAGA CGGGTTCACG CAGTGGGCGG TCATGCAGAC CATGCAGAGC CTGCGTTATA TGTCGCGTGC GTTCGACTAT AAGCGCCCGA CATCCCATTT CCAGACGGAA AGCGCCCTTC AATCCACGAC GTACGCAACG GACGGCGGGC GGCAGTCGGA ATCGCGCAGC ATCCCTGCTG CACCGATGAC GATCTATCAG CCGACAGCGT ACGGCTACCC CGATTCAGAC AGCGGCGAAG GCCGTGCCCG TCGTCGTGTT GAAGAGTGGG ATTCGCGTGC GCGTCGCTAT TTCGGTGTTG GCGGTCTGCG ATGGCTCGAT GCCGGATCGC GATTCACGTT AGACAATCAT CCGCGTCATC CCGACAGCGA TCCGAAGAAG CGGGAATTTC TCGTGATCGA GGCGCGTTGG TTCATCGAAA ACAATGTGTC GATTGGTCAA CAAGCGACGG AGTATCCGCG CAGTCTGCGC GCGACGCTGG CCGAGCAGCA GGCAGCACAC GGGACGCGCT TCAAGACGCC GGAACACCCG CAAGACGGAA CAGCGGGATT TTTCGTCATC GAGGTTGAGG CGCAAGAGGC GAACATCGAA TATCGCAGCC CGCTCGCGCA TCCCAAGCCG AATATGGCTA TCGAGCATGC GATCGTTGTG ACGCAGCACG GATCGGAAGC ATGGACGAAC GAGCGGAACC AGGTGCGTGT GCACTTCGCT TGGGACCGGA AAAATCCGGA CGGCACGTTC GCATCGTCGC CGCTGCTATC GACCATGCAG GCTGATACTG GCAACGGCTA CGGCTCGGTG CACGTGCCAC GTGCCGGTGA ATGGGTTTTA ATCGCCTATT GGGCAAACGA TTGCGACAAA CCGTTCATTT TGGGGCGCGT CAATGGCGGT ACGACGCCTT CTCCGTGGCA TTCGAATGTG CTGCTGTCGG GATTTCAGTC ACAGGGGTTC GGCGGAACGG GCGCGTTCAA CGCGTTCGTG CACGATGACG CCACGAACCA AGGCGGTACG CGGCTCGTCA GCTACACAGG CAGCAGCTAT 
GCATCGATCG CGCAGGGATA CTTGATTCAG CAGAGCGGTA ACTCGCGCGG GCGGTATCTC GGTTCGGGAT TGCTGTTGCA CGCCGATCAT TACGCATCGG TACGCGGTAG CCGTGGCGTG TCGATCAGTG CGCATCCCGT GTCGCGCGAC AGTGATCAAC TCGATGTTGA CGAAGCACGC GAGCAGCTTA CACGCTCAAA AGACCTGCTC GGCAGCATTT CGGATGCGAG CGAACAGCAT CAGGCCGAGA GCCTGAAACT CGGTGTCGAT GCGCTGGCGA CGTTCACCGA CGCAACGAAG CAAGCGGCAT CCGGCGAATC ATCCGGAGGT CGGACGGCAG GCGGCGGCAC GGGTAACGCG AACGCGTTCG CTGAGCCGCT TTTGCTGCTC GGCAGTCCGG CCGGTATTGG CCTGTCCACA CATCAGTCGC TACACGCGAG CGCCGACCAA CAGGCGAACT GGATCAGCGG ACAGGATTCA TATTTTGCGG CGGGTGGATC GATGCACGCG GCTGCGGTAA ATCACCTGAG CCTGTTCGCG CGCAACCAGG GGATCAAGGC TGTCGCTGGT AAGGGCAAGG TCGAAATACA GGCGCAGGCT GGTGATCTGG AAATGATCGC GCAGCAGCTT GTGAAATTGC TGTCGGTAGC CGGACGGATG GAAATTGCGG CTGATCAGGA ACTAGTCCTG TACTGCGGCG GCGCGACGAT CCGCATCAAG GGCGGCAACG TCTCGATACA CGCACCGGGC AATGTTGACT TCAAGGGCGC GTCGTTCAGC TTCGCGGGGC CGGTCAGTGA GTCCTACGCG ATGCCGCAGT TCAAACCGTC GTATCAGGCG CGGTATGTCT TGAAGAATCA GACAGACGGT ACACCGATGA TTCGACACGC TTACGAAATG AAGCTGCCAT CCGGGCGAAC GGTGCTCGGC CACACGAACG ACCTTGGCGA GACAGTGCCG GTTTTCACGC CGAGCGCACA GGATGTGCCG TTGCAAGCCG CAAAGGCGAA GCCTGCTCAG GTCGAATCGT GGCAGTTTGC GGGCTCGGAC AAACCGATGA TTCAACGCGA CTACTTGGAG GACTGA
|
Protein sequence | MCRFSDGSSN LREGRQEMVL RPPAIGDLKN RDLYEAVHRG FLQHDRLLML DTPLGKNALT PLRARGSSGI GGDYHWTIDV ASLRDDTALL SLMHQPVTLW IQERTALYAD SVYRPVHGFV HQVGYLGGDG SVSTYQLEFS SALIFLSKTH NDEGWLEKDA REIISDVFNR YPQLQGRFRF DLTREPAVRS WCRQSESDLH FVHRLLEDEG MYFRWVHEQT KEGEPPKTTL VIVDRVSSLP EAKPAEYYRG NTDHEADGFT QWAVMQTMQS LRYMSRAFDY KRPTSHFQTE SALQSTTYAT DGGRQSESRS IPAAPMTIYQ PTAYGYPDSD SGEGRARRRV EEWDSRARRY FGVGGLRWLD AGSRFTLDNH PRHPDSDPKK REFLVIEARW FIENNVSIGQ QATEYPRSLR ATLAEQQAAH GTRFKTPEHP QDGTAGFFVI EVEAQEANIE YRSPLAHPKP NMAIEHAIVV TQHGSEAWTN ERNQVRVHFA WDRKNPDGTF ASSPLLSTMQ ADTGNGYGSV HVPRAGEWVL IAYWANDCDK PFILGRVNGG TTPSPWHSNV LLSGFQSQGF GGTGAFNAFV HDDATNQGGT RLVSYTGSSY ASIAQGYLIQ QSGNSRGRYL GSGLLLHADH YASVRGSRGV SISAHPVSRD SDQLDVDEAR EQLTRSKDLL GSISDASEQH QAESLKLGVD ALATFTDATK QAASGESSGG RTAGGGTGNA NAFAEPLLLL GSPAGIGLST HQSLHASADQ QANWISGQDS YFAAGGSMHA AAVNHLSLFA RNQGIKAVAG KGKVEIQAQA GDLEMIAQQL VKLLSVAGRM EIAADQELVL YCGGATIRIK GGNVSIHAPG NVDFKGASFS FAGPVSESYA MPQFKPSYQA RYVLKNQTDG TPMIRHAYEM KLPSGRTVLG HTNDLGETVP VFTPSAQDVP LQAAKAKPAQ VESWQFAGSD KPMIQRDYLE D
|
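
The Gene sequence and Protein sequence rows can be related to each other with a short script. This is a minimal sketch, assuming the gene sequence is printed in coding orientation (already reverse-complemented for the minus strand) and that an alternative initiator codon such as GTG is read as methionine; the helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical and not part of any database tooling.

```python
# Codon layout shared by NCBI translation tables 1 and 11:
# the first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Common bacterial initiator codons; table 11 also lists a few rarer ones.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Drop the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # The initiator codon is read as methionine even when it is not ATG.
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
head = "GTGTGTCGCT TCTCTGACGG TAGCTCGAAC"
print(translate_cds(head))
print(gc_percent(head))
```

Applied to the first three blocks, the translation gives MCRFSDGSSN, the first block of the Protein sequence row. Passing the complete Gene sequence text to `gc_percent` gives a value to compare with the GC content field.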