Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1580 |
Symbol | |
ID | 4882441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1544522 |
End bp | 1547329 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640127508 |
Product | Rhs element Vgr protein |
Protein accession | YP_001058621 |
Protein GI | 126439104 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTGA TCACGCAATT CACGTCGCTT TTCAGCGCTG CCAATTGTCT CTATTGCCTC GACGGTCCGG GCGAAATCGC GTCGCTGCAG ATCGAGCGCT GGGTGGGGCG GGAGACGCTC TCCGAGAACT TCCAGTGGGA TGTCTACGCG CTCGCGACCG ATCCCGGCCT CGACCTCGAC GCGATGCTCG GGCAGCGCGT GACGATCCGC ACGGCGCTCG CGAACGGCAC GAGCGCCGTG CGCAGCGGGC TCGTGGCGCA GGCCGAGTGC ATCGGCTACG ACGCCGGCCT CGCGCGCTAC TGCCTGCAGC TCGTGCCGTG GCTCGCCGCG CTCGCGCACG GCCGCGACGA GCGCACGTTC GTGCGGCAGG GCGTCGCCGA CGTGCTCGGC GCGGTGTTCG CGCCGTACGG CGCGATCGCC CGCTGGCGCT TCACCGCGGA CGCGAACCAG CGCATCGCGG AACTCGGCGC GCGCGACTAC CGCGTGCAGT ACCGCACCCA CACGCATTTC GATTTCGTCC GGCACGTGCT TGCCGAAGCG GGTCTCGGCT TTTGCTTCGT CGAGGAGGCG GACGCGCCCG CGGGCCACAC GATGCTGATC TTCGACGACA GCACGCAGTT GCCCGAGGAC GAGACTTCCG CGCGCATGGG CGGCGTGCCG CAGCGGCTGA GCGGCACGGC GACCGAGCCG GACGACGTGA TCGTCGGCAT CGGGCAGTCG CTGTCGCTGA CCGCGGATCG CGTCACGCTG ATCAGCAGCG ACTACCGCGG CAACCAGTCG ACGAGCGCGA CGGCGAGCCT CGGCAATCCG GCCGGATCGC GCGAGCTGTA CGACGACGTC GGGCCGGAAG CGTTCGACAG CCTGCGCGAG GCCGACGCCG CCGCGCGGCG GCACGCGGAC GCGATCGTCT CGGCGGCGCG CTTCTGGACA GGCTACGGCA CGTTGCGCAC CGCGCGCGCC GGCCGCGCGC TGCGCATCGC GGGCGCGACG TGGCGCATGT CGCGCGGCGG CGCGCAGGCG CCCGATGCGT TCGTGTTGAC GCGCGTCGAT CAGATCGGCA TCAACAATTT GCCCGCGACG GTGATGGAGC GCGTCGAGCG CAGCCTCGGC CCGCTGCCGC CCGCGAATCT CGACGCGCGC GTGCTCGCGC AGGCGAAGGC GGGCGGCTAT GCGAACCGGT TCGACGCCGC GCCGCGCGAC CTGGCATGGC GCCCGACGCT GGAGGACGGC ACCGGCCAGC GACTGAACCC GGTGCCGACG GCGCTCGGCG CGCAGACGGC GATCGTCGTC GGGCCGCAGG GCGAGACGCG GCCGGGCTCG ACGGGCCCGG TGCATACCGA CGCGCAGGGC CGGTTGCGGC TGCGCTATCA CTGGCAGGCG GACGGCGACG CGGGCACCTA TCCGACGCGC GCGATGCAGC GGCTCGCGAG CGAGGGGCAC GGGCTGCAGC AGACGCCGCG GATCGGCCAC GAAGTGCTCG TGCAGTTCGT CAACGGCCTC GTGCACCGGC CGATCGTGCT CGGCGGGCTG TTCAACGGCC GCGGCGAGGG CGGCGAAGCG CCGACGCCCG GCGGCGAAGC GGGGCAGCCG CTCGACGAAT CGGTCTACGC ACAGGCGAGC GACCACGCGG CGAGCGCGCA GGGCAATCTT GCGGGCGGCC ATAGCCCCGC GTGGCACGGC GCGGGCGGCG GCGCGAAGCG CCACGATCAC GCGGGCGCGC TGTCCGGCTT CAAGAGCCAG GGGTTCGACG GCCAGGGCTA CAACCAGCTC GTCGCGGACG ATACCGATCG CATGGGCCGC ATGCAGATGG CGACGACGCA CGCGGCGACC CAGCTCAACA TCGGCCACTT GCGGCATCAG GCCGACAACT ATCTCGGCAG CTTCCGCGGG CAAGGCGTCG AGCTGCGCTC GGACGCGTAC GGCGCGCTGC GCGGCGCGCG CGGCGTGCTG ATCTCCAGCT ATGCGCCGAC GGGGCCGTCG CAGCCGGCGG GCGATGCGTC GGCGCTGCAG TCGCTGCTCG CGCAACAGGC GGCGCTCGCG AAGCTTCTCG ACAAATCGGC GGACACGCAC AAGACCTTGC CGTTCGCCGC GCAGCGTGGC GCGCAGCGGG CCGGACAGTC GTCGATGAAC GGCGACGCCG CGCCGCTCGA CGCGCTCGGG AAGAGTTTCG CGACGACGGT CGGCGCCGAC GGCTACGCGC AGGCGTCGGC CGACGCATCG CGCCGCGCGA CGGGCAATGC GCTGCCGCAT ACCGGCGACG CGGTGCTCGG CGTCGAGGCG AAGGGCGGCC AGGGGCTGCT GGCCGGCCAG TCGCTGCAAT GGGCGGCGGG CGAGACGCTG ACCATCGGCA GCGGCAACGA CACCAACCTC GCGGTCAACC GGACGTTGCG CGTGCACAGC GGGCAGGGCA TCGGCTGGCT TGCCGGCGCG AATGGCGCGA ACGGCGCGCA GGGCGTCGGC ATGAGCGTGA TCGCCGGCAA GGACGATCTG GCGCTGCAGG CGCAGCACGA CCTGATGGCG CTGCGCGCGC GCGACGATAT GAGGGTCGCG TCGATCGGCG CGGATGTCGA GATCGCGGCG AAGCAGACCG TGCGGCTCGC GGTCAGCGGC GGCGCGCATC TGACGATCGA AGGCGGCAAC GTCACGTTCG GCTGCCCGGG CTCGCTCGTC GTTCATGCGG CCCAGCATAC GTTCGTCGGG CCGGATCAAC TGGCTCCCGC GCTGCCGCAG TTTCCCGAGA AGGTGTGCGT CGAATGCCTG AAGCACGCGA TGCAATCGGG CAGCGCACTG GCCGGCAAGG CGATCTGA
|
Protein sequence | MSVITQFTSL FSAANCLYCL DGPGEIASLQ IERWVGRETL SENFQWDVYA LATDPGLDLD AMLGQRVTIR TALANGTSAV RSGLVAQAEC IGYDAGLARY CLQLVPWLAA LAHGRDERTF VRQGVADVLG AVFAPYGAIA RWRFTADANQ RIAELGARDY RVQYRTHTHF DFVRHVLAEA GLGFCFVEEA DAPAGHTMLI FDDSTQLPED ETSARMGGVP QRLSGTATEP DDVIVGIGQS LSLTADRVTL ISSDYRGNQS TSATASLGNP AGSRELYDDV GPEAFDSLRE ADAAARRHAD AIVSAARFWT GYGTLRTARA GRALRIAGAT WRMSRGGAQA PDAFVLTRVD QIGINNLPAT VMERVERSLG PLPPANLDAR VLAQAKAGGY ANRFDAAPRD LAWRPTLEDG TGQRLNPVPT ALGAQTAIVV GPQGETRPGS TGPVHTDAQG RLRLRYHWQA DGDAGTYPTR AMQRLASEGH GLQQTPRIGH EVLVQFVNGL VHRPIVLGGL FNGRGEGGEA PTPGGEAGQP LDESVYAQAS DHAASAQGNL AGGHSPAWHG AGGGAKRHDH AGALSGFKSQ GFDGQGYNQL VADDTDRMGR MQMATTHAAT QLNIGHLRHQ ADNYLGSFRG QGVELRSDAY GALRGARGVL ISSYAPTGPS QPAGDASALQ SLLAQQAALA KLLDKSADTH KTLPFAAQRG AQRAGQSSMN GDAAPLDALG KSFATTVGAD GYAQASADAS RRATGNALPH TGDAVLGVEA KGGQGLLAGQ SLQWAAGETL TIGSGNDTNL AVNRTLRVHS GQGIGWLAGA NGANGAQGVG MSVIAGKDDL ALQAQHDLMA LRARDDMRVA SIGADVEIAA KQTVRLAVSG GAHLTIEGGN VTFGCPGSLV VHAAQHTFVG PDQLAPALPQ FPEKVCVECL KHAMQSGSAL AGKAI
|
| |