Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2032 |
Symbol | |
ID | 4903319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 2001981 |
End bp | 2005004 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640145137 |
Product | Rhs element Vgr protein |
Protein accession | YP_001076065 |
Protein GI | 126457275 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCGT CCCCCCGCCA CGATGCGCCC GCGTCGCGCG CGAACGCCGC GCCGTCCGCG AACGCGCGAC GCTTCACGTT CGCGAGCGAC GCGTACGACC CCGCGACGTT CGACGTCGTC GACATCAACG GCCGCGACGC GATCTCGCAG CCGTACCGGT TCGAGATCAC GCTCGTGAGC AGGCAGTTGC GGATCGACTT CGCGAAGATG CTGAGCCGCG GGGCGACGCT CGCGATCCTG CCGCCGTTCG GCGAGGCCGG CACCACCCGC TATGCCGGCG TGCTCGCCGA ATTCGAGCAG AAGAAGCGCT TTCGCGACTT CACCGTCTAT CGCGCGACGC TCGTGCCGCG CCTCTGGCGA CTATCGCTGT ACAAGGCGTC GGACGTCTAT CTGAACGAGC AGACGATTCC CGACATCGTC AAGCGCGTGC TGCGCGCCGC CTCGTTCGGC AGCCGCGATT TCCGCATGCG GCACGGCGGC GGCTACCGCA AGCGCAGCTT CGTCTGCCAG TACGACGAGA GCCATCTCGA TTTCGTGTCG CGCTGGATGG AGAAGGAAGG CCTCTACTAC TACTTCGAGC ATGACGGCCG GCACGAAACG CTCGTGATCG TCGACGACCG CCGCCATCAG CCCGGCCCCG CCGACGATCT CGCGCTGCGC TACCGACCCG CGACCAGCCT CGACGCGGGC ATCGAAGCGG ACCGCGTGCA GGCGTTCACA TGCCGGGCGA CGCCGCTGCC GCGCGAAGTC GTGCTGCGCG ATTTCAACCA CCGCAAGGCG GAGCTCTCGC TCGAAGTCCG CGAGCGCGTG GCGCGCGACG GCGTCGGCGA GCGGGTGTCG AGCGACGAGC ACTTCCACAC GAAGGACGAA GGGCGGCGCT ACGCGAAGCT GCGCGCCGAG GCGCTCGTCT GCGAAGGGCG CCGATTCGCC GGCGAATCGA CCGCTGCCGG GCTGCGCGCC GGCCGCTTCT TCGCGCTGTC GGGCCACTAC CGCGAGGATT TCGACGGCCG CTATCTGGTG ACGGCGCTCA CGCATCGCGG CTCGCAGGCA CACCTGCTGT TTCCCGATCT CGACGCGCCG TTCGGCGCGA CGCCGGGCGA GCCCATCTAC CGCGCCGAGT TCGAGGCGAT TGCCGCCGAC CTCCAGTACC GGCCGCCGCG CACGACGCCC AAGCCGCGCG CGGCGGGCGT CGTCAGCGCG ATCGTCGACG GCGAGGGCGG CGGCAAGCTC GCCGAGCTCG ACGAATACGG CCAGTACAAG GTGCGCTTTC CGTTCGCGCA CACCGCGCAT CCGGCGAACA AGGCCTCCGC GCGCATCCGG ATGGCGACGC CCTACGCGGG CGACGACCGC GGCATGCACC TGCCGCTGCT GAAGCGCACC GAAGTGAAGA TCGCATTCGA CGGCGGCGAT CCGGACCGCC CCGTGATCGT CGGCGCGGTG CCCAACTCGT CGCACCGCAG CGTCGTCACG CGCCGCAACC CCGCCGAGCA TCGCATCCTC ACCGAGCACA ACCAGCTCTA CATGAAGGAC GGCAGCGGCG CGGCGACGTG GCTGCACGCG CCGAACAACC ACATCGGCAT CGGCGCGGTC GGGCCGGGCG ACGGCCTCGC GCTCCTCACG TCCGGCAACA AGTTCGACTT CTCGCTCGGC AACGCGTACA GCTTCTCGGG CGGGCTCAAG TGCTCGGTGT CGATGGGCGG CAACACCGAC GTCTACGTCG GCGTGCGCAA CAGCCTCGAC GTCAGCGCGA ACTTCCTGAC GACGCTGCAG GGCAACCTGC GCTGGATGCT GCCCGGCAGC CGGAGCTTCG AGATCAACGA CAGCGCATCG ACACTGCTGC AGACGCTGCA CAAGCAGTCC GCGACGGGTG CGATCCGGCT GTCCGCCGGG CAGGACGCGT CCGCGCTGCT GCAAAAGCAG CTCGACAAGC TCAAGGGCAC GGTGCGCAAG TTCATGATCG TGTCGGGCCT CGCGAACGCC GGGTCCGCGG CCGCCGCCGC GGGGCTCATC AAGGGCGGCG GCAAGCTCGC CGATCTGCCG TGGGCGGGCT TCGGCATATC TGCCGCGCAG TTCGCCGGCG CGACCGGCGT CAGCACGGCG CTGATGGCGA CCTCGCGCAC GCTGCTCGCG AACGTCGCGA AGCTCCAGGA GGCGTTGCCG CTCGTCGCCG ATCTGTCGCT CGGCAAGCAG GGCATCGCGC TCGCCGCGAA GAACCTCACG CACGCGACGC GGATGTCGCT CACCGTCGAC GGCGTCTCGT GGTCGACGCA CGCGAAGGGG CCCGGCGCGG CGGGCGCCGC GATGAGCGTC GGCAAGGGCC GCTGGGGCGT CGAAGCGGCG AAGCACGCGC ACGTCCACGC GAGCGACACG CTGCTGTTCG CGGTGCCGGC CGACCCGACG ACCCAGTTCG ACCTGAAGGA CCTGATCGGG CTGCGCCGCG ATCTCGACGA ATGCATGAAG GACATCGCCG ATCTCGAAGC CGACATTTCG GAAAACGAAG TGCTGTCGAC CGATCAGAAC ACGTTCGGCG TCAGCGCGCT CATCCCCACG CCGCCGTCGC CCGCCGGCGC GCTCGCGGCG GTCGCGATCA AGGTGAAGCA AGCGAAGCTC GTCGAGCTGA AGGCCAAGCA GAAGCTCGTC GCGCTGAAGG TCGACAACCT GCAGCAGAAG TTCGCGAAGC ACGTGCAGCA CCTGAGCGCC GTGCGGATGA GCGCTTCCGA CGCGCAGCTC GGCTTCAAGG GCAACCGGCT CGTCGCGACG GCCGACGGCG TCACGCTCGC GCATGCGCAG GGCAAGGCGA AGCTCGACGT GCGCGAAGCG AAGATCGGCG TCACGGCGGG CAAATCGAGC GTCGAGCTCG ACGAAGGCAA GATCGCGGCC GGCTGCGGCA GCGCATCGCT GAAGCTCGGC AGCGACGGCG CGATCGACGT GAGCGCGACC GACGTCAAGC TGAACGGCAC CAACGTCAAG CTGAACGGCA GCGCGTCGCT GAAGTTCGAC GGCCAACTGA TCCAGCTCGG CTGA
|
Protein sequence | MPSSPRHDAP ASRANAAPSA NARRFTFASD AYDPATFDVV DINGRDAISQ PYRFEITLVS RQLRIDFAKM LSRGATLAIL PPFGEAGTTR YAGVLAEFEQ KKRFRDFTVY RATLVPRLWR LSLYKASDVY LNEQTIPDIV KRVLRAASFG SRDFRMRHGG GYRKRSFVCQ YDESHLDFVS RWMEKEGLYY YFEHDGRHET LVIVDDRRHQ PGPADDLALR YRPATSLDAG IEADRVQAFT CRATPLPREV VLRDFNHRKA ELSLEVRERV ARDGVGERVS SDEHFHTKDE GRRYAKLRAE ALVCEGRRFA GESTAAGLRA GRFFALSGHY REDFDGRYLV TALTHRGSQA HLLFPDLDAP FGATPGEPIY RAEFEAIAAD LQYRPPRTTP KPRAAGVVSA IVDGEGGGKL AELDEYGQYK VRFPFAHTAH PANKASARIR MATPYAGDDR GMHLPLLKRT EVKIAFDGGD PDRPVIVGAV PNSSHRSVVT RRNPAEHRIL TEHNQLYMKD GSGAATWLHA PNNHIGIGAV GPGDGLALLT SGNKFDFSLG NAYSFSGGLK CSVSMGGNTD VYVGVRNSLD VSANFLTTLQ GNLRWMLPGS RSFEINDSAS TLLQTLHKQS ATGAIRLSAG QDASALLQKQ LDKLKGTVRK FMIVSGLANA GSAAAAAGLI KGGGKLADLP WAGFGISAAQ FAGATGVSTA LMATSRTLLA NVAKLQEALP LVADLSLGKQ GIALAAKNLT HATRMSLTVD GVSWSTHAKG PGAAGAAMSV GKGRWGVEAA KHAHVHASDT LLFAVPADPT TQFDLKDLIG LRRDLDECMK DIADLEADIS ENEVLSTDQN TFGVSALIPT PPSPAGALAA VAIKVKQAKL VELKAKQKLV ALKVDNLQQK FAKHVQHLSA VRMSASDAQL GFKGNRLVAT ADGVTLAHAQ GKAKLDVREA KIGVTAGKSS VELDEGKIAA GCGSASLKLG SDGAIDVSAT DVKLNGTNVK LNGSASLKFD GQLIQLG
|
| |