Gene Information |
Locus tag | BTH_II0863 |
Symbol | |
ID | 3844580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1009425 |
End bp | 1012463 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637838166 |
Product | Rhs element Vgr protein |
Protein accession | YP_439060 |
Protein GI | 83717090 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein; [TIGR03361] type VI secretion system Vgr family protein |
|
|
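The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon, both of which appear consistent with the values listed. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 1009425
end_bp = 1012463
gene_length_bp = 3039
protein_length_aa = 1012


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one, assuming the CDS ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
coords_match = span_length(start_bp, end_bp) == gene_length_bp
protein_match = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_match, protein_match)
```

Under these assumptions, End bp minus Start bp plus one gives the gene length, and the number of codons minus the stop codon gives the protein length. The strand field does not affect either calculation.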
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0201208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCGT CCCATCGACA CTACGCCGAC ACGGCGCTCG CGGATGTCGC GGCGCTCACG GATGCCGCGT CGCGCGCCGA CGCCGCGCCG CTCGCGAACG CGCGACGCTT CACGTTCGCG AGCACCGCGT ACGACGCTGC CACGTTCGAC GTCGTCGACA TCGACGGCCG CGACGCGATC TCGCAGCCGT ACCGGTTCGA GATCACGCTC GTGAGCAGGA GTGTGCGGAT CGACTTCGCG AAGATGCTGA GTTGCGAGGC GACGCTCGCG ATCCTGCCGC CGTTCGGTGA GGCCGGCACG ACCCGCTATG CCGGCGTGCT CGCCGAATTC GAGCAGAAGG AACGCTTTCG CGACTTCACC GTCTATCGCG CGGCGCTCGT GCCGCGCCTC TGGCGCCTCT CGCTGTACAA GGCGTCGGAC GTCTACCTGA ACGAGCAGAC GATTCCCGAT ATCGTCAAGC GCGTGCTGCG CGCCGCGTCG TTCGGCAAGC GCAATTTCCG CATGCGGCAC CGCGGCGTCT ACCGCAAGCG CAGCTTCGTC TGCCAGTACG ACGAGAGCCA TCTCGATTTC GTGTCGCGCT GGATGGAGAA GGAAGGCCTC TACTACTACT TCGAGCATGA CGGCCGACGC GAAAAGCTCG AGATCGTTGA CGACCGCCGC GACCAGCCCG GCCCCGCCGA CGATCTCGCG CTGCGCTACC TACCCGCCAC CTGTCTCGAC GCGGGCATCG AATCGGACCG CGTGCAGGCG TTCGCATGCC GCGCGACGCC GCTGCCGCGC GAGGTCGTGC TGCGCGATTT CAACCACCGC AAGGCGGAGC TGTCGCTCGA AGTCCGCGAG CACGTGGCGC ACGACGGCGT CGGCGAGCGG GTATCGAGCG ACGAGCATTT CCACACGAAG GACGAAGGGC GGCGCTACGC GAAGCTGCGC GCCGAGGCGC TCGTTTGCGA AGGCCGCCGT TTCGCCGGCG AATCGACCGC GGCCGGGCTG CGCGCGGGCC GCTTCTTCGC GCTCTCGGGC CACTACCGCA AGGACTTCGA CGGCCGCTAT CTGGTGACGG CGGTCACGCA TCGCGGCTCG CAGGCGCACC TGCTGTTTCC CGATCTCGAC GCGCCGTTCG GCGCGACGCC GGGCGAGCCC GTCTACCGCG CCGAGTTCGA GGCGATCGCC GCCAACCTCC AGTACCGGCC GCCGCGCACG ACGCCGAAGC CGCGCGCGGC GGGCGTCGTC AGCGCGATCG TCGACGGCGA GGGCAGCGGC AAGCGCGCCG AACTCGACGA ACACGGCCAG TACAAAGTGC GCTTTCCGTT CGCGCACACC GCGCATCCGA CGAACAAGGC CTCCGCGCGC ATCCGGATGG CGACGCCCTA TGCGGGCGAC GACCGCGGCA TGCATCTGCC GCTTCTGAAG CGCACCGAAG TGAAGATCGC ATTCGACGGC GGCGATCCGG ACCGCCCCGT GATCGTCGGC GCGGTGCCCA ACTCGTCGCA CCGCAGCGTC GTCACGCGCA GCAACCCCGA CGCGCACCGG ATCCTCACCG AGCACAACCA GCTCTACATG AAGGACGGCA GCGGCGCGGC GACGTGGCTG CACGCGCCGA ACAACCACAT CGGCATCGGC GCGGTCGGGC CGGGCGACGG CCTCGCGCTC CTCACGTCCG GCAACAAGTT CGATTTCTCG CTCGGCAACG CGTACAGCTT CTCGGGCGGG CTCAAGTGCT CGGTGTCGAT GGGCGGCAAC ACCGACATCT ACGTCGGCGT GCGCAACAGC CTCGACGTCA GCGCGAACTT CCTGACGACG 
CTGCAGGGCA ACCTGCGCTG GATGCTGCCC GGCAGCCGAA GCTTCGAGAT CAACGACAGC GCGTCCACGC TGCTGCAGAC GCTGCACAAG CAGTCCGCGA CGGGCGCGAT CCGGCTGTCC GCCGGGCAGG ACGCGTCCGC GCTGCTGCAA AAGCAGCTCG ACAAGCTCAA GGGCACGGTG CGCAAGTTCA TGATCGTGTC GGGCCTCGCG AACGCCGGAG CCGCGGCCAC CGCTGCGGGG CTCATCAAGG GCGGCGGCGC GCTCGCCGAT CTGCCGTGGG CGGGCTTCGG CGTGTCCGCC GCGCAGTTCG CCGGCGCGAC CGGCTTCAGC ACGGCGCTGA TGGCGACCTC GCGCACGCTG CTCTCGAAAA TCGCGAAGCT CCAGGAGGCG TTGCCGCTCG TCGCCGATCT ATCGCTCGAC AAGCAAGGCA TCGCGCTCGC GGCGAAGAAC CTCACGCACG CGACGCGGAT GTCGCTCACC GTCGACGGCG TCTCGTGGTC GACGCACGCG AAGGGGCCGG GCGCGGCAGG CGCCGCGATG AGCGTCGGCA AGGGCCGCTG GGGCGTCGAA GCGGCGGAGC ACGCGCATGT CCACGCGAAT GACACGCTGC TGTTCGCCGT GCCGGCCGAC CCAACGAGCA AGTTCGACCT CAAGGAGCTG ATCGGGCTGC GCCGCGATCT CGACGAATGC GTGAAGGGTA TCGCCGATCT CGAAGCCGAC ATTTCGGAAA ACGAAGTGCT TTCGACCGAT CAGAACACGT TCGGCGTCGG CGCGCTCGTG CCCACGCCGC CGTCGCCCGC CAATGCGGTC GCGGCGGTCG CGATCAAGGC GAAGGAAGCG AAGCTCGTCG AGCTGAACGC CAAACGCAAG CTCGTCGCGA CGAAGATCGA CAACCTCCAG CAGAAGCTCG CGAAGCACGC GAAGAACCTG AGCGCCGCGC GGATGAGCGC GTCGGACGCG GAAGTCGGCT TCAAGGGCAA CCGGCTCGTC GCGACGGCCG AAGGCGTCAC GCTCGCGCAT GCGCAGGGCA AGGCGAAGCT CGACGTGCGC GAGGCGAAGA TCGGCGTCGA GGCAGGCAAA TCGAGCCTCG AGCTCGACGA GAGCAAGCTC GCGGCCGGCT GCGGCGGCGC CTCGCTGAAG CTGGGCAGCG ACGGCGCGAT CGACGTGCGC GCGACTAACG TCAAGCTGAA CGGCAGCGCG TCGCTGAAGC TCGACGGACA GTTGATCCAG CTAGGCTGA
|
Protein sequence | MSSSHRHYAD TALADVAALT DAASRADAAP LANARRFTFA STAYDAATFD VVDIDGRDAI SQPYRFEITL VSRSVRIDFA KMLSCEATLA ILPPFGEAGT TRYAGVLAEF EQKERFRDFT VYRAALVPRL WRLSLYKASD VYLNEQTIPD IVKRVLRAAS FGKRNFRMRH RGVYRKRSFV CQYDESHLDF VSRWMEKEGL YYYFEHDGRR EKLEIVDDRR DQPGPADDLA LRYLPATCLD AGIESDRVQA FACRATPLPR EVVLRDFNHR KAELSLEVRE HVAHDGVGER VSSDEHFHTK DEGRRYAKLR AEALVCEGRR FAGESTAAGL RAGRFFALSG HYRKDFDGRY LVTAVTHRGS QAHLLFPDLD APFGATPGEP VYRAEFEAIA ANLQYRPPRT TPKPRAAGVV SAIVDGEGSG KRAELDEHGQ YKVRFPFAHT AHPTNKASAR IRMATPYAGD DRGMHLPLLK RTEVKIAFDG GDPDRPVIVG AVPNSSHRSV VTRSNPDAHR ILTEHNQLYM KDGSGAATWL HAPNNHIGIG AVGPGDGLAL LTSGNKFDFS LGNAYSFSGG LKCSVSMGGN TDIYVGVRNS LDVSANFLTT LQGNLRWMLP GSRSFEINDS ASTLLQTLHK QSATGAIRLS AGQDASALLQ KQLDKLKGTV RKFMIVSGLA NAGAAATAAG LIKGGGALAD LPWAGFGVSA AQFAGATGFS TALMATSRTL LSKIAKLQEA LPLVADLSLD KQGIALAAKN LTHATRMSLT VDGVSWSTHA KGPGAAGAAM SVGKGRWGVE AAEHAHVHAN DTLLFAVPAD PTSKFDLKEL IGLRRDLDEC VKGIADLEAD ISENEVLSTD QNTFGVGALV PTPPSPANAV AAVAIKAKEA KLVELNAKRK LVATKIDNLQ QKLAKHAKNL SAARMSASDA EVGFKGNRLV ATAEGVTLAH AQGKAKLDVR EAKIGVEAGK SSLELDESKL AAGCGGASLK LGSDGAIDVR ATNVKLNGSA SLKLDGQLIQ LG
|
| |