Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1168 |
Symbol | |
ID | 4885720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1107827 |
End bp | 1112944 |
Gene Length | 5118 bp |
Protein Length | 1705 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640131107 |
Product | surface-exposed protein |
Protein accession | YP_001062165 |
Protein GI | 126444937 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5422] RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAAGA TTTATCGCAA GGTTTGGAAC AAGGCGCGCG GCCAACTGGT CGTCGCGTCG GAACTGGCAT CCAGCCGTTC CGGTGTGGGA GAAGCTTCGG TCGACGCGGG GCGGTCTGGA GATCGGACAG CCTCGGCGGC ATTCGCCAGC GAGGAGCGCA AACCTGGCTC AGGCCGGATG ATTCCGCTTG CAATAGCCGT GGCTCTGACG TTTTCACCCT ACGCATGGGC GGGTGTCGGC GGAGCCGACA ACGGCGTGAC GGGGACGAGC AACAACGGCG GGGTAGGCGG CTCGTCCGGC GGCGGCGGTG TCCAGTTCAG CGACATGGGC GTGGCCTTTG TCGGCGATGG CGACTGCTCG ATGCTCACGT CCGGGCCGGG ATCGTATGCC GGCGTATACG GTTCGGGGAG CAATTATCTG GGCGGCCTGT TCGGCTTCGG CGCACAGACG TCGGCCGTCG GCTGGGGGAC GCCCAGCAAC GCCGGCGCCA ACAGCGGTAT CGTCCCATAC CAGGGCGCTG CTCAAACCTT CGGCAACGTC ACCTATGCCG GCAACGGCAC GCAGAGCGGC AACTTCACGC AGGCGTTCGG CCTGAATTCC TTTGCGGTCG GCTGCGGCGC TCACGCGACC GGCCTGAGCG CGACGGCGAT CGGCTGGGGA ACCACCGCGA GCGGCGCCGG AAGCGTCGCG CTCGGGCTGT ACAGCACCGC GAGCGGCCAG GGATCGTTGG CGTTCGGCAC CAGCGCGACG GCGACGGCCA CCGACACCAT CGCGCTCGGC ACGCTGGCCA CGGCCAACGC GGTCAGCGGC GTGGCGATCG GCGCGAACAC GCAGGCTTCG GCCGCCAATG CAACCGCGAT CGGCGGCAAT TCATCCGGCG CCAACCTCGG CGCGCAGGCG ACGGCGGCCG GCGCCACCGC GATCGGCGGC AACGCGACGG CCGGGGCCGC CGCGACGGCG ACGAACGCCA TCGCGATCGG CGGGCAGTCG TCGGCGAAGG ATGCGAACGA TGTGGCCGTG GGCGTGGGCG CGAGGGCCGG CACGGGAAGC GGCGCGGGCA ACGATCTCGC GATCGGCAAT GGCGCGACGG CCACGGGCGG CAATTCGATC GCCCAGGGCG CGGGCGCGAG CGCCAATGCG GCCGGCGCGG TGGCCATCGG CAAATCGGCG TCCGCCGCCG GCGGGCAAGC CGTTTCGATC GGCGTGGCCA ATACCGCGTC GGGCAACGGC GCGGTGGCGA TCGGCGATCC GAACGTCGCG ACCGGAACCG GCGCTGTCGC GCTGGGCAAC AACAATACGG CCAATGGTCA AGGCGCGGTG GCGCTGGGCA ACGTCAGCAC GGCGGTCGGC CAAGGCAGCG TGGCGCTCGG CAACAGCAGC AATGCGGCCG CGGCGGGCGG AGTGGCCTTG GGCGATACCG CGAGCGCGGT GATGGCGGGC GGCTTGGCAC TCGGCTCGCT CGCGACGGCG AGCAATGCGA ACGACGTGGC GCTCGGCGCG GGTTCGAAAA CCGCCGCGGC AGTCGCGACG TCCACGGTTT CGGTGAACGG CGCCAACTAC GCGGTGGCGG GAAGCGGCCC GGCCAGCACG GTCAGCGTGG GTGCGCCGGG CAGCGAGCGC ACGATCACCA ATGTGGCCGC GGGTCAAGTA AGCGCCGGTT CCACCGATGC GGTGAATGGT TCGGAACTGT ACGCGACGAA CCAGGCAATC ACGACCGGAT TGTCGACAGC GAACAGCAGC ATCGCGTCGC TGTCCACGTC GACGTCGACG GGTCTTTCGA GCGCCAACAG CAACATCGGC TCGTTGTCGA CGGGTTTGTC GACCGCCAAC AGCACGGTTG CGTCGTTGTC GACGTCCACG GTTGCCGGCC TGAATTCGCT GTCCACCGGA TTGAGCACGA CCAATAGCAA TGTCGCGTCG CTGTCGAGTT CCACGTCGAC GGGGCTGTCC TCGGCCAATA GCGCGGTGGC GTCGCTGTCC ACGTCGGCGT CGACGGGTCT TTCGAGCGCC AACAGCAACA TCGGCTCGTT GTCGACGGGC TTGTCGACCA CGAACAGCAC GGTTGCGTCG TTGTCGACGT CCACGGTTGC CGGCCTGAAT TCGCTGTCCA CCGGATTGAG CACGACCAAT AGCAATGTCG CGTCGCTGTC GAGTTCCACG TCGACGGGAT TGTCCTCGGC CAATAGCGCG GTGGTGTCGC TGTCCACGTC GGCGTCGACG GGTCTCTCGA GCGCCAACAG CAACCTCGGC TCGTTGTCGA CGGGTTTGTC GACCACGAAC AGCACGGTTG CGTCGTTGTC GAGCTCGACG TCGACCGGTA TCGGTTCGTT GTCGACGGGG GTGGCCAATT CTGTCCAGTA TGACAGTCCT GCTCATACGT CCATTACCCT GGGTGGCGCC AGTGCAACGT CACCCGTGAA GATCACTAAT TTGGCGGCGG GCGCGAACCC GAGCGATGCC GTCAACTATG AGCAACTGAC ATCGTTGTCG ACCTCGGCGT CGACGGGACT GTCGTCGGCC AACAGCGCGA TCACGTCGCT GTCCACCTCG ACGTCGACCG GCATCGGCTC GCTGTCCACC GGACTGAGCA CGACCAACAG CAACGTCGCG TCGTTGTCGA CCTCGGCGTC GACGGGACTG TCCTCGGCCA ACAGCGCGAT CACGTCGCTG TCCACCTCGA CGTCGACCGG CATCGGCTCG CTGTCCACCG GACTGAGCAC GACCAACAGC AACGTCGCGT CGTTGTCGAC CTCGGCGTCG ACGGGACTGT CCTCGGCCAA CAGCGCGATC ACGTCGCTGT CCACCTCGAC GTCGACCGGC ATCGGCTCGT TGTCCACCGG ACTGAGCACG ACCAACAGCA ACGTCGCGTC GTTGTCGACG TCGGCGTCGA CGGGACTGTC GTCGGCCAAC AGCGCGATCA CGTCGCTATC CACCTCGACG TCGACCGGCA TCGGCTCGCT GTCCACCGGG TTGAGCACGA CCAACAGCAA CGTCGCGTCG TTGTCGACGT CGGCGTCGAC GGGACTGTCG TCGGCCAACA GCGCGATCAC GTCGCTGTCC ACCTCGACGT CGACCGGCAT CGGTTCGCTG TCCACCGGAC TGAGCACGAC CAACAGCAAC GTCGCATCGT TGTCGACGTC GGCGTCGACA GGATTATCCT CGGCCAACAG CGCGATCACG TCGCTGTCCA CCTCGACGTC GACCGGCATC GGCTCGCTGT CCACCGGACT GAGCACGACC AACAGCAACG TCGCGTCGCT GTCGACGTCG GCGTCGACGG GACTGTCGTC GGCCAACAGC GCGATCACGT CGCTATCCAC CTCGACGTCG ACCGGCATCG GTTCGCTGTC CACCGGGTTG AGCACGACCA ACAGCAACCT GAGCTTCCTG TCCACGTCGA GCTCGACCGG CCTGAGTACG GCCAACAGCA ACATCTCGTC GCTGTCCACC GGGCTGAATT CGCTGTCGAC CGCGGTCAAC GGCGGCGGGA CGAAGTACTT CCACGCCAAC TCGACGCAGC CGGACAGTCA GGCGCTGGGG GCGGATTCCG TCGCGGTCGG GCCGGCGGCC ATCGCGGCGG GCGCAAGCGG CATTGCGATC GGCAATGCGG CGAACGCGGC CGCAAACGGC GCCGTCGCGA TCGGCCAGGC CGCCGTCGCG AAGGGCGGGC TGGCTGTCTC GATCGGGGTG TCGAACACGG CGAGCGGAGA CGGCGCGGTG GCGATCGGCG ATCCGAACGT CGCGACCGGC ACCGGCGCGG TCGCGCTTGG CGCGGACAAT TCGGCAAACG GCCAGGGCGC CGTCGCGCTC GGCAACGCGA ACATCGCAAC CGGAACGGGC TCGCTTGCGT TCGGCAACAC GTCGACGGCG GCAGCGGCGG GCGCGGTCGC GTTGGGCGCC GGCGCAATCG CGAACAATGC GAACGATGTC GCGCTGGGTT CCGCTAGCGT GACCGCGGCT GCGAATCCGG TGGCCAGCGC GTTGATCGCA GGTCAGGCTT ATTCGCTTGC CGGCGGCGCG CCGGCGAGCG TGGTGAGCGT CGGCGCGCCC GGCGCCGAAC GGCAAATCAC CAACGTCGCG GCCGGGCGGA TTTCCGCCAC GTCGACCGAT GCGGTGAACG GCTCGCAGAT GAATGCGATG ACTCAGGCGC TGGAATCGCT GTCGACTTCG ACGGCCAGCG CGCTGTCCAC GGCGCAAAGC GGTCTGGGTT CGTTGTCGAC GGGGCTCAGC TCGACGCAGA GCAGCGTGAG TTCGCTGTCG ACGGGGCTCA GCACGACGAG CGGCAGTGTG GCGTCGCTGT CGAGCGGTCT GGGCACGATG CAAAGCGGTA TCGCGTCGCT GTCCACGGGG CTGAGCACGA CGAACAGCAG CCTCGCGTCG CTGTCGACCG CCGTGTCCGG CGGCGGTGTT CGCACCAGCA GCTTGGGCGA CACGTCGGCG GGCAATGGCG CGAACGCGTC CGGCGGCAAC GGCACGGCGG TCGGCGGCGC CGCGTCCGCT TCGGGAACCG ATGCGACCGC GCTGGGCCAG GCGTCGAACG CGTCGGGCAA TCATTCGACC GCATTGGGGC AAGCATCGAG CGCGTCCGGA AGCGGCTCCA CCGCGGTGGG ACAAGGCGCC GGCGCGCCCG GCGACGGCGC TTCGGCATTC GGCCAAGGGG CACTTGCCTC CGGTACGGAC TCGACGGCGC TCGGCGCTCA TTCGACGGCT GCGGCGCCGA ACTCGGCGGC GATCGGCGCG AATTCGGTGG CGTCCGCGCC GAATTCGGTG TCGTTCGGTT CGCGGGGCCA TGAGCGCAGG CTGACGAATG TCGCGCCGGG GATCGACGGC ACCGACGCGG CGAACATGAA CCAGCTCTGG GGCGTGCAAT CGAGCGTCGA TCAGGCGGCG CGCCGCGCCT ATTCCGGGGT GGCGGCCGCG ACCGCGCTGA CGATGATTCC GGAGGTCGAC CCCGGCAAGA CGATCGCGGT CGGGATCGGC GCGGGCAGCT ATCAAGGGTA TTCGGCGTCC GCGATCGGCG TGTCCGTGCG GTTCTCCGAC AACCTGAAGG CGAAGCTCGG CGTGGGGATC AGCGCTCAGG GCAGCACATA TGGCGCAGGC GTCTCGTACC AGTGGTAG
|
Protein sequence | MNKIYRKVWN KARGQLVVAS ELASSRSGVG EASVDAGRSG DRTASAAFAS EERKPGSGRM IPLAIAVALT FSPYAWAGVG GADNGVTGTS NNGGVGGSSG GGGVQFSDMG VAFVGDGDCS MLTSGPGSYA GVYGSGSNYL GGLFGFGAQT SAVGWGTPSN AGANSGIVPY QGAAQTFGNV TYAGNGTQSG NFTQAFGLNS FAVGCGAHAT GLSATAIGWG TTASGAGSVA LGLYSTASGQ GSLAFGTSAT ATATDTIALG TLATANAVSG VAIGANTQAS AANATAIGGN SSGANLGAQA TAAGATAIGG NATAGAAATA TNAIAIGGQS SAKDANDVAV GVGARAGTGS GAGNDLAIGN GATATGGNSI AQGAGASANA AGAVAIGKSA SAAGGQAVSI GVANTASGNG AVAIGDPNVA TGTGAVALGN NNTANGQGAV ALGNVSTAVG QGSVALGNSS NAAAAGGVAL GDTASAVMAG GLALGSLATA SNANDVALGA GSKTAAAVAT STVSVNGANY AVAGSGPAST VSVGAPGSER TITNVAAGQV SAGSTDAVNG SELYATNQAI TTGLSTANSS IASLSTSTST GLSSANSNIG SLSTGLSTAN STVASLSTST VAGLNSLSTG LSTTNSNVAS LSSSTSTGLS SANSAVASLS TSASTGLSSA NSNIGSLSTG LSTTNSTVAS LSTSTVAGLN SLSTGLSTTN SNVASLSSST STGLSSANSA VVSLSTSAST GLSSANSNLG SLSTGLSTTN STVASLSSST STGIGSLSTG VANSVQYDSP AHTSITLGGA SATSPVKITN LAAGANPSDA VNYEQLTSLS TSASTGLSSA NSAITSLSTS TSTGIGSLST GLSTTNSNVA SLSTSASTGL SSANSAITSL STSTSTGIGS LSTGLSTTNS NVASLSTSAS TGLSSANSAI TSLSTSTSTG IGSLSTGLST TNSNVASLST SASTGLSSAN SAITSLSTST STGIGSLSTG LSTTNSNVAS LSTSASTGLS SANSAITSLS TSTSTGIGSL STGLSTTNSN VASLSTSAST GLSSANSAIT SLSTSTSTGI GSLSTGLSTT NSNVASLSTS ASTGLSSANS AITSLSTSTS TGIGSLSTGL STTNSNLSFL STSSSTGLST ANSNISSLST GLNSLSTAVN GGGTKYFHAN STQPDSQALG ADSVAVGPAA IAAGASGIAI GNAANAAANG AVAIGQAAVA KGGLAVSIGV SNTASGDGAV AIGDPNVATG TGAVALGADN SANGQGAVAL GNANIATGTG SLAFGNTSTA AAAGAVALGA GAIANNANDV ALGSASVTAA ANPVASALIA GQAYSLAGGA PASVVSVGAP GAERQITNVA AGRISATSTD AVNGSQMNAM TQALESLSTS TASALSTAQS GLGSLSTGLS STQSSVSSLS TGLSTTSGSV ASLSSGLGTM QSGIASLSTG LSTTNSSLAS LSTAVSGGGV RTSSLGDTSA GNGANASGGN GTAVGGAASA SGTDATALGQ ASNASGNHST ALGQASSASG SGSTAVGQGA GAPGDGASAF GQGALASGTD STALGAHSTA AAPNSAAIGA NSVASAPNSV SFGSRGHERR LTNVAPGIDG TDAANMNQLW GVQSSVDQAA RRAYSGVAAA TALTMIPEVD PGKTIAVGIG AGSYQGYSAS AIGVSVRFSD NLKAKLGVGI SAQGSTYGAG VSYQW
|
| |