Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0127 |
Symbol | |
ID | 4888440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 108223 |
End bp | 111009 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640130068 |
Product | putative Rhs element Vgr protein |
Protein accession | YP_001061133 |
Protein GI | 126444328 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCAATT TGAACGATAC GCTACGCAAT TTTGCGTCGG GGGCGGTCGA CTGGAATAAA CGTCCGGTCG CGTTGCACTT TGGCGCCGCG CAGGCCGCGC TGGGCCACCT CCTCGCGCTG CAGCACGCCA GTGTTCAGGA AGGCCTGATG ACCGGGATCC ACGGCCGATT GACCTGCGTG TCGACCCGCC GCGATCTTCC GCCCGGCGTG TTGCTCGGCA TTCCGGTTTC GATCCGGCTC ATTACCGACC GCGGACAGCC GCACACGGTG AACGCGATCA TCAGCGACGT CCAGATCGGC CAAAGCGACG GCGAGCTCTG TGTGTACCAG CTGACGGTCT GCGACGCGCT GTCGCTGATG GACAAGCGCA CCAATTCGCG GGTCTTCCGA AAGCGCAGCG TCATCGAGGT GCTCGCCACG CTGTTCAACG AATGGCAGCA GCGCAGCCCG GCCCTCGCGC GCGCGTTCGA ATTCGATCTG TCCGGCTTGC GCGCCGATCG CTATCCGCCC CGCGAGCTGA CCCGGCAGGT CAACGAATCG GATGCGCATT TCGTGCGCCG TCTGCTGCGC CGCGAAGGGA TCACCGTGTT CGCGAAGGCG GGGCCGGCGA AGGGCGAACG GCCGTTGCAG GGCGACGCGC CCGTGCACAC GCTCGTGTGC TGTGACGATC CGATGTCGTT GCCGCAAGCG CCGGCCGGCA CGGTCCGCTT GCATCCGCGC GACGGCGGCG CCGCGCAGCG CGACACGGTC ACGCTGTTCG CGCTGCGTCG GCAATTGGCG CCCGGCAAGG CCGGGCGCCC GTCGTGGGAC TACAAGAAGG CGCGGATCGA CGAATCGAGC GTCGCTTCGA GCCTCGATCA GGGCGAGGCG GGCAACGATC TGGCGAAGCT GCTGACCGAC ATCGCGATCG ACATTGCGCA CGCGGGCGAT TCATGGCGCG ATCACGAGCG GCTCACCCGC GCGCGCATGC TCGCGCACGA GTTCGAAGCC GAGCGCCATG ACGGCGTCAG CAGCGTGCGG GATCTGGCCG TGGGCGCGTG GATCACGCTG ACGGGCGATC CGGACTGGGA CAGGCAACTC GCCGACAAGC GCCAGTTCGT GATCACGTCG ATCGATCACG ACATCTGGAA CAACCTGCCG AAGGGGCTCA ACGAGCGCGT GCACGCGCTG TTCGCCGCGA GCCGCAATCT CGCGTGCGCG CCCCGCGCGC TGCCGTCCGC GCTGGCGAAC GACGCGGATA CCCGCTACGA GAACACGTTC ACGTGCGTGC GCCGCGGCGT GCCGCTTGCG CCCGCGTACG ATCCGCAAGC CGATTTGCCG CCCGCGCATC TGCTCACGGG CACGATTGTC GGCGCGGAGG GCGAAGAAGT GTTCTGCGAC GAAGACGGCC GGGTGCGCGT GCGGTTGCAC GGCCTCGATC CGGCGGATCA CGCGCACGCG CAGGGCGCGG GCACCAACGG CAACGCGGGC GACAGCGCGC CGATCCGCGT GGCGTCGAGC CTCGCCGGCG CCTATTTCGG CGCATCGTTT CTGCCGCGCG TCGGCATGGA AGTCCTCCTC GGGTGTCTCG GCGGCGATCC GGACCGGCTG GTGATCATCG GCGTGCTCGG TAACGGCGCG CATCCGCCGG CGACGTTCAG CCACGCGGGC GGGCTGCCGG GCAATCGCTA CCTGTCGGGC ATCAAGACGA AGGAGATTCG TGGGCAACGG TACAACCAGC TGCGTCTCGA CGACACGCCG AACCAGATCA GCGCGCAACT GGCGAGCGAG CACGCGCATT CGCAGCTCAA TCTCGGATAT CTGACGCAAC CGCGCGAGAA CGGCCACGGG AACGACCGCG GCGAGGGCGT GGAGTTGCGT ACCGACGCGG CGGCGGCGCT GCGGGCGGCG CAAGGCATGC TGCTGACGAC CTACGCGCGC ACGCAGGCGA GCGGCGGGCA ACTGGACCGT GACGAGCTGA TTCGGTTGCT CGGCGAATGC GCGGAGCTGT TCAAGGCGCT GGGCGACTAC GCGGGGCAGC ACGGCGGGCA GGCCGCGGAT ACGGCCGGCC AGCACGCGGT GGCCGCCGCG TTCAAGCGCT GGGCGCCGGG CACGGGCACG GACGGCGCCG ATGCGCCGTC CGACGGCGCA GCGCGCGCGC TGATGGCGTT CGGCGCGCAG GCCGGTTCGG TGAACGTCAC GCCGAAGACG CATGTGACGT ATGCCGGCGA GAACATCGAT CAGGTCGCGC AGCAGCACCT GCAACTGATG AGCGGCCAGC GGCTGAACGC GACGGCCGGG CAGGGCATGC AGCTCTTCGC GCGGGGCGCG GGGGTGCAGG CCGTGGCGGG CGAAGGGCCG ATGCTGCTGC AGGCGCAAGC CGGCACGCTG ACGGCGAACG CGCAGAAGGG CGTCAAGATC ACGACGAACG AGCACGAGGT GTTCGTGAGC GCGCCGAAGA TTCGGCTCGT TGCCGAGGAC GGCAGCTACC TCGAGCTCGG CGGCGGCATC ACGCTCGGCA CGAACGGCGA CATCAAGCTG CTGTCGGCGT CGCACCAGTG GGGCGGGCCG TCGACCGCGC AGGCGGCGAA GAGCGGGTTC GGCAATCAGC CGACGGATCA GCGTTTCAAG CTGCACTATC CGGGCGAGGA CGGCGATTTG CAGGCGGCGG CGAACAAGCG GTTCCGGATC ACGCTGGACG ACGGGCGCGT CATCGAAGGC AAGTCCGACG CGAGCGGCCT GACGGATCTG GTCAAGGACG ACGCGATGCG TATCGCGAAG ATCGACTATC TGAAACCGAA GCTCTGA
|
Protein sequence | MTNLNDTLRN FASGAVDWNK RPVALHFGAA QAALGHLLAL QHASVQEGLM TGIHGRLTCV STRRDLPPGV LLGIPVSIRL ITDRGQPHTV NAIISDVQIG QSDGELCVYQ LTVCDALSLM DKRTNSRVFR KRSVIEVLAT LFNEWQQRSP ALARAFEFDL SGLRADRYPP RELTRQVNES DAHFVRRLLR REGITVFAKA GPAKGERPLQ GDAPVHTLVC CDDPMSLPQA PAGTVRLHPR DGGAAQRDTV TLFALRRQLA PGKAGRPSWD YKKARIDESS VASSLDQGEA GNDLAKLLTD IAIDIAHAGD SWRDHERLTR ARMLAHEFEA ERHDGVSSVR DLAVGAWITL TGDPDWDRQL ADKRQFVITS IDHDIWNNLP KGLNERVHAL FAASRNLACA PRALPSALAN DADTRYENTF TCVRRGVPLA PAYDPQADLP PAHLLTGTIV GAEGEEVFCD EDGRVRVRLH GLDPADHAHA QGAGTNGNAG DSAPIRVASS LAGAYFGASF LPRVGMEVLL GCLGGDPDRL VIIGVLGNGA HPPATFSHAG GLPGNRYLSG IKTKEIRGQR YNQLRLDDTP NQISAQLASE HAHSQLNLGY LTQPRENGHG NDRGEGVELR TDAAAALRAA QGMLLTTYAR TQASGGQLDR DELIRLLGEC AELFKALGDY AGQHGGQAAD TAGQHAVAAA FKRWAPGTGT DGADAPSDGA ARALMAFGAQ AGSVNVTPKT HVTYAGENID QVAQQHLQLM SGQRLNATAG QGMQLFARGA GVQAVAGEGP MLLQAQAGTL TANAQKGVKI TTNEHEVFVS APKIRLVAED GSYLELGGGI TLGTNGDIKL LSASHQWGGP STAQAAKSGF GNQPTDQRFK LHYPGEDGDL QAAANKRFRI TLDDGRVIEG KSDASGLTDL VKDDAMRIAK IDYLKPKL
|
| |