Gene Information |
Locus tag | BURPS1106A_A0100 |
Symbol | |
ID | 4905828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 88347 |
End bp | 91127 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640143207 |
Product | Rhs element Vgr protein |
Protein accession | YP_001074143 |
Protein GI | 126457812 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
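
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. The constant and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 88347
END_BP = 91127
GENE_LENGTH_BP = 2781
PROTEIN_LENGTH_AA = 926


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the CDS minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2781
print(expected_protein_length(GENE_LENGTH_BP))  # 926
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed above.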
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCAATT TGAACGATAC GCTACGCAAT TTTGCGTCGG GGGCGGTCGA CTGGAATAAA CGTCCGGTCG CGTTGCACTT TGGCGCCGCG CAGGCCGCGC TGGGCCACCT CCTCGCGCTG CAGCACGCCA GTGTTCAGGA AGGCCTGATG ACCGGGATCC ACGGCCGATT GACCTGCGTG TCGACCCGCC GCGACCTTCC GCCCGGCGTG TTGCTCGGCA TTCCGGTTTC GATCCGGCTC ATTACCGACC GCGGACAGCC GCACACGGTG AACGCGATCA TCAGCGGCGT CCAGATCGGC CAAAGCGACG GCGAGCTCTG TGTGTACCAG CTGACGGTCT GCGACGCGCT GTCGCTGATG GACAAGCGCA CCAATTCGCG GGTCTTCCGA AAGCGCAGCG TCATCGATGT GCTCGCCACG CTGTTCAACG AATGGCAGCA GCGCAGCCCG GCCCTCGCGC GCGCGTTCGA ATTCGATCTG TCCGGCTTGC GCGCCGATCG CTATCCGCCC CGCGAGCTGA CCCGGCAGGT CAACGAATCG GATGCGCATT TCGTGCGCCG TCTGCTGCGC CGCGAAGGGA TCACCGTGTT CGCGAAGGCG GGGCCGGCGA AGGGCGAACG GCCGTTGCAG GGCGACGCGC CCGTGCACAC GCTCGTGTGC TGTGACGATC CGATGTCGTT GCCGCAAGCG CCGGCCGGCA CGGTCCGCTT GCATCCGCGC GACGGCGGCG CCGCGCAGCG CGACACGGTC ACGCTGTTCG CGCTGCGTCG GCAATTGGCG CCCGGCAAGG CCGGGCGCCC GTCGTGGGAC TACAAGAAGG CGCGGATCGA CGAATCGAGC GTCGCTTCGA GCCTCGATCA GGGCGAGGCG GGCAACGATC TGGCGAAGCT GCTGACCGAC ATCGCGATCG ACATTGCGCA CGCGGGCGAT TCATGGCGCG ATCACGAGCG GCTCACCCGC GCGCGCATGC TCGCGCACGA GTTCGAAGCC GAGCGCCATG ACGGCGTCAG CAGCGTGCGG GATCTCGCCG TGGGCACATG GATCACGCTG ACGGGCGATC CGCAATGGGA CAGGCAACGC GCCGACAAGC GTCAGTTCGT GATCACGTCG ATCGATCACG ACATCTGGAA CAACCTGCCG AAGGGGCTCA ACGAGCGCGT GCACGCGCTG TTCGCCGCGA GCCGCAATCT CGCGTGCGCG CCCCGCGCGC TGCCGTCCGC GCTGGCGAAC GACGCGGATA CCCGCTACGA GAACACGTTC GCGTGCGTGC GCCGCGGCGT GCCGCTTGCG CCCGCGTACG ATCCGCAAGC CGATTTGCCG CCCGCGCATC TGCTCACGGG CACGATTGTC GGCGCGGAGG GCGAAGAAGT GTTCTGCGAC GAAGACGGCC GGGTGCGCGT GCGGGTGCAC GGCCTCGATC CGGCGGATCA CGCGCACGCG CAGGGCGCGG GCACCAACGG CAACGCGGGC GACAGCGCGC CGATCCGCGT GGCGTCGAGC CTCGCCGGCG CCTATTTCGG CGCATCGTTT CTGCCGCGAG TCGGCATGGA AGTCCTTCTC GGGTGTCTCG GCGGCGATCC GGACCGGCTG GTGATCATCG GCGTGCTCGG TAACGGCGCG CATCCGCCGG CGACGTTCAG CCACGCGGGC GGGCTGCCGG GCAACCGCTA CCTGTCGGGC ATCAAGACGA AGGAGATTCG TGGGCAACGG TACAACCAGC TGCGTCTCGA CGACACGCCG AACCAGATCA GCGCGCAACT GGCGAGCGAG CACGCGCATT CGCAGCTCAA TCTCGGATAT 
CTGACGCAAC CGCGCGAGAA CGGCCACGGG AACGACCGCG GCGAGGGCGT GGAGTTGCGT ACCGACGCGG CGGCGGCGCT GCGGGCGGCG CAAGGCATGC TGCTGACGAC CTACGCGCGC ACGCAGGCGA GCGGCGGGCA ACTGGACCGT GACGAGCTGA TTCGGTTGCT CGGCGAATGC GCGGAGCTGT TCAAGGCGCT GGGCGACTAC GCGGGGCAGC ACGGCGGGCA GGCCGTGGAT ACGGCCGGCC AGCACGCGGT GGCCGCCGCG TTCAAGCGCT GGGCGCCGGG CACGGACGGC GCCGATGCGC CGTCCGACGG CGCAGCGCGC GCGCTGATGG CGTTCGGCGC GCAGGCCGGT TCGGTGAACG TCACGCCGAA GACGCATGTG ACGTATGCCG GCGAGAACAT CGATCAGGTC GCGCAGCAGC ACCTGCAACT GATGAGCGGC CAGCGGCTGA ACGCGACGGC CGGGCAGGGC ATGCAGCTCT TCGCGCGGGG CGCGGGGGTG CAGGCCGTGG CGGGCGAAGG GCCGATGCTG CTGCAGGCGC AAGCCGGCAC GCTGACGGCG AACGCGCAGA AGGGCGTCAA GATCACGACG AACGAGCACG AGGTGTTCGT GAGCGCGCCG AAGATTCGGC TCGTTGCCGA GGACGGCAGC TACCTCGAGC TCGGCGGCGG CATCACGCTC GGCACGAACG GCGACATCAA GCTGCTGTCG GCGTCGCACC AGTGGGGCGG GCCGTCGACC GCGCAGGCGG CGAAGAGCGG GTTCGGCAAT CAGCCGACGG ATCAGCGTTT CAAGCTGCAC TATCCGGGCG AGGACGGCGA TTTGCAGGCG GCGGCGAACA AGCGGTTCCG GATCACGCTG GACGACGGGC GCGTCATCGA AGGCAAGACC GACGCGAGCG GCCTGACGGA TCTGGTCAAG GACGACGCGA TGCGTATCGC GAAGATCGAC TATCTGAAGC CGAAGCTCTG A
|
Protein sequence | MTNLNDTLRN FASGAVDWNK RPVALHFGAA QAALGHLLAL QHASVQEGLM TGIHGRLTCV STRRDLPPGV LLGIPVSIRL ITDRGQPHTV NAIISGVQIG QSDGELCVYQ LTVCDALSLM DKRTNSRVFR KRSVIDVLAT LFNEWQQRSP ALARAFEFDL SGLRADRYPP RELTRQVNES DAHFVRRLLR REGITVFAKA GPAKGERPLQ GDAPVHTLVC CDDPMSLPQA PAGTVRLHPR DGGAAQRDTV TLFALRRQLA PGKAGRPSWD YKKARIDESS VASSLDQGEA GNDLAKLLTD IAIDIAHAGD SWRDHERLTR ARMLAHEFEA ERHDGVSSVR DLAVGTWITL TGDPQWDRQR ADKRQFVITS IDHDIWNNLP KGLNERVHAL FAASRNLACA PRALPSALAN DADTRYENTF ACVRRGVPLA PAYDPQADLP PAHLLTGTIV GAEGEEVFCD EDGRVRVRVH GLDPADHAHA QGAGTNGNAG DSAPIRVASS LAGAYFGASF LPRVGMEVLL GCLGGDPDRL VIIGVLGNGA HPPATFSHAG GLPGNRYLSG IKTKEIRGQR YNQLRLDDTP NQISAQLASE HAHSQLNLGY LTQPRENGHG NDRGEGVELR TDAAAALRAA QGMLLTTYAR TQASGGQLDR DELIRLLGEC AELFKALGDY AGQHGGQAVD TAGQHAVAAA FKRWAPGTDG ADAPSDGAAR ALMAFGAQAG SVNVTPKTHV TYAGENIDQV AQQHLQLMSG QRLNATAGQG MQLFARGAGV QAVAGEGPML LQAQAGTLTA NAQKGVKITT NEHEVFVSAP KIRLVAEDGS YLELGGGITL GTNGDIKLLS ASHQWGGPST AQAAKSGFGN QPTDQRFKLH YPGEDGDLQA AANKRFRITL DDGRVIEGKT DASGLTDLVK DDAMRIAKID YLKPKL
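
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The minimal sketch below translates only the first 30 nucleotides of the gene. Its codon table is deliberately partial and covers just the codons in that prefix, and the helper names are hypothetical.

```python
# Partial codon table: only the codons that occur in the first 30 nt of this gene.
# The full table has 64 entries; this subset is for illustration only.
CODONS = {
    "GTG": "V", "ACC": "T", "AAT": "N", "TTG": "L", "AAC": "N",
    "GAT": "D", "ACG": "T", "CTA": "L", "CGC": "R",
}

# A subset of the initiation codons allowed by translation table 11.
# When one of these is the first codon of a CDS it is read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading an allowed first codon as 'M'."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            protein.append("M")
        else:
            protein.append(CODONS[codon])
    return "".join(protein)


# First three 10-nt groups of the Gene sequence field.
gene_prefix = "GTGACCAATT TGAACGATAC GCTACGCAAT"
print(translate_prefix(gene_prefix))  # MTNLNDTLRN
```

The output matches the first ten residues of the Protein sequence field. For the full sequence, a complete implementation such as Biopython's `Seq.translate(table=11, cds=True)` would be the more practical choice, since it carries the whole codon table and validates start and stop codons.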