Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1615 |
Symbol | |
ID | 4903595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 1582879 |
End bp | 1585665 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640144721 |
Product | putative Rhs element Vgr protein |
Protein accession | YP_001075649 |
Protein GI | 126458074 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein [TIGR03361] type VI secretion system Vgr family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.491638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCAATT TGAACGATAC GCTACGCAAT TTTGCGTCGG GGGCGGTCGA CTGGAATAAA CGTCCGGTCG CGTTGCACTT TGGCGCCGCG CAGGCCGCGC TGGGCCACCT CCTCGCGCTG CAGCACGCCA GTGTTCAGGA AGGCCTGATG ACCGGGATCC ACGGCCGATT GACCTGCGTG TCGACCCGCC GCGACCTTCC GCCCGGCGTG TTGCTCGGCA TTCCGGTTTC GATCCGGCTC ATTACCGACC GCGGACAGCC GCACACGGTG AACGCGATCA TCAGCGGCGT CCAGATCGGC CAAAGCGACG GCGAGCTCTG TGTGTACCAG CTGACGGTCT GCGACGCGCT GTCGCTGATG GACAAGCGCA CCAATTCGCG GGTCTTCCGA AAGCGCAGCG TCATCGAGGT GCTCGCCACG CTGTTCAACG AATGGCAGCA GCGCAGCCCG GCCCTCGCGC GCGCGTTCGA ATTCGATCTG TCCGGCTTGC GCGCCGATCG CTATCCGCCC CGCGAGCTGA CCCGGCAGGT CAACGAATCG GATGCGCATT TCGTGCGCCG TCTGCTGCGC CGCGAAGGGA TCACCGTGTT CGCGAAGGCG GGGCCGGCGA AGGGCGAACG GCCGTTGCAG GGCGATGCGC CCGTGCACAC GCTCGTGTGC TGTGACGATC CGATGTCGTT GCCGCAAGCG CCGGCCGGCA CGGTCCGCTT GCATCCGCGC GACGGCGGCG CCGCGCAGCG CGACACGGTC ACGCTGTTCG CGCTGCGTCG GCAATTGGCG CCCGGCAAGG CCGGCCGCCC GTCGTGGGAC TACAAGAAGG CGCGGATCGA CGAATCGAGC GTCGCTTCGA GCCTCGATCA GGGCGAGGCG GGCAACGACC TGGCGAAGCT GCTGACCGAC ATCGCGATCG ACATTGCGCA CGCGGGCGAT TCATGGCGCG ATCACGAGCG GCTCACCCGC GCGCGCATGC TCGCGCACGA GTTCGAAGCC GAGCGCCATG ACGGCGTCAG CAGCGTGCGG GATCTCGCCG TGGGCACATG GATCACGCTG ACGGGCGATC CGCAATGGGA CAGGCAACGC GCCGACAAGC GTCAGTTCGT GATCACGTCG ATCGATCACG ACATCTGGAA CAACCTGCCG AAGGGGCTCA ACGAGCGCGT GCACGCGCTG TTCGCCGCAA GCCGCAATCT CGCGTGCGCG CCCCGCGCGC TGCCGTCCGC GCTGGCGAAC GACGCGGATA CCCGCTACGA GAACACGTTC GCGTGCGTGC GCCGCGGCGT GCCGCTTGCG CCCGCGTACG ATCCGCAAGC CGATTTGCCG CCCGCGCATC TGCTCACGGG CACGATTGTC GGCGCGGAGG GCGAAGAAGT GTTCTGCGAC GAAGACGGCC GGGTGCGCGT GCGGGTGCAC GGCCTCGATC CGGCGGATCA CGCGCACGCG CAGGGCGCGG GCACCAACGG CAACGCGGGC GACAGCGCGC CGATCCGCGT GGCGTCGAGC CTTGCCGGCG CCCATTTCGG CGCATCGTTC CTGCCGCGAG TCGGCATGGA AGTCCTCCTC GGGTGTCTCG GCGGCGATCC GGACCGGCTG GTGATCATCG GCGTGCTCGG TAACGGCGCG CATCCGCCGG CGACGTTCAG CCACGCGGGC GGGCTGCCGG GCAACCGCTA CCTGTCGGGC ATCAAGACGA AGGAGATTCG TGGGCAACGG TACAACCAGC TGCGTCTCGA CGACACGCCG AACCAGATCA GCGCGCAACT GGCGAGCGAG CACGCGCATT CGCAGCTCAA TCTCGGATAT CTGACGCAAC CGCGCGAGAA CGGCCACGGG AACGACCGCG GCGAGGGCGT GGAGTTGCGT ACCGACGCGG CGGCGGCGCT TCGGGCGGCG CAAGGCATGC TGCTGACGAC CTACGCGCGC ACGCAGGCGA GCGGCGGGCA ACTGGACCGT GACGAGCTGA TTCGGTTGCT CGGCGAATGC GCGGAGCTGT TCAAGGCGCT GGGCGACTAC GCGGGGCAGC ACGGCGGGCA GGCCGCGGAT ACGGCCGGCC AGCACGCGGT GGCCGCCGCG TTCAAGCGCT GGGCGCCGGG CACGGGCACG GACGGCGCCG ATGCGCCGTC CGACGGCGCA GCGCGCGCGC TGATGGCGTT CGGCGCGCAG GCCGGTTCGG TGAACGTCAC GCCGAAGACG CATGTGACGT ATGCCGGCGA GAACATCGAT CAGGTCGCGC AGCAGCACCT GCAACTGATG AGCGGCCAGC GGCTGAACGC GACGGCCGGG CAGGGCATGC AGCTCTTCGC GCGGGGCGCG GGGGTGCAGG CCGTGGCGGG CGAAGGGCCG ATGCTGCTGC AGGCGCAAGC CGGCACGCTG ACGGCGAACG CGCAGAAGGG CATCAAGATC ACGACGAACG AGCACGAGGT GTTCGTGAGT GCGCCGAAGA TTCGGCTCGT TGCCGAGGAC GGCAGCTACC TCGAGCTCGG CGGCGGCATC ACGCTCGGCA CGAACGGCGA CATCAAGCTG CTGTCGGCGT CGCACCAGTG GGGCGGGCCG TCGACCGCGC AGGCGGCGAA GAGCGGGTTC GGCAATCAGC CGACGGATCA GCGTTTCAAG CTGCACTATC CGGGCGAGGA CGGCGATTTG CAGGCGGCGG CGAACAAGCG GTTCCGGATC ACGCTGGACG ACGGGCGCGT CATCGAAGGC AAGACCGACG CGAGCGGCCT GACGGATCTG GTCAAGGACG ACGCGATGCG TATCGCGAAG ATCGACTATC TGAAGCCGAA GCTCTGA
|
Protein sequence | MTNLNDTLRN FASGAVDWNK RPVALHFGAA QAALGHLLAL QHASVQEGLM TGIHGRLTCV STRRDLPPGV LLGIPVSIRL ITDRGQPHTV NAIISGVQIG QSDGELCVYQ LTVCDALSLM DKRTNSRVFR KRSVIEVLAT LFNEWQQRSP ALARAFEFDL SGLRADRYPP RELTRQVNES DAHFVRRLLR REGITVFAKA GPAKGERPLQ GDAPVHTLVC CDDPMSLPQA PAGTVRLHPR DGGAAQRDTV TLFALRRQLA PGKAGRPSWD YKKARIDESS VASSLDQGEA GNDLAKLLTD IAIDIAHAGD SWRDHERLTR ARMLAHEFEA ERHDGVSSVR DLAVGTWITL TGDPQWDRQR ADKRQFVITS IDHDIWNNLP KGLNERVHAL FAASRNLACA PRALPSALAN DADTRYENTF ACVRRGVPLA PAYDPQADLP PAHLLTGTIV GAEGEEVFCD EDGRVRVRVH GLDPADHAHA QGAGTNGNAG DSAPIRVASS LAGAHFGASF LPRVGMEVLL GCLGGDPDRL VIIGVLGNGA HPPATFSHAG GLPGNRYLSG IKTKEIRGQR YNQLRLDDTP NQISAQLASE HAHSQLNLGY LTQPRENGHG NDRGEGVELR TDAAAALRAA QGMLLTTYAR TQASGGQLDR DELIRLLGEC AELFKALGDY AGQHGGQAAD TAGQHAVAAA FKRWAPGTGT DGADAPSDGA ARALMAFGAQ AGSVNVTPKT HVTYAGENID QVAQQHLQLM SGQRLNATAG QGMQLFARGA GVQAVAGEGP MLLQAQAGTL TANAQKGIKI TTNEHEVFVS APKIRLVAED GSYLELGGGI TLGTNGDIKL LSASHQWGGP STAQAAKSGF GNQPTDQRFK LHYPGEDGDL QAAANKRFRI TLDDGRVIEG KTDASGLTDL VKDDAMRIAK IDYLKPKL
|
| |