Gene BURPS1710b_A0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0541 
Symbol 
ID3693622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp729560 
End bp732583 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content69% 
IMG OID637730795 
ProductRhs element Vgr protein 
Protein accessionYP_335700 
Protein GI76819027 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCGT CCCCCCGCCA CGATGCGCCC GCGTCGCGCG CGCACGCCGC GCCGTCCGCG 
AACGCGCGAC GCTTCACGTT CGCGAGCGAC GCGTACGACC CCGCGACGTT CGACGTCGTC
GACATCAACG GCCGCGACGC GATCTCGCAG CCGTACCGGT TCGAGATCAC GCTCGTGAGC
AGGCAGTTGC GGATCGACTT CGCGAAGATG CTGAGCCGCG GGGCGACGCT CGCGATCCTG
CCGCCGTTCG GCGAGGCCGG CACCACCCGC TATGCCGGCG TGCTCGCCGA ATTCGAGCAG
AAGAAACGCT TTCGCGACTT CACCGTCTAT CGCGCGACGC TCGTGCCGCG CCTCTGGCGG
CTGTCGCTGT ACAAGGCGTC GGACGTCTAT CTGAACGAGC AGACGATTCC CGACATCGTC
AAGCGCGTGC TGCGCGCCGC CTCGTTCGGC AGCCGCGATT TCCGCATGCG GCACGGCGGC
GGCTACCGCA AGCGCAGCTT CGTCTGCCAA TACGACGAGA GCCATCTCGA TTTCGTGTCG
CGCTGGATGG AGAAGGAAGG CCTCTACTAC TACTTCGAGC ATGACGGCCG GCACGAAACG
CTCGTGATCG TCGACGACCG CCGCCATCAG CCCGGCCCCG CCGACGATCT CGCGCTGCGC
TACCTACCCG TGACCAGCCT CGACGCGGGC ATCGAATCGG ACCGCGTGCA GGCGTTCACA
TGCCGGGCGA CGCCGCTGCC GCGCGAAGTC GTGCTGCGCG ATTTCAACCA CCGCAAGGCG
GAGCTCTCGC TCGAAGTCCG CGAGCGCGTG GCGCGCGACG GCGTCGGCGA GCGGGTGTCG
AGCGACGAGC ACTTCCACAC GAAGGACGAA GGGCAGCGCT ACGCGAAGCT GCGCGCCGAG
GCGCTCGTCT GCGAAGGGCG CCGATTCGCC GGCGAATCGA CCGCGGCCGG GCTGCGCGCC
GGCCGCTTCT TCGCGCTGTC GGGCCACTAC CGCGAGGATT TCGACGGCCG CTATCTGGTG
ACGGCGCTCA CGCATCGCGG CTCGCAGGCA CACCTGCTGT TTCCCGATCT CGACGCGCCG
TTCGGCGCGA CGCCGGGCGA GCCCATCTAC CGCGCCGAGT TCGAGGCGAT TGCCGCCGAC
CTCCAGTACC GGCCGCCGCG CACGACGCCC AAGCCGCGCG CGGCGGGCGT CGTCAGCGCG
ATCGTCGACG GCGAGGGCGG CGGCAAGCTC GCCGAGCTCG ACGAATACGG CCAGTACAAG
GTGCGCTTTC CGTTCGCGCA CACCGCGCAT CCGGCGAACA AGGCCTCCGC GCGCATCCGG
ATGGCGACGC CCTACGCGGG CGACGACCGC GGCATGCACC TGCCGCTGCT GAAGCGCACC
GAAGTGAAGA TCGCATTCGA CGGCGGCGAT CCGGACCGCC CCGTGATCGT CGGCGCGGTG
CCCAACTCGT CGCACCGCAG CGTCGTCACG CGCCGCAACC CCGCCGAGCA TCGCATCCTC
ACCGAGCACA ACCAGCTCTA CATGAAGGAC GGCAGCGGCG CGGCGACGTG GCTGCACGCG
CCGAACAACC ACATCGGCAT CGGCGCGGTC GGGCCGGGCG ACGGCCTCGC GCTCCTCACG
TCCGGCAACA AGTTCGACTT CTCGCTCGGC AACGCGTACA GCTTCTCGGG CGGGCTCAAG
TGCTCGGTGT CGATGGGCGG CAACACCGAC GTCTACGTCG GCGTGCGCAA CAGCCTCGAC
GTCAGCGCGA ACTTCCTGAC GACGCTGCAG GGCAACCTGC GCTGGATGCT GCCCGGCAGC
CGGAGCTTCG AGATCAACGA CAGCGCATCG ACACTGCTGC AGACGCTGCA CAAGCAGTCC
GCGACGGGCG CGATCCGGCT GTCCGCCGGG CAGGACGCGT CCGCGCTGCT GCAAAAGCAG
CTCGACAAGC TCAAGGGCAC GGTGCGCAAG TTCATGATCG TGTCGGGCCT CGCGAACGCC
GGGTCCGCGG CCGCCGCCGC GGGGCTCATC AAGGGCGGCG GCAAGCTCGC CGATCTGCCG
TGGGCGGGCT TCGGCATATC CGCCGCGCAG TTCGCCGGTG CGACCGGCGT CAGCACGGCG
CTGATGGCGA CCTCGCGCAC GCTGCTCGCG AACGTCGCGA AGCTCCAGGA GGCGTTGCCG
CTCGTCGCCG ATCTGTCGCT CGGCAAGCAG GGCATCGCGC TCGCCGCGAA GAACCTCACG
CACGCGACGC GGATGTCGCT CACCGTCGAC GGCGTCTCGT GGTCGACGCA CGCGAAGGGG
CCCGGCGCGG CGGGCGCCGC GATGAGCGTC GGCAAGGGCC GCTGGGGCGT CGAAGCGGCG
AAGCACGCGC ACGTCCACGC GAGCGACACG CTGCTGTTCG CGGTGCCGGC CGACCCGACG
ACCCAGTTCG ACCTGAAGGA CCTGATCGGG CTGCGCCGCG ATCTCGACGA ATGCATGAAG
GACATCGCCG ATCTCGAAGC CGACATTTCG GAAAACGAAG TGCTGTCGAC CGATCAGAAC
ACGTTCGGCG TCAGCGCGCT CATCCCCACG CCGCCGTCGC CCGCCGGCGC GCTCGCGGCG
GTCGCGATCA AGGTGAAGCA AGCGAAGCTC GTCGAGCTGA AGGCCAAGCA GAAGCTCGTC
GCGCTGAAGG TCGACAACCT GCAGCAGAAG TTCGCGAAGC ACGTGCAGCA CCTGAGCGCC
GTGCGGATGA GCGCTTCCGA CGCGCAGCTC GGCTTCAAGG GCAACCGGCT CGTCGCGACG
GCCGACGGCG TCACGCTCGC GCATGCGCAG GGCAAGGCGA AGCTCGACGT GCGCGAAGCG
AAGATCGGCG TCACGGCGGG CAAATCGAGC GTCGAGCTCG ACGAAGGCAA GATCGCGGCC
GGCTGCGGCA GCGCATCGCT GAAGCTCGGC AGCGACGGCG CGATCGACGT GAGCGCGACC
GACGTCAAGC TGAACGGCAC CAACGTCAAG CTGAACGGCA GCGCGTCGCT GAAGTTCGAC
GGCCAACTGA TCCAGCTCGG CTGA
 
Protein sequence
MPSSPRHDAP ASRAHAAPSA NARRFTFASD AYDPATFDVV DINGRDAISQ PYRFEITLVS 
RQLRIDFAKM LSRGATLAIL PPFGEAGTTR YAGVLAEFEQ KKRFRDFTVY RATLVPRLWR
LSLYKASDVY LNEQTIPDIV KRVLRAASFG SRDFRMRHGG GYRKRSFVCQ YDESHLDFVS
RWMEKEGLYY YFEHDGRHET LVIVDDRRHQ PGPADDLALR YLPVTSLDAG IESDRVQAFT
CRATPLPREV VLRDFNHRKA ELSLEVRERV ARDGVGERVS SDEHFHTKDE GQRYAKLRAE
ALVCEGRRFA GESTAAGLRA GRFFALSGHY REDFDGRYLV TALTHRGSQA HLLFPDLDAP
FGATPGEPIY RAEFEAIAAD LQYRPPRTTP KPRAAGVVSA IVDGEGGGKL AELDEYGQYK
VRFPFAHTAH PANKASARIR MATPYAGDDR GMHLPLLKRT EVKIAFDGGD PDRPVIVGAV
PNSSHRSVVT RRNPAEHRIL TEHNQLYMKD GSGAATWLHA PNNHIGIGAV GPGDGLALLT
SGNKFDFSLG NAYSFSGGLK CSVSMGGNTD VYVGVRNSLD VSANFLTTLQ GNLRWMLPGS
RSFEINDSAS TLLQTLHKQS ATGAIRLSAG QDASALLQKQ LDKLKGTVRK FMIVSGLANA
GSAAAAAGLI KGGGKLADLP WAGFGISAAQ FAGATGVSTA LMATSRTLLA NVAKLQEALP
LVADLSLGKQ GIALAAKNLT HATRMSLTVD GVSWSTHAKG PGAAGAAMSV GKGRWGVEAA
KHAHVHASDT LLFAVPADPT TQFDLKDLIG LRRDLDECMK DIADLEADIS ENEVLSTDQN
TFGVSALIPT PPSPAGALAA VAIKVKQAKL VELKAKQKLV ALKVDNLQQK FAKHVQHLSA
VRMSASDAQL GFKGNRLVAT ADGVTLAHAQ GKAKLDVREA KIGVTAGKSS VELDEGKIAA
GCGSASLKLG SDGAIDVSAT DVKLNGTNVK LNGSASLKFD GQLIQLG