Gene BURPS1106A_A2032 details


Gene Information

Locus tag: BURPS1106A_A2032
Symbol:
ID: 4903319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2001981
End bp: 2005004
Gene Length: 3024 bp
Protein Length: 1007 aa
Translation table: 11
GC content: 69%
IMG OID: 640145137
Product: Rhs element Vgr protein
Protein accession: YP_001076065
Protein GI: 126457275
COG category: [S] Function unknown
COG ID: [COG3501] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein
            [TIGR03361] type VI secretion system Vgr family protein
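
The coordinate and length fields in this record agree with one another, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2001981, 2005004)
print(length, protein_length_aa(length))  # 3024 1007
```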


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTCGT CCCCCCGCCA CGATGCGCCC GCGTCGCGCG CGAACGCCGC GCCGTCCGCG 
AACGCGCGAC GCTTCACGTT CGCGAGCGAC GCGTACGACC CCGCGACGTT CGACGTCGTC
GACATCAACG GCCGCGACGC GATCTCGCAG CCGTACCGGT TCGAGATCAC GCTCGTGAGC
AGGCAGTTGC GGATCGACTT CGCGAAGATG CTGAGCCGCG GGGCGACGCT CGCGATCCTG
CCGCCGTTCG GCGAGGCCGG CACCACCCGC TATGCCGGCG TGCTCGCCGA ATTCGAGCAG
AAGAAGCGCT TTCGCGACTT CACCGTCTAT CGCGCGACGC TCGTGCCGCG CCTCTGGCGA
CTATCGCTGT ACAAGGCGTC GGACGTCTAT CTGAACGAGC AGACGATTCC CGACATCGTC
AAGCGCGTGC TGCGCGCCGC CTCGTTCGGC AGCCGCGATT TCCGCATGCG GCACGGCGGC
GGCTACCGCA AGCGCAGCTT CGTCTGCCAG TACGACGAGA GCCATCTCGA TTTCGTGTCG
CGCTGGATGG AGAAGGAAGG CCTCTACTAC TACTTCGAGC ATGACGGCCG GCACGAAACG
CTCGTGATCG TCGACGACCG CCGCCATCAG CCCGGCCCCG CCGACGATCT CGCGCTGCGC
TACCGACCCG CGACCAGCCT CGACGCGGGC ATCGAAGCGG ACCGCGTGCA GGCGTTCACA
TGCCGGGCGA CGCCGCTGCC GCGCGAAGTC GTGCTGCGCG ATTTCAACCA CCGCAAGGCG
GAGCTCTCGC TCGAAGTCCG CGAGCGCGTG GCGCGCGACG GCGTCGGCGA GCGGGTGTCG
AGCGACGAGC ACTTCCACAC GAAGGACGAA GGGCGGCGCT ACGCGAAGCT GCGCGCCGAG
GCGCTCGTCT GCGAAGGGCG CCGATTCGCC GGCGAATCGA CCGCTGCCGG GCTGCGCGCC
GGCCGCTTCT TCGCGCTGTC GGGCCACTAC CGCGAGGATT TCGACGGCCG CTATCTGGTG
ACGGCGCTCA CGCATCGCGG CTCGCAGGCA CACCTGCTGT TTCCCGATCT CGACGCGCCG
TTCGGCGCGA CGCCGGGCGA GCCCATCTAC CGCGCCGAGT TCGAGGCGAT TGCCGCCGAC
CTCCAGTACC GGCCGCCGCG CACGACGCCC AAGCCGCGCG CGGCGGGCGT CGTCAGCGCG
ATCGTCGACG GCGAGGGCGG CGGCAAGCTC GCCGAGCTCG ACGAATACGG CCAGTACAAG
GTGCGCTTTC CGTTCGCGCA CACCGCGCAT CCGGCGAACA AGGCCTCCGC GCGCATCCGG
ATGGCGACGC CCTACGCGGG CGACGACCGC GGCATGCACC TGCCGCTGCT GAAGCGCACC
GAAGTGAAGA TCGCATTCGA CGGCGGCGAT CCGGACCGCC CCGTGATCGT CGGCGCGGTG
CCCAACTCGT CGCACCGCAG CGTCGTCACG CGCCGCAACC CCGCCGAGCA TCGCATCCTC
ACCGAGCACA ACCAGCTCTA CATGAAGGAC GGCAGCGGCG CGGCGACGTG GCTGCACGCG
CCGAACAACC ACATCGGCAT CGGCGCGGTC GGGCCGGGCG ACGGCCTCGC GCTCCTCACG
TCCGGCAACA AGTTCGACTT CTCGCTCGGC AACGCGTACA GCTTCTCGGG CGGGCTCAAG
TGCTCGGTGT CGATGGGCGG CAACACCGAC GTCTACGTCG GCGTGCGCAA CAGCCTCGAC
GTCAGCGCGA ACTTCCTGAC GACGCTGCAG GGCAACCTGC GCTGGATGCT GCCCGGCAGC
CGGAGCTTCG AGATCAACGA CAGCGCATCG ACACTGCTGC AGACGCTGCA CAAGCAGTCC
GCGACGGGTG CGATCCGGCT GTCCGCCGGG CAGGACGCGT CCGCGCTGCT GCAAAAGCAG
CTCGACAAGC TCAAGGGCAC GGTGCGCAAG TTCATGATCG TGTCGGGCCT CGCGAACGCC
GGGTCCGCGG CCGCCGCCGC GGGGCTCATC AAGGGCGGCG GCAAGCTCGC CGATCTGCCG
TGGGCGGGCT TCGGCATATC TGCCGCGCAG TTCGCCGGCG CGACCGGCGT CAGCACGGCG
CTGATGGCGA CCTCGCGCAC GCTGCTCGCG AACGTCGCGA AGCTCCAGGA GGCGTTGCCG
CTCGTCGCCG ATCTGTCGCT CGGCAAGCAG GGCATCGCGC TCGCCGCGAA GAACCTCACG
CACGCGACGC GGATGTCGCT CACCGTCGAC GGCGTCTCGT GGTCGACGCA CGCGAAGGGG
CCCGGCGCGG CGGGCGCCGC GATGAGCGTC GGCAAGGGCC GCTGGGGCGT CGAAGCGGCG
AAGCACGCGC ACGTCCACGC GAGCGACACG CTGCTGTTCG CGGTGCCGGC CGACCCGACG
ACCCAGTTCG ACCTGAAGGA CCTGATCGGG CTGCGCCGCG ATCTCGACGA ATGCATGAAG
GACATCGCCG ATCTCGAAGC CGACATTTCG GAAAACGAAG TGCTGTCGAC CGATCAGAAC
ACGTTCGGCG TCAGCGCGCT CATCCCCACG CCGCCGTCGC CCGCCGGCGC GCTCGCGGCG
GTCGCGATCA AGGTGAAGCA AGCGAAGCTC GTCGAGCTGA AGGCCAAGCA GAAGCTCGTC
GCGCTGAAGG TCGACAACCT GCAGCAGAAG TTCGCGAAGC ACGTGCAGCA CCTGAGCGCC
GTGCGGATGA GCGCTTCCGA CGCGCAGCTC GGCTTCAAGG GCAACCGGCT CGTCGCGACG
GCCGACGGCG TCACGCTCGC GCATGCGCAG GGCAAGGCGA AGCTCGACGT GCGCGAAGCG
AAGATCGGCG TCACGGCGGG CAAATCGAGC GTCGAGCTCG ACGAAGGCAA GATCGCGGCC
GGCTGCGGCA GCGCATCGCT GAAGCTCGGC AGCGACGGCG CGATCGACGT GAGCGCGACC
GACGTCAAGC TGAACGGCAC CAACGTCAAG CTGAACGGCA GCGCGTCGCT GAAGTTCGAC
GGCCAACTGA TCCAGCTCGG CTGA
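
The GC content reported for this gene can be recomputed from the sequence text. The sketch below is a minimal, hypothetical helper pair: it assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and contains only A, C, G and T. Passing the full gene sequence allows a comparison with the 69% figure in the record. The usage line runs on the first two blocks only.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, copied from the record.
fragment = clean_sequence("ATGCCTTCGT CCCCCCGCCA")
print(f"{gc_fraction(fragment):.0%}")  # 70%
```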
 
Protein sequence
MPSSPRHDAP ASRANAAPSA NARRFTFASD AYDPATFDVV DINGRDAISQ PYRFEITLVS 
RQLRIDFAKM LSRGATLAIL PPFGEAGTTR YAGVLAEFEQ KKRFRDFTVY RATLVPRLWR
LSLYKASDVY LNEQTIPDIV KRVLRAASFG SRDFRMRHGG GYRKRSFVCQ YDESHLDFVS
RWMEKEGLYY YFEHDGRHET LVIVDDRRHQ PGPADDLALR YRPATSLDAG IEADRVQAFT
CRATPLPREV VLRDFNHRKA ELSLEVRERV ARDGVGERVS SDEHFHTKDE GRRYAKLRAE
ALVCEGRRFA GESTAAGLRA GRFFALSGHY REDFDGRYLV TALTHRGSQA HLLFPDLDAP
FGATPGEPIY RAEFEAIAAD LQYRPPRTTP KPRAAGVVSA IVDGEGGGKL AELDEYGQYK
VRFPFAHTAH PANKASARIR MATPYAGDDR GMHLPLLKRT EVKIAFDGGD PDRPVIVGAV
PNSSHRSVVT RRNPAEHRIL TEHNQLYMKD GSGAATWLHA PNNHIGIGAV GPGDGLALLT
SGNKFDFSLG NAYSFSGGLK CSVSMGGNTD VYVGVRNSLD VSANFLTTLQ GNLRWMLPGS
RSFEINDSAS TLLQTLHKQS ATGAIRLSAG QDASALLQKQ LDKLKGTVRK FMIVSGLANA
GSAAAAAGLI KGGGKLADLP WAGFGISAAQ FAGATGVSTA LMATSRTLLA NVAKLQEALP
LVADLSLGKQ GIALAAKNLT HATRMSLTVD GVSWSTHAKG PGAAGAAMSV GKGRWGVEAA
KHAHVHASDT LLFAVPADPT TQFDLKDLIG LRRDLDECMK DIADLEADIS ENEVLSTDQN
TFGVSALIPT PPSPAGALAA VAIKVKQAKL VELKAKQKLV ALKVDNLQQK FAKHVQHLSA
VRMSASDAQL GFKGNRLVAT ADGVTLAHAQ GKAKLDVREA KIGVTAGKSS VELDEGKIAA
GCGSASLKLG SDGAIDVSAT DVKLNGTNVK LNGSASLKFD GQLIQLG
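
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; translation table 11 shares
# these assignments with the standard code (the tables differ in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGCCTTCGT CCCCCCGCCA CGATGCGCCC"))  # MPSSPRHDAP
```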