Gene BURPS1106A_1615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1615 
Symbol 
ID4900993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1568463 
End bp1571303 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content71% 
IMG OID640134845 
ProductRhs element Vgr protein 
Protein accessionYP_001065886 
Protein GI126453710 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATCC CCACTGACGT TCTGCAAGCA CTCTTCGGCG GCTGGTCGCA ACACGACCGC 
TTTCTCTGGA TCACGACGCC GCTCGGCGCG AACGCGCTCG TCGCGGAAAG CCTGCACGGC
TGGGAGGCGC TCGACGATGG CGGCTTCCGT TTGCAGCTCA CCGCGCTCGC CGAGCATCCG
TCGCTGCCGC TCGCGCAACT GATCGGCGCG CCGATCCTGA TCGAATGGCA GGCGCAGGAA
GGGCGCGACG CGCGCCGGCC GTTCCACGGC CACGTGATCG CCGCCGAGCT CGTCGGCTAC
AACGGCGGCC TCGCACGCGT GCGGCTCGTC GTCGAGCCGT GGCTCACGCT GCTGCGGCAG
CGCGTCGACA GCTACAACTA TCTGGACGCG AGCGTCGTCG AGATCAGCGA ACAGGTGTTC
CGCCGTTACG CGCGCGGCGC GATCGCGCCC GCATGGCGAT GGGCGCTCGC CGATGCGGCG
AAGTACCCGA AGCGCAGCTT GACCGCGCAA GCCGGCGAGT CCGATTTCGA CTTCCTCGAG
CGGCTCTGGG CGGAGGAGGG CATCTTCTAC TGGTTCGAGC ACGAAGGCGA CGCGCGCGCA
TCGAGCCTCG GCAAGCACAC GCTCGTGCTC GCCGATTCGA ATCAGCGCTT CGCGCCGGAC
GAACCGGAGA TCGTCGGCTT TCATCAGACG AGCGACGACG ACCCGCAAGG CTGCATCCAG
CACTTCGTGC ACGCGCGGCG TTGGCGCATC GGCAGCGTCG CGCGCGCGAG CTGGGATCAC
CGCAGCCTGT CGACGCGTCC GACGGGCGCG CGCGCGAACG GCGCGGTCGC GCCGGGCGAG
GATCGCGACG TCGCGGGCCC GTACGCATAC CAGACGGGCG CGATCGGAGA CCGGCGCGCG
CAGCAGCAAC TCGACGCCCA GCGCGTGGCC GCGCTGCGGA GCGAAGGCCG CAGCACGCGC
CGCGATCTGC GGCCCGGGCT GCGTTTTGCG ATCGCGCATC ATCCGACGCT CGGCGTATCG
GATGCGTTCA TCTGCCTGCG CGTCGAGCAT TCGGCGCGCG CGAACGTCGA TGCGACCGTG
CGCAGCGCGA TCGAACAGCG CCTCGGCGCG ATTGCGTCGA TCGCCGGCGC CGCGCCGGTG
CCGCATGGCC CCGCGAGCGC GCTGAATGCG GCGCTCGGCG CCGATACGCA TCACGGCGGA
TCGCTGATGC GGGACGACGC CGTCTATCGC AATCGCTTCG TCGCGTTGCC CGCCGAGCAG
ACGTATCGGC CGCTCGCCGC ATCCGGCCAT GGCGCGCGCA TGCACCCGGT CGCCGTGATG
CCGGGGGCGC AGACGGCGAT CGTCGTGGGC GCGGGCGATC CCGTTCACAC CGACCGGGAT
CACCGGATTC GAATCCAGCA CCACGCGCAG CGCGGCGAGA ATGCGGCGAG CCGCGACGAT
CACCCGCATG CGGCGAACGC GCCGGCGAAC CGTGGCGCGG GCACGTGGAC GCGGATGATG
ACGCCCGTGG GCGGCGATAA CTGGGGTGGG GTGAGCGTGC CGCGCGTCGG GCAGGAAGTG
TGGACCGAAT GGCTCGAAGG CCAGCCCGAC CGGCCGGTTG CGGTGGCCGC GCTCTACAAC
GGCCGCGGCA ACGCGCACGC GCAGCACAAC GCGCAGGCGG GAGGGCCGAG CGGCAGCACC
GGCAATGCGG CCGCATGGTT CGCCGGCAAT GCGCATGCGG CGGTGCTGAC GGGTTTCAAG
ACGCAGGACA TGAGCATGAG CCAGCAAGGC ACGGGCGGCT ATCGGCAATT CATGCTCGAC
GATACGGCGG GCCAGTCGAG CGCGCGCCTG TACACGACGG ACCGCAACAG CGGGCTCACG
CTCGGGCACG TGAAGCAGAC GCAGGACAAC CGGCGCCAGG CCGATCGCGG CTATGGCGCG
GAACTGGCGA CGGATGCCGC CGGCGCGCTG CGCGGCGGCG CGGGGCTCTT GATCAGCACG
GCGCCCGGCG TGAGCCAGAT GGATGCGAGC GCGCCGAGCC AGGTGCTCGC TCAGCATCGA
CAGACGTTGC AAAGCCTCGC GGCGCTCGCG CACAAGCAGG GCGCGGAGCC GGGCGGTGCG
GTGCCCGAGG CGGCGGGCGG AGCGGGCGCG GCGTCGGCGT CGACGGGGGC GGCGGCGGAC
AAGCCTTTGC CGGCCGTCGA TGGCATCGAG CAGAGCCGTG AAGCGATCGG CGCGACGCGT
GAAGGAAGCG GCGGCGATAC GGCAGGCGGC GGGGGCGGCG GCGCGATTGC GTGGAGCAAG
CCGCATCTCG TCGCACACGG CGAGGCGGGG CTGGCCGCGA TGTCGGCGAA GAGCCACGTG
TGGGTGTCGG GTACCGAGAC GGTGTTGAGC GCCGGACAGG ACGTGCAATT GACGGCGAAG
GGCAAGACGA GCGTCGTCGC GAATCACGGC ATCTCGCTGT ATACGCAGGG CGCGGCGGGC
GACGGGCGGC CGGTTGCCGG CCGGGGCATC GCGTTGCACG CGGCGTCCGG CGCGGTGAGT
GTGCAAGCGC AGAACGCCGG CAAGCTGAGC GCGAGCGCGC AGAAGGCGGT GACGCTCGCG
AGCGCGCAAG GCAGCGCGTC CGTGCAGGCG CAGCAGCGCG TCCTGCTGAG CGCGGCGAAG
GCCTATCTGA AGATGGAAGG CAACGACATC GTCGTCGGCG CGCCGGGGCG CGCGGATTTC
AAGGCGGCGG CGCATCAGTT GACGGGGCCG AAGAGCGCGG GGGCGTCAGC GCAAATGCCG
AAGAGCGAAC CGAAGCTGTG CGAGTACAAG ACGCGCGCGG CCGACGTTGC TCACGAAGGC
ACGATGAAGT CGGCGGCTTG A
 
Protein sequence
MSIPTDVLQA LFGGWSQHDR FLWITTPLGA NALVAESLHG WEALDDGGFR LQLTALAEHP 
SLPLAQLIGA PILIEWQAQE GRDARRPFHG HVIAAELVGY NGGLARVRLV VEPWLTLLRQ
RVDSYNYLDA SVVEISEQVF RRYARGAIAP AWRWALADAA KYPKRSLTAQ AGESDFDFLE
RLWAEEGIFY WFEHEGDARA SSLGKHTLVL ADSNQRFAPD EPEIVGFHQT SDDDPQGCIQ
HFVHARRWRI GSVARASWDH RSLSTRPTGA RANGAVAPGE DRDVAGPYAY QTGAIGDRRA
QQQLDAQRVA ALRSEGRSTR RDLRPGLRFA IAHHPTLGVS DAFICLRVEH SARANVDATV
RSAIEQRLGA IASIAGAAPV PHGPASALNA ALGADTHHGG SLMRDDAVYR NRFVALPAEQ
TYRPLAASGH GARMHPVAVM PGAQTAIVVG AGDPVHTDRD HRIRIQHHAQ RGENAASRDD
HPHAANAPAN RGAGTWTRMM TPVGGDNWGG VSVPRVGQEV WTEWLEGQPD RPVAVAALYN
GRGNAHAQHN AQAGGPSGST GNAAAWFAGN AHAAVLTGFK TQDMSMSQQG TGGYRQFMLD
DTAGQSSARL YTTDRNSGLT LGHVKQTQDN RRQADRGYGA ELATDAAGAL RGGAGLLIST
APGVSQMDAS APSQVLAQHR QTLQSLAALA HKQGAEPGGA VPEAAGGAGA ASASTGAAAD
KPLPAVDGIE QSREAIGATR EGSGGDTAGG GGGGAIAWSK PHLVAHGEAG LAAMSAKSHV
WVSGTETVLS AGQDVQLTAK GKTSVVANHG ISLYTQGAAG DGRPVAGRGI ALHAASGAVS
VQAQNAGKLS ASAQKAVTLA SAQGSASVQA QQRVLLSAAK AYLKMEGNDI VVGAPGRADF
KAAAHQLTGP KSAGASAQMP KSEPKLCEYK TRAADVAHEG TMKSAA