Gene BURPS1710b_1771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1771 
Symbol 
ID3690362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1924245 
End bp1927085 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content71% 
IMG OID637728227 
ProductRhs element Vgr protein 
Protein accessionYP_333172 
Protein GI76810168 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.156841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATCC CCACTGACGT TCTGCAAGCA CTCTTCGGCG GCTGGTCGCA ACACGACCGC 
TTTCTCTGGA TCACGACGCC GCTCGGCGCG AACGCGCTCG TCGCGGAAAG CCTGCACGGC
TGGGAGGCGC TCGACGATGG CGGCTTCCGT TTGCAGCTCA CCGCGCTCGC CGAGCATCCG
TCGCTGCCGC TCGCGCAACT GATCGGTGCG CCGATCCTGA TCGAATGGCA GGCGCAGGAA
GGGCGCGACG CGCGCCGGCC GTTCCACGGC CACGTGATCG CCGCCGAGCT CGTCGGCTAC
AACGGCGGAC TCGCACGCGT GCGGCTCGTC GTCGAGCCGT GGCTCACGCT GTTGCGGCAG
CGCGTCGACA GCTACAACTA TCTGGACGCG AGCGTCGTCG AGATCAGCGA ACAGGTGTTC
CGCCGTTACG CGCGCGGCGT GATCGCGCCC GCATGGCGAT GGGCGCTCGC CGATGCGGCG
AAGTACCCGA AGCGCAGCTT GACCGCGCAA GCCGGCGAGT CCGATTTCGA CTTCCTCGAG
CGGCTCTGGG CGGAGGAGGG CATCTTCTAC TGGTTCGAGC ACGAAGGCGA CGCGCGCGCG
TCGAGCCTCG GCAAGCACAC GCTCGTGCTC GCCGATTCGA ATCAGCGCTT CGCGCCGGAC
GAACCGGAGA TCGTCGGCTT TCATCAGACG AGCGACGACG ACCCGCAAGG CTGCATCCAG
CACTTCATGC ACGCGCGGCG TTGGCGCATC GGCAGCGTCG CGCGCGCGAG CTGGGATCAC
CGCAGCCTGT CGACGCGTCC GACGGGCGCG CGCGCGAACG GCGCGGTCGC GCCGGGCGAG
GATCGCGACG TCGCGGGCCC GTACGCATAC CAGACGGGCG CGATCGGAGA CCGGCGCGCG
CAGCAGCAAC TCGACGCCCA GCGCGTGGCC GCGCTGCGGA GCGAAGGCCG CAGCACGCGC
CGCGATCTGC GGCCCGGGCT GCGTTTTGCG ATCGCGCATC ATCCGACGCT CGGCGTATCG
GATGCGTTCA TCTGCCTGCG CGTCGAGCAT TCGGCGCGCG CGAACGTCGA TGCGACCGTG
CGCAGCGCGA TCGAACAGCG CCTCGGCGCG ATTGCGTCGA TCGCCGGCGC CGCGCCGGTG
CCGCATGGCC CCGCGAGCGC GCTGAATGCG GCGCTCGGCG CCGATACGCA TCACGGCGGA
TCGCTGATGC GGGACGACGC CGTCTATCGC AATCGCTTCG TCGCGTTGCC CGCCGAGCAG
ACGTATCGGC CGCTCGCCGC ATCCGGCCAT GGCGCGCGCA TGCACCCGGT CGCCGTGATG
CCGGGGGCGC AGACGGCGAT CGTCGTGGGC GCGGGCGATC CCGTTCACAC CGACCGGGAT
CACCGGATTC GAATCCAGCA CCACGCGCAG CGCGGCGAGA ATGCGGCGAG CCGCGACGAT
CACCCGCATG CGGCGAACGC GCCGGCGAAC CGTGGCGCGG GCACGTGGAC GCGGATGATG
ACGCCCGTGG GCGGCGATAA CTGGGGTGGG GTGAGCGTGC CGCGCGTCGG GCAGGAAGTG
TGGACCGAAT GGCTCGAAGG CCAGCCCGAC CGGCCGGTTG CGGTGGCCGC GCTCTACAAC
GGCCGCGGCA ACGCGCACGC GCAGCACAAC GCGCAGGCGG GAGGGCCGAG CGGCAGCACC
GGCAATGCGG CCGCATGGTT CGCCGGCAAT GCGCATGCGG CGGTGCTGAC GGGTTTCAAG
ACGCAGGACA TGAGCATGAG CCAGCAAGGC ACGGGCGGCT ATCGGCAATT CATGCTCGAC
GATACGGCGG GCCAGTCGAG CGCGCGCCTG TACACGACGG ACCGCAACAG CGGGCTCACG
CTCGGGCACG TGAAGCAGAC GCAGGACAAC CGGCGCCAGG CCGATCGCGG CTATGGCGCG
GAACTGGCGA CGGATGCCGC CGGCGCGCTG CGCGGCGGCG CGGGGCTCTT GATCAGCACG
GCGCCCGGCG TGAGCCAGAT GGATGCGAGC GCGCCGAGCC AGGTGCTCGC TCAGCATCGA
CAGACGTTGC AAAGCCTCGC GGCGCTCGCG CACAAGCAGG GCGCGGAGCC GGGCGGTGCG
GTGCCCGAGG CGGCGGGCGG AGCGGGCGCG GCGTCGGCGT CGACGGGGGC GGCGGCGGAC
AAGCCTTTGC CGGCCGTCGA TGGCATCGAG CAGAGCCGTG AAGCGATCGG CGCGACGCGT
GAAGGAAGCG GCGGCGATAC GGCAGGCGGC GGGGGCGGCG GCGCGATTGC GTGGAGCAAG
CCGCATCTCG TCGCACACGG CGAGGCGGGG CTGGCCGCGA TGTCGGCGAA GAGCCACGTG
TGGGTGTCGG GTACCGAGAC GGTGTTGAGC GCCGGACAGG ACGTGCAATT GACGGCGAAG
GGCAAGACGA GCGTCGTCGC GAATCACGGC ATCTCGCTGT ATACGCAGGG CGCGGCGGGC
GACGGGCGGC CGGTTGCCGG CCGGGGCATC GCGTTGCACG CGGCGTCCGG CGCGGTGAGT
GTGCAAGCGC AGAACGCCGG CAAGCTGAGC GCGAGCGCGC AGAAGGCGGT GACGCTCGCG
AGCGCGCAAG GCAGCGCGTC CGTGCAGGCG CAGCAGCGCG TCCTGCTGAG CGCGGCGAAG
GCCTATCTGA AGATGGAAGG CAACGACATC GTCGTCGGCG CGCCGGGGCG CGCGGATTTC
AAGGCGGCGG CGCATCAGTT GACGGGGCCG AAGAGCGCGG GGGCGTCAGC GCAAATGCCG
AAGAGCGAAC CGAAGCTGTG CGAGTACAAG ACGCGCGCGG CCGACGTTGC TCACGAAGGC
ACGATGAAGT CGGCGGCTTG A
 
Protein sequence
MSIPTDVLQA LFGGWSQHDR FLWITTPLGA NALVAESLHG WEALDDGGFR LQLTALAEHP 
SLPLAQLIGA PILIEWQAQE GRDARRPFHG HVIAAELVGY NGGLARVRLV VEPWLTLLRQ
RVDSYNYLDA SVVEISEQVF RRYARGVIAP AWRWALADAA KYPKRSLTAQ AGESDFDFLE
RLWAEEGIFY WFEHEGDARA SSLGKHTLVL ADSNQRFAPD EPEIVGFHQT SDDDPQGCIQ
HFMHARRWRI GSVARASWDH RSLSTRPTGA RANGAVAPGE DRDVAGPYAY QTGAIGDRRA
QQQLDAQRVA ALRSEGRSTR RDLRPGLRFA IAHHPTLGVS DAFICLRVEH SARANVDATV
RSAIEQRLGA IASIAGAAPV PHGPASALNA ALGADTHHGG SLMRDDAVYR NRFVALPAEQ
TYRPLAASGH GARMHPVAVM PGAQTAIVVG AGDPVHTDRD HRIRIQHHAQ RGENAASRDD
HPHAANAPAN RGAGTWTRMM TPVGGDNWGG VSVPRVGQEV WTEWLEGQPD RPVAVAALYN
GRGNAHAQHN AQAGGPSGST GNAAAWFAGN AHAAVLTGFK TQDMSMSQQG TGGYRQFMLD
DTAGQSSARL YTTDRNSGLT LGHVKQTQDN RRQADRGYGA ELATDAAGAL RGGAGLLIST
APGVSQMDAS APSQVLAQHR QTLQSLAALA HKQGAEPGGA VPEAAGGAGA ASASTGAAAD
KPLPAVDGIE QSREAIGATR EGSGGDTAGG GGGGAIAWSK PHLVAHGEAG LAAMSAKSHV
WVSGTETVLS AGQDVQLTAK GKTSVVANHG ISLYTQGAAG DGRPVAGRGI ALHAASGAVS
VQAQNAGKLS ASAQKAVTLA SAQGSASVQA QQRVLLSAAK AYLKMEGNDI VVGAPGRADF
KAAAHQLTGP KSAGASAQMP KSEPKLCEYK TRAADVAHEG TMKSAA