Gene BURPS668_1590 details


Gene Information

Locus tag: BURPS668_1590
Symbol:
ID: 4884726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1557430
End bp: 1560270
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 71%
IMG OID: 640127518
Product: Rhs element Vgr protein
Protein accession: YP_001058631
Protein GI: 126441823
COG category: [S] Function unknown
COG ID: [COG3501] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein; [TIGR03361] type VI secretion system Vgr family protein
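
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_len_bp // 3 - 1


length = gene_length(1557430, 1560270)
print(length)                  # 2841
print(protein_length(length))  # 946
```

Both values agree with the Gene Length and Protein Length fields listed in the record.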


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGATCC CCACTGACGT TCTGCAAGCA CTCTTCGGCG GCTGGTCGCA ACACGACCGC 
TTTCTCTGGA TCACGACGCC GCTCGGCGCG AACGCGCTCG TCGCGGAAAG CCTGCACGGC
TGGGAGGCGC TCGACGATGG CGGCTTCCGT TTGCAGCTCA CCGCGCTCGC CGAGCATCCG
TCGCTGCCGC TCGCGCAACT GATCGGCGCG CCGATCCTGA TCGAATGGCA GGCGCAGGAA
GGGCGCGACG CGCGCCGGCC GTTCCACGGC CACGTGATCG CCGCCGAGCT CGTCGGCTAC
AACGGCGGCC TCGCACGCGT GCTGCTCGTC GTCGAGCCGT GGCTCACGCT GCTGCGGCAG
CGCGTCGACA GCTACAACTA TCTGGACGCG AGCGTCGTCG AGATCAGCGA ACAGGTGTTC
CGCCGTTACG CGCGCGGCGC GATCGCGCCC GCATGGCGAT GGGCGCTCGC CGATGCGGCG
AAGTACCCGA AGCGCAGCTT GACCGCGCAA GCCGGCGAGT CCGATTTCGA TTTCCTCGAG
CGGCTCTGGG CGGAGGAGGG CATCTTCTAC TGGTTCGAGC ACGAAGGCGA CGCGCGCGCA
TCGAGCCTTG GCAAGCACAC GCTCGTGCTC GCCGATTCGA ATCAGCGCTT CGCGCCGGAC
GAACCGGAGA TCGTCGGCTT TCATCAGACG AGCGACGACG ACCCGCAAGG CTGCATCCAG
CACTTCATGC ACGCGCGGCG TTGGCGCATC GGCAGCGTCG CGCGCGCGAG CTGGGATCAC
CGCAGCCTGT CGACGCGTCC GACGGGCGCG CGCGCGAACG GCGCGGTCGC GCCGGGCGAG
GATCGCGACG TCGCGGGCCC GTACGCATAC CAGACGGGCG CGATCGGAGA CCGGCGCGCG
CAGCAGCAAC TCGACGCCCA GCGCGTGGCC GCGCTGCGGA GCGAAGGCCG CAGCACGCGC
CGCGATCTGC GGCCCGGGCT GCGTTTTGCG ATCGCGCATC ATCCGACGCT CGGCGTATCG
GATGCGTTCA TCTGCCTGCG CGTCGAGCAT TCGGCGCGCG CGAACGTCGA CGCGACCGTG
CGCAGCGCGA TCGAACAGCG CCTCGGCGCG ATTGCGTCGA TCGCCGGCGC CGCGCCGGTG
CCGCATGGCC CCGCGAGCGC GCTGAATGCG GCGCTCGGCG CCGATACGCA TCACGGCGGA
TCGCTGATGC GGGACGACGC CGTCTATCGC AATCGCTTCG TCGCGTTGCC CGCCGAGCAG
ACGTATCGGC CGCTCGCCGC ATCCGGCCAT GGCGCGCGCA TGCACCCCGT CGCCGTGATG
CCGGGGGCGC AGACGGCGAT CGTCGTGGGC GCGGGCGATC CCGTTCACAC CGACCGGGAT
CACCGGATTC GAATCCAGCA CCACGCGCAG CGCGGCGAGA ATGCGGCGAG CCGCGACGAT
CACCCGCATG CGGCGAACGC GCCGGCGAAC CGTGGCGCGG GCACGTGGAC GCGGATGATG
ACGCCCGTGG GCGGTGATAA CTGGGGTGGG GTGAGCGTGC CGCGCGTCGG GCAGGAAGTG
TGGACCGAAT GGCTCGAAGG CCAGCCCGAC CGGCCGGTTG CGGTGGCCGC GCTCTACAAC
GGCCGCGGCA ACGCGCACGC GCAGCACAAC GCGCAGGCGG GAGGGCCGAG CGGCAGCACC
GGCAATGCGG CCGCATGGTT CGCCGGCAAT GCGCATGCGG CGGTGCTGAC GGGTTTCAAG
ACGCAGGACA TGAGCATGAG CCAGCAAGGC ACGGGCGGCT ATCGGCAATT CATGCTCGAC
GATACGGCGG GCCAGTCGAG CGCGCGCCTG TACACGACGG ACCGCAACAG CGGGCTCACG
CTCGGGCACG TGAAGCAGAC GCAGGACAAC CGGCGCCAGG CCGATCGCGG CTATGGCGCG
GAACTGGCGA CGGATGCCGC CGGCGCGCTG CGCGGCGGCG CGGGGCTCTT GATCAGCACG
GCGCCCGGCG TGAGCCAGAT GGATGCGAGC GCGCCGAGCC AGGTGCTCGC TCAGCATCGA
CAGACGTTGC AAAGCCTCGC GGCGCTCGCG CACAAGCAGG GCGCGGAGCC GGGCGGTGCG
GTGCCCGAGG CGGCGGGCGG AGCCGGCGCG GCGTCGGCGT CGACGGGGGC GGCGGCGGAC
AAGCCTTTGC CGGCCGTCGA CGGCATCGAG CAGAGCCGTG AAGCGATCGG CGCGACGCGT
GAAGGAAGCG GCGGCGATAC GGCAGGCGGC GGGGGCGGCG GCGCGATTGC GTGGAGCAAG
CCGCATCTCG TCGCACACGG CGAGGCGGGG CTGGCCGCGA TGTCGGCGAA GAGCCACGTG
TGGGTGTCGG GTACCGAGAC GGTGCTGAGC GCCGGACAGG ACGTGCAATT GACGGCGAAG
GGCAAGACGA GCGTCGTCGC GAATCACGGC ATCTCGCTGT ATACGCAGGG CGCGGCGGGC
GACGGGCGGC CGGTTGCCGG CCGGGGCATC GCGTTGCACG CGGCGTCCGG CGCGGTGAGC
GTGCAAGCGC AGAACGCCGG CAAGCTGAGC GCGAGCGCGC AGAAGGCGGT GACGCTCGCG
AGCGCGCAAG GCAGCGCGTC CGTGCAGGCG CAGCAGCGCG TCCTGCTGAG CGCGGCGAAG
GCCTATCTGA AGATGGAAGG CAACGACATC GTCGTCGGCG CGCCGGGGCG CGCGGATTTC
AAGGCGGCGG CGCATCAGTT GACGGGGCCG AAGAGCGCGG GGGCGTCAGC GCAAATGCCG
AAGAGCGAAC CGAAGCTGTG CGAGTACAAG ACGCGCGCGG CCGACGTTGC TCACGAAGGC
ACGATGAAGT CGGCGGCTTG A
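
The gene sequence is printed in blocks of 10 bases, so whitespace has to be stripped before any computation. As an illustration, the sketch below computes GC content as the percentage of G and C bases. The helper names `clean_sequence` and `gc_percent` are hypothetical. Applied to the full gene sequence, the result should be comparable to the GC content field, although the rounding used by the database is not stated in this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGTCGATCC CCACTGACGT"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 55.0
```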
 
Protein sequence
MSIPTDVLQA LFGGWSQHDR FLWITTPLGA NALVAESLHG WEALDDGGFR LQLTALAEHP 
SLPLAQLIGA PILIEWQAQE GRDARRPFHG HVIAAELVGY NGGLARVLLV VEPWLTLLRQ
RVDSYNYLDA SVVEISEQVF RRYARGAIAP AWRWALADAA KYPKRSLTAQ AGESDFDFLE
RLWAEEGIFY WFEHEGDARA SSLGKHTLVL ADSNQRFAPD EPEIVGFHQT SDDDPQGCIQ
HFMHARRWRI GSVARASWDH RSLSTRPTGA RANGAVAPGE DRDVAGPYAY QTGAIGDRRA
QQQLDAQRVA ALRSEGRSTR RDLRPGLRFA IAHHPTLGVS DAFICLRVEH SARANVDATV
RSAIEQRLGA IASIAGAAPV PHGPASALNA ALGADTHHGG SLMRDDAVYR NRFVALPAEQ
TYRPLAASGH GARMHPVAVM PGAQTAIVVG AGDPVHTDRD HRIRIQHHAQ RGENAASRDD
HPHAANAPAN RGAGTWTRMM TPVGGDNWGG VSVPRVGQEV WTEWLEGQPD RPVAVAALYN
GRGNAHAQHN AQAGGPSGST GNAAAWFAGN AHAAVLTGFK TQDMSMSQQG TGGYRQFMLD
DTAGQSSARL YTTDRNSGLT LGHVKQTQDN RRQADRGYGA ELATDAAGAL RGGAGLLIST
APGVSQMDAS APSQVLAQHR QTLQSLAALA HKQGAEPGGA VPEAAGGAGA ASASTGAAAD
KPLPAVDGIE QSREAIGATR EGSGGDTAGG GGGGAIAWSK PHLVAHGEAG LAAMSAKSHV
WVSGTETVLS AGQDVQLTAK GKTSVVANHG ISLYTQGAAG DGRPVAGRGI ALHAASGAVS
VQAQNAGKLS ASAQKAVTLA SAQGSASVQA QQRVLLSAAK AYLKMEGNDI VVGAPGRADF
KAAAHQLTGP KSAGASAQMP KSEPKLCEYK TRAADVAHEG TMKSAA
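
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, assuming the usual amino acid assignments of table 11 (identical to the standard code for internal codons) and a sequence that starts with ATG, as this gene does. The names `CODON_TABLE` and `translate` are illustrative, and alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGTCGATCC CCACTGACGT TCTGCAAGCA"))  # MSIPTDVLQA
print(translate("ACGATGAAGT CGGCGGCTTG A"))           # TMKSAA
```

The two outputs match the first and last residues of the protein sequence above.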