Gene BURPS668_A0797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0797 
Symbol 
ID4888758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp769908 
End bp772196 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content65% 
IMG OID640130737 
ProductRhs element Vgr protein 
Protein accessionYP_001061796 
Protein GI126443100 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTGA TCGAACTGCG CAGTCCCCTG CTGGACCCGG ACGCCGTCGC GCTGAGCTTC 
GTGGTGCACG AGAACCTGTC GCAGGAGCCG TCGTATCAGC TCGATTTGCT GAGCCACGAT
TCGAATCTGG ACTTCGACGC GCTGCTCGGC TCGACGCTGT CGGCCGACAT CGACCTGGGC
GAAGGCGACA TCCGGACGTT CAACACGCAC GTGTTCGGCG GCTACGACAC GGGGCAGATG
AGCGGGCAAT ACACGTACAC GCTGGAGCTG CGAAGCTGGT TGTCGTTTCT CGCGGAGAAC
CGCAACAGCC GGATCTTCCA GGATTTGAGC GTGCCGCAGA TCGTCGAGCA GGTGTTCCAG
GGCCATCAGC GCAACGGCTA CCGGTTCGAG CTCGAAGGCA CGTACGAGCC GCGCGAGTAC
TGCGTGCAGT TTCAGGAAAC GGATCTGAAC TTCGTGAAGC GGCTGCTGGA GGACGAAGGG
ATCTACTTCT GGGTGGAGCA CGAGCCGGAC CGTCATGTGG TGGTGATCTC GGACACGCAG
CGGTTCGAGG ATCTGCCGCT GCCGAACGAC ACGCTGGAGT ATTTGCCGGA CGGCGAGGAG
TCGCGCGCGA TCCAGGGGCG CGAAGGGGTG CAGCGGCTGC AGCGCACGCG GCGGATCAAG
TCGAACAACG TCGCGCTGCG GGATTTCGAC TATCACGCGC CGTCGAAGCA ACTGGACAGC
GACGCGCAGG TCGAGCAGCA GAGCCTCGGC GGCATTCCGC TCGAGTACTA CGACTACGCG
GCCGGCTACC GCGACCCCGA GCAGGGCGAG CGTCTCGCGC GGCTGCGGCT CGAAGCGATT
CAGGCTGATG CACACGCGCT CGGGGGCGAG GCGAACGCAC GCGCGCTGGC GGTGGGTCGC
GCGTTCACGC TGGTCGGCCA TCCGGCGCTG AGCCGCAATC GTCGGTACTA CGTGACGAAC
AGCGAGCTGA CGTTCATCCA GGACGGACCG GACAGCACGT CGCAGGGGCG CAACGTCGCG
GTGAAGTTCC GCGCGCTCGC CGACGATCAG CCGTTTCGGC CGCTGCTCGT CACCAAGCGG
CCGCGCGTGC CGGGCATCCA GAGCGCGACG GTGGTGGGCC CGGAGATGTC GGAGGTGCAT
ACCGACAAGC TCGGGCGGAT TCGCGTGCAC TTCCACTGGG ACCGCTACAA GACGACCGAG
GCGGACGCGT CGTGCTGGAT TCGCGTGACG CAGGCATGGG CGGGCAAGGG CTGGGGCGTG
CTCGCGATGC CGCGGGTCGG GCAGGAAGTC ATCGTCGTGT ATGTCGACGG CGATCTCGAC
CGGCCGCTCG CGACGGGCAT CGTCTACAAC GGCGAAAACC CGACGCCTTA TGACCTGCCG
AAGGATATCC GCTACACGGG CCTCGTCACA CGCTCGATCA AGCGGGCGGG CGGCATTCCG
AATGCGAGCC AACTGACGTT CGACGATCAG CACGGCGCGG AGCGCGTGAT GATCCACGCG
GAGCGCGACA TGCAGCAGAC GGTCGAGCGC AACAGCTCGA CGTCGATCGC ACAGGATCTG
AACCTGTCGG TGAAGGGCAC GTCGACGTCG GTCGTCGGCA TCTCGGTCAG CTTCACGGGC
ATCTCGGTGT CGTACACGGG GTTGTCGGTG AGCTTCACCG GCGTGTCGGC GAGGTTCACG
GGCGTGAGCA CGTCGTTTAC CGGCGTGAGC ACGAGTTTCA CCGGCGTGTC GACGTCGTTT
ACCGGCGTCG ATACCAGCTT CACCGGCGTC TCGACCGGAT TCAAGGGCGT CGACACGAGC
TTCACCGGCG TCGCGACGTC GATGGTGGGC GTGTCGACGA GCATCACGGG CTCCAGCAAT
TCCGTGACGG GCGTGTCGAA CAGCATGACG GGCATCTCGT CTTCCTGGAA GGACGTGAGC
ATGTCGACGA CCGGCCAGTC CGAAAGCATC ACGGGAGTAT CGCTGTCGTA CACGGGCACG
TCGAACAGCA TGACGGGCAC GAGCACGTCG GTGACGGGCA CCTCGACGAG CATCACCGGC
ACGTCGATGT CGAACACCGG CAGCTCGACG AGCATCACGG GCACATCGAT GTCGACGACG
GGCAGCTCGG TGAGCACGAC GGGCTCGAGC ATGTCGGCCA CCGGCAGTTC GGTGGGCACG
ACGGGCTCGA GCGTATCGAC GACGGGAAGC AAGATGTCGG TCACCGGCTT CAGCTTCTCG
TATACGGGAG CGAGCTACGA GGATGTGGGC GTCGATCTGA AAAAGCTCGG GATGCAGACG
AAGAACTGA
 
Protein sequence
MRLIELRSPL LDPDAVALSF VVHENLSQEP SYQLDLLSHD SNLDFDALLG STLSADIDLG 
EGDIRTFNTH VFGGYDTGQM SGQYTYTLEL RSWLSFLAEN RNSRIFQDLS VPQIVEQVFQ
GHQRNGYRFE LEGTYEPREY CVQFQETDLN FVKRLLEDEG IYFWVEHEPD RHVVVISDTQ
RFEDLPLPND TLEYLPDGEE SRAIQGREGV QRLQRTRRIK SNNVALRDFD YHAPSKQLDS
DAQVEQQSLG GIPLEYYDYA AGYRDPEQGE RLARLRLEAI QADAHALGGE ANARALAVGR
AFTLVGHPAL SRNRRYYVTN SELTFIQDGP DSTSQGRNVA VKFRALADDQ PFRPLLVTKR
PRVPGIQSAT VVGPEMSEVH TDKLGRIRVH FHWDRYKTTE ADASCWIRVT QAWAGKGWGV
LAMPRVGQEV IVVYVDGDLD RPLATGIVYN GENPTPYDLP KDIRYTGLVT RSIKRAGGIP
NASQLTFDDQ HGAERVMIHA ERDMQQTVER NSSTSIAQDL NLSVKGTSTS VVGISVSFTG
ISVSYTGLSV SFTGVSARFT GVSTSFTGVS TSFTGVSTSF TGVDTSFTGV STGFKGVDTS
FTGVATSMVG VSTSITGSSN SVTGVSNSMT GISSSWKDVS MSTTGQSESI TGVSLSYTGT
SNSMTGTSTS VTGTSTSITG TSMSNTGSST SITGTSMSTT GSSVSTTGSS MSATGSSVGT
TGSSVSTTGS KMSVTGFSFS YTGASYEDVG VDLKKLGMQT KN