Gene BURPS668_2142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2142 
Symbol 
ID4884723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2132026 
End bp2134911 
Gene Length2886 bp 
Protein Length961 aa 
Translation table11 
GC content59% 
IMG OID640128070 
ProductRhs element Vgr protein 
Protein accessionYP_001059177 
Protein GI126441245 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria
[COG4253] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGTCGCT TCTCTGACGG TAGCTCGAAC CTGAGAGAAG GACGACAAGA AATGGTACTC 
AGACCACCCG CAATCGGGGA CCTGAAGAAT CGCGACCTGT ACGAGGCCGT TCATCGCGGT
TTCTTGCAGC ACGACCGGCT GTTGATGCTG GATACGCCAC TCGGCAAGAA TGCGTTGACT
CCGTTGCGGG CGCGCGGATC GTCCGGGATT GGCGGTGACT ACCATTGGAC AATCGACGTT
GCGTCGCTGC GTGACGATAC GGCGCTGCTG TCGCTGATGC ATCAGCCCGT CACCCTGTGG
ATTCAAGAGC GTACAGCGCT GTACGCAGAT TCGGTCTATC GACCCGTACA TGGCTTCGTG
CATCAAGTTG GATATTTGGG CGGGGACGGT AGCGTATCCA CGTACCAGCT TGAATTTTCA
TCCGCGCTGA TTTTCCTGAG CAAGACCCAC AACGATGAAG GGTGGCTCGA AAAGGACGCG
CGCGAAATCA TTTCCGACGT ATTCAACCGC TACCCGCAGC TGCAGGGGCG GTTCCGGTTC
GACCTGACGC GCGAACCTGC GGTGCGCTCA TGGTGCCGTC AGAGCGAATC TGATCTTCAT
TTCGTGCACC GCCTTCTCGA AGATGAAGGC ATGTATTTCC GTTGGGTCCA CGAACAGACG
AAGGAAGGCG AGCCGCCGAA AACGACGCTC GTTATCGTGG ACCGCGTATC GTCGCTGCCG
GAAGCAAAAC CAGCCGAATA TTACCGTGGC AACACTGACC ACGAAGCAGA CGGGTTCACG
CAGTGGGCGG TCATGCAGAC CATGCAGAGC CTGCGTTATA TGTCGCGTGC GTTCGACTAT
AAGCGCCCGA CATCCCATTT CCAGACGGAA AGCGCCCTTC AATCCACGAC GTACGCAACG
GACGGCGGGC GGCAGTCGGA ATCGCGCAGC ATCCCTGCTG CACCGATGAC GATCTATCAG
CCGACAGCGT ACGGCTACCC CGATTCAGAC AGCGGCGAAG GCCGTGCCCG TCGTCGTGTT
GAAGAGTGGG ATTCGCGTGC GCGTCGCTAT TTCGGTGTTG GCGGTCTGCG ATGGCTCGAT
GCCGGATCGC GATTCACGTT AGACAATCAT CCGCGTCATC CCGACAGCGA TCCGAAGAAG
CGGGAATTTC TCGTGATCGA GGCGCGTTGG TTCATCGAAA ACAATGTGTC GATTGGTCAA
CAAGCGACGG AGTATCCGCG CAGTCTGCGC GCGACGCTGG CCGAGCAGCA GGCAGCACAC
GGGACGCGCT TCAAGACGCC GGAACACCCG CAAGACGGAA CAGCGGGATT TTTCGTCATC
GAGGTTGAGG CGCAAGAGGC GAACATCGAA TATCGCAGCC CGCTCGCGCA TCCCAAGCCG
AATATGGCTA TCGAGCATGC GATCGTTGTG ACGCAGCACG GATCGGAAGC ATGGACGAAC
GAGCGGAACC AGGTGCGTGT GCACTTCGCT TGGGACCGGA AAAATCCGGA CGGCACGTTC
GCATCGTCGC CGCTGCTATC GACCATGCAG GCTGATACTG GCAACGGCTA CGGCTCGGTG
CACGTGCCAC GTGCCGGTGA ATGGGTTTTA ATCGCCTATT GGGCAAACGA TTGCGACAAA
CCGTTCATTT TGGGGCGCGT CAATGGCGGT ACGACGCCTT CTCCGTGGCA TTCGAATGTG
CTGCTGTCGG GATTTCAGTC ACAGGGGTTC GGCGGAACGG GCGCGTTCAA CGCGTTCGTG
CACGATGACG CCACGAACCA AGGCGGTACG CGGCTCGTCA GCTACACAGG CAGCAGCTAT
GCATCGATCG CGCAGGGATA CTTGATTCAG CAGAGCGGTA ACTCGCGCGG GCGGTATCTC
GGTTCGGGAT TGCTGTTGCA CGCCGATCAT TACGCATCGG TACGCGGTAG CCGTGGCGTG
TCGATCAGTG CGCATCCCGT GTCGCGCGAC AGTGATCAAC TCGATGTTGA CGAAGCACGC
GAGCAGCTTA CACGCTCAAA AGACCTGCTC GGCAGCATTT CGGATGCGAG CGAACAGCAT
CAGGCCGAGA GCCTGAAACT CGGTGTCGAT GCGCTGGCGA CGTTCACCGA CGCAACGAAG
CAAGCGGCAT CCGGCGAATC ATCCGGAGGT CGGACGGCAG GCGGCGGCAC GGGTAACGCG
AACGCGTTCG CTGAGCCGCT TTTGCTGCTC GGCAGTCCGG CCGGTATTGG CCTGTCCACA
CATCAGTCGC TACACGCGAG CGCCGACCAA CAGGCGAACT GGATCAGCGG ACAGGATTCA
TATTTTGCGG CGGGTGGATC GATGCACGCG GCTGCGGTAA ATCACCTGAG CCTGTTCGCG
CGCAACCAGG GGATCAAGGC TGTCGCTGGT AAGGGCAAGG TCGAAATACA GGCGCAGGCT
GGTGATCTGG AAATGATCGC GCAGCAGCTT GTGAAATTGC TGTCGGTAGC CGGACGGATG
GAAATTGCGG CTGATCAGGA ACTAGTCCTG TACTGCGGCG GCGCGACGAT CCGCATCAAG
GGCGGCAACG TCTCGATACA CGCACCGGGC AATGTTGACT TCAAGGGCGC GTCGTTCAGC
TTCGCGGGGC CGGTCAGTGA GTCCTACGCG ATGCCGCAGT TCAAACCGTC GTATCAGGCG
CGGTATGTCT TGAAGAATCA GACAGACGGT ACACCGATGA TTCGACACGC TTACGAAATG
AAGCTGCCAT CCGGGCGAAC GGTGCTCGGC CACACGAACG ACCTTGGCGA GACAGTGCCG
GTTTTCACGC CGAGCGCACA GGATGTGCCG TTGCAAGCCG CAAAGGCGAA GCCTGCTCAG
GTCGAATCGT GGCAGTTTGC GGGCTCGGAC AAACCGATGA TTCAACGCGA CTACTTGGAG
GACTGA
 
Protein sequence
MCRFSDGSSN LREGRQEMVL RPPAIGDLKN RDLYEAVHRG FLQHDRLLML DTPLGKNALT 
PLRARGSSGI GGDYHWTIDV ASLRDDTALL SLMHQPVTLW IQERTALYAD SVYRPVHGFV
HQVGYLGGDG SVSTYQLEFS SALIFLSKTH NDEGWLEKDA REIISDVFNR YPQLQGRFRF
DLTREPAVRS WCRQSESDLH FVHRLLEDEG MYFRWVHEQT KEGEPPKTTL VIVDRVSSLP
EAKPAEYYRG NTDHEADGFT QWAVMQTMQS LRYMSRAFDY KRPTSHFQTE SALQSTTYAT
DGGRQSESRS IPAAPMTIYQ PTAYGYPDSD SGEGRARRRV EEWDSRARRY FGVGGLRWLD
AGSRFTLDNH PRHPDSDPKK REFLVIEARW FIENNVSIGQ QATEYPRSLR ATLAEQQAAH
GTRFKTPEHP QDGTAGFFVI EVEAQEANIE YRSPLAHPKP NMAIEHAIVV TQHGSEAWTN
ERNQVRVHFA WDRKNPDGTF ASSPLLSTMQ ADTGNGYGSV HVPRAGEWVL IAYWANDCDK
PFILGRVNGG TTPSPWHSNV LLSGFQSQGF GGTGAFNAFV HDDATNQGGT RLVSYTGSSY
ASIAQGYLIQ QSGNSRGRYL GSGLLLHADH YASVRGSRGV SISAHPVSRD SDQLDVDEAR
EQLTRSKDLL GSISDASEQH QAESLKLGVD ALATFTDATK QAASGESSGG RTAGGGTGNA
NAFAEPLLLL GSPAGIGLST HQSLHASADQ QANWISGQDS YFAAGGSMHA AAVNHLSLFA
RNQGIKAVAG KGKVEIQAQA GDLEMIAQQL VKLLSVAGRM EIAADQELVL YCGGATIRIK
GGNVSIHAPG NVDFKGASFS FAGPVSESYA MPQFKPSYQA RYVLKNQTDG TPMIRHAYEM
KLPSGRTVLG HTNDLGETVP VFTPSAQDVP LQAAKAKPAQ VESWQFAGSD KPMIQRDYLE
D