Gene BURPS668_A0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0636 
Symbol 
ID4888242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp594096 
End bp596834 
Gene Length2739 bp 
Protein Length912 aa 
Translation table11 
GC content59% 
IMG OID640130576 
ProductRhs element Vgr protein 
Protein accessionYP_001061635 
Protein GI126445216 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria
[COG4253] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGCGACC TCTATGAGGC GATCAGTAAG GGTGTCTTGC AGCAGGGGCG GTTGTTGACG 
CTGCATACGC CGCTTGGCAA GAATGCGCTG GTGCCATTAC GTGCGCGTGG ATCGGCCAGA
ATCGGGCGTG ACTATTCGTA CACGATCGAC GTTGCGTCGA CGCGCGATGA CACTGCGCTG
CTATCGTTGA TGCATCAGCC TGTTACCCTG CGGATTCAGC AGATCACGGC TCCCTTTGCG
GTTCCTGTCT ATCGTCCGGT CCACGGCTTC GTGCATCGGG TCGCCTATCT CGGCGGAAAC
GGTGGCCTTT CGACGTACCA GATCGAGTTT TCATCGGCAC TGGTCTTCCT TGAGAAGACT
CACAACGAAG AAGGCTGGCT CGAGAAAGAC GCCCGCGAAA TCATTTCCGA TGTGTTCGAT
CGGTATCCGC AGCTACGCGG ACAGTACCGT TTCACCCTGT CGTCCGATCC TGCCAAGCGT
TCGTGGTGCA GGCAGAGCGA ATCCGATCTT CACTTTGTGA ACCGCTTGGC TGAGGCCGAG
GGCTGGTATT TCTACTGGGT CCACGAAAAC GTTAGCGAAG GTGACGCTCC CAAGACGACG
CTCGTCATCG TTGACCGCGT CTCGACCCTT CCCGACGCGA AGCCGGTCGA ATTTATTCGC
GCGAACCACA GCAACGAGTC TGACGGTTTC GCCCATTGGG CCGCCGTGCA GACGATGCAG
AGTACACGTT ACATGGCGCG TGCATTCGAC TATAAGCGGC CCTCGTCCAA TTTTCAGGTT
GAGAGTGGAC TTGTCGCGAC GAGCTATGTC ACGGAGGAGC GCCGCCAGAA AGTTGAGCAT
AGCATTCCTG ATGCGCCGAT GACGGTCTAC GAATCGACCG CATACGGCTA TTCCAGTTCG
GACAACGGTG AGGTGCGAGC GCGACGGCGA GTGCAAGTGT GGGATGCACA TGCAAGCCGA
TATTTCGGCG TAGGTGGCGT GCGGTGGCTC GATGCTGGTT TGCGGTTCGT GTTGAACGCT
CATCCGCGTC ACGGGGACAG TGACCCGAAG AAGCGAGAGT TTCTGGTGGT CGAGGCACGT
TGGTTCATCG AGAACAATGT ACCGATCGGT CAGCAGATAG CCGAATTTCC ACAAAGTCTG
CGCGCAACAC TGGCTGAGCA GCAAGCGATT CACCGTGAGC GGTTCAAGAC GCCTGGTCAT
GAGGCAGATG GTTCGGCCGG ATTTTTCGTG CTCGAAGTCG AGGCGCAGCC CACGACAGTT
GAATTTCGTA GCCCGCTGGA TCATCCGAAG CCGGTCATGT CGATTGAGCA TGCGCTCGTC
GTCACTCCGG ATGGAGCTGA GGCGTGGACG AACGACCGGA ACCAGATCAG GGTGCACTTC
GCATGGGATC GCAAGAATCC GCCGAATGCC TTCAATTCCT CGCCGTTGCT GTCGTCGCTT
CAATCCGATA CTGGGAACGG CTATGGAGCG GTTCACGTTC CGAGGGCGCG TGAATGGGTC
ATCGTCGGCT ACTGGAACGG CGACTGTGAT AAGCCATTCG TGTTGGGGCG TATCAACGGA
GGCGCAACCC CCTCTCCATG GCATTCAAAC GCGTTGTTGT CCGGGTTCAG GTCGGAGGGA
TTCGGCAAGA CCGGCGCCTA CAACTCGTTT GTCCACGACG ATTCCACGAA TCAAGGCGGG
ACGCGACTGG TCAGCTATAC CGGCAAGAGT TATGCCGCGC TGACGCAGGG CTACCTGATC
AAGCATGACG GCAACACGCG CGGACAGTAT TTGGGCGCTG GCTTCATCCT GCACGCCGAC
GAGTTCGGTG CGGTTCGTGC CAGCAAAGGG TTGTCCATCA GTGCGCATTG GAAATCCTAC
GACGATGAAC AAATGGGCGT CGACGAGGCG CGATCGCGGT TGCAGCAGGC CGGGATGTTG
GTTGAGTCGC TGTCCAGTGC GAGCACGACG GCTCAGGCGG AATCGTTGCA AACCGGACAG
GATGCGCTCA AGTCGCTGTC GAAGGACATT CAGCACCCGG TATCGGGCGA CACGTCCGGC
GGGGTGACGG CGGGCGGCGG TACGGGCAGC GCCAACGGAT TTGCGCAGCC GAACATTGTG
GTATCGACGC CGAAAGACAT TGCACTGGTT GCGGACAGCT CGACGCATAT CGTGGCGGAG
AAAGAGGTCA ACGTCGTCAG CAACGAGAAC ACGTATGTCG CGACGGGCAA GTCGTTCGTC
GTGGCTGCCG CGGAAAAGGT GAGCTTCTTC GCACAGAAAC TCGGCGCGTT CTTTGTGACG
GCGAAAGGTC CGATCAAACT GTCGGCCAAT ACGGATGACG TGAACGTCAA TGCTGGCAAA
GACGTGACCG TGAAGGCCAA GCGCATCGTG CTCGATGCCG ATGAAATCGT CCTCAAGGCG
GGCGGCTCGT ACACGAGGTG GGTGGCCGCC GGTATCGAGG ATGGCACGCA AGGCCCACGC
ACCATCAAAT CGGCGTCGCT CAGCCGTCAG GGGCCGAGTT CGATTGCGCA GCATATGAAC
AGCCTGCCGC AGGCCAAGTT CAACGACCCG TACGTGCTGC GCAACCGCGT CACCGGCGAG
GTGCTGAAAA ACCACCCGTA CGAACTGATC CGCGGGGACG GCACGCGCCT GACCGGGGTG
ACGAACGAGT TGGGGCATAT CGCCGAGCAG AAGAGCGAGG ATATCGAGAA GCTGGCGGTG
CGCGCTTTGC GCCCGAAGCC GAACGGTCCT GCGGCATGA
 
Protein sequence
MRDLYEAISK GVLQQGRLLT LHTPLGKNAL VPLRARGSAR IGRDYSYTID VASTRDDTAL 
LSLMHQPVTL RIQQITAPFA VPVYRPVHGF VHRVAYLGGN GGLSTYQIEF SSALVFLEKT
HNEEGWLEKD AREIISDVFD RYPQLRGQYR FTLSSDPAKR SWCRQSESDL HFVNRLAEAE
GWYFYWVHEN VSEGDAPKTT LVIVDRVSTL PDAKPVEFIR ANHSNESDGF AHWAAVQTMQ
STRYMARAFD YKRPSSNFQV ESGLVATSYV TEERRQKVEH SIPDAPMTVY ESTAYGYSSS
DNGEVRARRR VQVWDAHASR YFGVGGVRWL DAGLRFVLNA HPRHGDSDPK KREFLVVEAR
WFIENNVPIG QQIAEFPQSL RATLAEQQAI HRERFKTPGH EADGSAGFFV LEVEAQPTTV
EFRSPLDHPK PVMSIEHALV VTPDGAEAWT NDRNQIRVHF AWDRKNPPNA FNSSPLLSSL
QSDTGNGYGA VHVPRAREWV IVGYWNGDCD KPFVLGRING GATPSPWHSN ALLSGFRSEG
FGKTGAYNSF VHDDSTNQGG TRLVSYTGKS YAALTQGYLI KHDGNTRGQY LGAGFILHAD
EFGAVRASKG LSISAHWKSY DDEQMGVDEA RSRLQQAGML VESLSSASTT AQAESLQTGQ
DALKSLSKDI QHPVSGDTSG GVTAGGGTGS ANGFAQPNIV VSTPKDIALV ADSSTHIVAE
KEVNVVSNEN TYVATGKSFV VAAAEKVSFF AQKLGAFFVT AKGPIKLSAN TDDVNVNAGK
DVTVKAKRIV LDADEIVLKA GGSYTRWVAA GIEDGTQGPR TIKSASLSRQ GPSSIAQHMN
SLPQAKFNDP YVLRNRVTGE VLKNHPYELI RGDGTRLTGV TNELGHIAEQ KSEDIEKLAV
RALRPKPNGP AA