Gene BURPS668_A0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0166 
Symbol 
ID4888936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp148425 
End bp151217 
Gene Length2793 bp 
Protein Length930 aa 
Translation table11 
GC content65% 
IMG OID640130107 
ProductRhs element Vgr protein 
Protein accessionYP_001061172 
Protein GI126442970 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.275346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAACC ATTTTTCGAA CGGACGGACG AATCAAAGCC GCACGGTAGT GATCCGCAGC 
GGTGCGATGC CGCGGCTGCT CGGTCAGCCC GCGCTCGAGT TCCTGTCGCT GCGCGGTGAA
GAGCACCTCG GAAAACTCTA CACGTACGAA TTGCTCCTGC GCACGCCGGA CGATTTTCAT
GTTCCGTTGG CAACGAGCGC GAATCTCGAC CTGAAGGCGA TGATCGGCAC GGAGATGACG
GTCTGCATTC AGCTCGACGG AATCGGGACG GGCGCGCAAG GCGGCGTTGG CGCGGGTGCG
CGCGAAATCA GCGGGCTCGT GGTCAAGGCG GGCTTCCTGC GCTGCGAGGG GCGCTACAAC
GTCTATCGCA TCGAGCTGCG CCCCTGGCTG TGGCTCGCGA CTCTGACGAG CGACTACAAG
ATTTTTCAGG ACAAGAGCGT CGTCGAAATC ATCGATACGG TCTTGCACGA TTACCCTTAC
CCGGTCGAGA AGCGGCTCGA CATCGACAAG TATTCGGTGG CGGGCGAGAG CGCTCGAAAC
GAGCCGCGCG CGTTCCAGGT GCAATATGGC GAAACCGATT TCGACTTCGT TCAGCGTTTG
ATGGAGGAGT GGGGGATCTA CTGGTTCTTC GAGCATTCGG ACAACAAGCA TCGCCTGGTC
TTGTGCGATC ACATCGGCGG GCATCGCAAG GCGCCGAGCG AGGCCTATCA CGAGATCGCG
CATCACCCGG AAGGCGGGAA GATCGACATC GAGTACATCA ACTATTTCTC GACGGACGAA
GCGCTGCGGC CCGGCCGCGT CGTGATAGAC GATTTCGACT TCACGCGTCC GCTCGCGAGC
CTCGTCACGT CCAATCACCA GCCGCGGGAG ACGAACTGGG GGGAGGGCGA GCTGTTCGAA
TGGCCGGGCG ACTATACCGA TAGCAAGCAT GGCGATCTCA TCAGCCGCGT GCGCATGGAA
GAGCGCCGCG CGACCGGGTC GCGCGCATAC GGTCGGGGCA ACGTGCGCGG CCTCGCCTGC
GGTCATACGT TCGTGCTGTC GAAGCACAAG CACGACGGCG CGAACCGCGA GTACCTCGTC
ATCGAATCGG CGTTGATGCT GACCGAAGTC GCGGACGAAA CGGGCAGCGG CTACCGCTAC
GAATGCGATA ACGAACTGGT CGTGCAGCCG TCGAACGAGG TGTTTCGAAT GCCGCGCGAA
ACGCCCAAGC CGACGACGAG CGGGCCACAG TCCGCGATCG TGGTCGGGCC GCCGGGCCAC
GAGGTATGGA CCGACGAATT CGGCCGCGTG AAGATCCGTT TTCTGTGGGA TCGCTACGCA
CGCAATGACG CAACGGATTC CTGCTGGGTA CGCGTGAGCC AGGCGTGGGC CGGCGTGAAC
TTCGGCGGCA TCTACATTCC GCGGATCGGA CAGGAAGTGA TCGTCGGATT CATGAACGGC
GATCCGGACC GTCCGCTGAT TCTCGGCAGC CTCTACAACA CCATTACGCC GCCGCCTTGG
GATCTGCCCG GCGACGCGAC GAAGAGCGGA TTCAAGAGCA AGTCGATCAC GGGCGGGCGC
GAGAACTATA ACGGCATCCG CTTCGAGGAC AAGCTGGGGG CCGAGGAATT TCACATGCAG
GCGGAAAAGG ACATGAACCG CCTGACGAAG AACGACGAGT CGCATACGGT CGGCGCGAAT
TTTTCGATCG GCGTCGGGCT TACCCATACG CGCGCGGTGG GCGCCATGTT CAGCAGCATC
GTCGGCGGAG CCGCCAGCTA TGCGGTGGGG GGCGCGGAAT CGACGATGAT CGGCGGCGCG
TATGCGTTGA ACGTCGGCGG CGCGCACGCG GTTGCGGTGG GCGGCGCGTC GTCCGTTTCC
GTTGGCGGCG CCTACGCGCG CAACGTGGGC GGCGCGTATG CGCTGACAGT CGGCGGCGTG
CTGTCGATCG TCTGCGGTGC GTCTTCGATC ACCATGACGG CTTGCGGCTC GATCAAGATC
GTCGGCAAGA ATATTCGCAT CATCGGCAGC GACGAAGTCG TCGTGCAAGG CGCGCCCCTG
CAACTGAATC CGGGCGATTC GGATTGCGGC GGAGGGGGCG GCGGCGGAGG CGGCGGCGGC
GCGATTCCGC CGATTCCGTT GCCGTCGTTC TTCCTCGATA TCACGAAGCC GATTCTTCCG
CCGCCGCCGC CGCCACCGAC GGAGGTGCCA CCGGATCCGA CGCCGACGCC GACGCCGACG
CCGACGCCAA CGCCAACGCC GACGCCGACG CCGACGCCAA CGCCAACGCC TACGCCAACG
CCAACGCCGA CGCCGACGCC GACGCCAACG CCAACGCCAA CGCCAACGCC AACGCCAACG
CCTACGCCTA CGCCAACGCC TACGCCAACG CCAACGCCGA CGCCAACGCC AACGCCAACG
CCAACGCCAA CGCCAACGCC AACGCCAACG CCAACGCCAA CGCCAACGCC AACGCCAACG
CCAACGCCAA CGCCAACGCC AACGCCGACG CCGACGCCGA CGCCGACGCC GACGCCGACG
CCCACGCCAA CGCCCACACC GACGCCAACG CCAACGCCAA CGCCAACGCC AACGCCGACG
CCGACGCCAA CGCCAACGCC AACGCCAACG CCAACGCCAA CGCCAACGCC GACGCCGACG
CCGACGCCAA CGCCAACGCC AACGCCAACG CCAACGCCAA CGCCAACGCC GACGCCAACG
CCAACGCCGA CGCCGACGCC GACGCCGACG CCCACGCCGA CGCCGACGCC AACACCAACA
CCAACGCCAA CTCCGACTAG TTCCGAGATT TAA
 
Protein sequence
MPNHFSNGRT NQSRTVVIRS GAMPRLLGQP ALEFLSLRGE EHLGKLYTYE LLLRTPDDFH 
VPLATSANLD LKAMIGTEMT VCIQLDGIGT GAQGGVGAGA REISGLVVKA GFLRCEGRYN
VYRIELRPWL WLATLTSDYK IFQDKSVVEI IDTVLHDYPY PVEKRLDIDK YSVAGESARN
EPRAFQVQYG ETDFDFVQRL MEEWGIYWFF EHSDNKHRLV LCDHIGGHRK APSEAYHEIA
HHPEGGKIDI EYINYFSTDE ALRPGRVVID DFDFTRPLAS LVTSNHQPRE TNWGEGELFE
WPGDYTDSKH GDLISRVRME ERRATGSRAY GRGNVRGLAC GHTFVLSKHK HDGANREYLV
IESALMLTEV ADETGSGYRY ECDNELVVQP SNEVFRMPRE TPKPTTSGPQ SAIVVGPPGH
EVWTDEFGRV KIRFLWDRYA RNDATDSCWV RVSQAWAGVN FGGIYIPRIG QEVIVGFMNG
DPDRPLILGS LYNTITPPPW DLPGDATKSG FKSKSITGGR ENYNGIRFED KLGAEEFHMQ
AEKDMNRLTK NDESHTVGAN FSIGVGLTHT RAVGAMFSSI VGGAASYAVG GAESTMIGGA
YALNVGGAHA VAVGGASSVS VGGAYARNVG GAYALTVGGV LSIVCGASSI TMTACGSIKI
VGKNIRIIGS DEVVVQGAPL QLNPGDSDCG GGGGGGGGGG AIPPIPLPSF FLDITKPILP
PPPPPPTEVP PDPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT
PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT
PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT PTPTPTPTPT
PTPTPTPTPT PTPTPTPTPT PTPTPTSSEI