Gene BURPS668_A2130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2130 
Symbol 
ID4887318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2067163 
End bp2070186 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content69% 
IMG OID640132067 
ProductRhs element Vgr protein 
Protein accessionYP_001063124 
Protein GI126444285 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCGT CCCCCCGCCA CGATGCGCCC GCGTCGCGCG CGAACGCCGC GCCGTCCGCG 
AACGCGCGAC GCTTCACGTT CGCGAGCGAC GCGTACGACC CCGCGACGTT CGACGTCGTC
GACATCAACG GCCGCGACGC GATCTCGCAG CCGTACCGGT TCGAGATCAC GCTCGTGAGC
AGGCAGTTGC GGATCGACTT CGCGAAGATG CTGAGCCGCG GGGCGACGCT CGCGATCCTG
CCGCCGTTCG GCGAGGCCGG CACCACCCGC TATGCCGGCG TGCTCGCCGA ATTCGAGCAG
AAGAAGCGCT TTCGCGACTT CACCGTCTAT CGCGCGACGC TCGTGCCGCG CCTCTGGCGA
CTATCGCTGT ACAAGGCGTC GGACGTCTAT CTGAACGAGC AGACGATTCC CGACATCGTC
AAGCGCGTGC TGCGCGCCGC CTCGTTCGGC AGCCGCGATT TCCGCATGCG GCACGGCGGC
GGCTACCGCA AGCGCAGCTT CGTCTGCCAG TACGACGAGA GCCATCTCGA TTTCGTGTCG
CGCTGGATGG AGAAGGAAGG CCTCTACTAC TACTTCGAGC ATGACGGCCG GCACGAAACG
CTCGTGATCG TCGACGACCG CCGCCATCAG CCCGGCCCCG CCGACGATCT CGCGCTGCGC
TACCGACCCG CGACCAGCCT CGACGCGGGC ATCGAAGCGG ACCGCGTGCA GGCGTTCACA
TGCCGGGCGA CGCCGCTGCC GCGCGAAGTC GTGCTGCGCG ATTTCAACCA CCGCAAGGCG
GAGCTCTCGC TCGAAGTCCG CGAGCGCGTG GCGCGCGACG GCGTCGGCGA GCGGGTGTCG
AGCGACGAGC ACTTCCACAC GAAGGACGAA GGGCGGCGCT ACGCGAAGCT GCGCGCCGAG
GCGCTCGTCT GCGAAGGGCG CCGATTCGCC GGCGAATCGA CCGCGGCCGG GCTGCGCGCC
GGCCGCTTCT TCGCGCTGTC GGGCCACTAC CGCGAGGATT TCGACGGCCG CTATCTGGTG
ACGGCGCTCA CGCATCGCGG CTCGCAGGCA CACCTGATGT TTCCCGATCT CGACGCGCCG
TTCGGCGCGA CGCCGGGCGA GCCCATCTAC CGCGCCGAGT TCGAGGCGAT TGCCGCCGAC
CTCCAGTACC GGCCGCCGCG CACGACGCCC AAGCCGCGCG CGGCGGGCGT CGTCAGCGCG
ATCGTCGACG GCGAGGGCGG CGGCAAGCTC GCGGAGCTCG ACGAATACGG CCAGTACAAG
GTGCGCTTTC CGTTCGCGCA CACCGCGCAT CCGGCGAACA AGGCCTCCGC GCGCATCCGG
ATGGCGACGC CCTACGCGGG CGACGACCGC GGCATGCACC TGCCGCTGGT GAAGCGCACC
GAAGTGAAGA TCGCATTCGA CGGCGGCGAT CCGGACCGCC CCGTGATCGT CGGCGCGGTG
CCCAACTCGT CGCACCGCAG CGTCGTCACG CGCCGCAACC CCGCCGAGCA TCGCATCCTC
ACCGAGCACA ACCAGCTCTA CATGAAGGAC GGCAGCGGCG CGGCGACGTG GCTGCACGCG
CCGAACAACC ATATCGGCAT CGGCGCGGTC GGGCCGGGCG ACGGCCTCGC GCTCCTCACG
TCCGGCAACA AGTTCGACTT CTCGCTCGGC AACGCGTACA GCTTCTCGGG CGGGCTCAAG
TGCTCGGTGT CGATGGGCGG CAACACCGAC GTCTACGTCG GCGTGCGCAA CAGCCTCGAC
GTCAGCGCGA ACTTCCTGAC GACGCTGCAG GGCAACCTGC GCTGGATGCT GCCCGGCAGC
CGGAGCTTCG AGATCAACGA CAGCGCATCG ACACTGCTGC AGACGCTGCA CAAGCAGTCC
GCGACGGGCG CGATCCGGCT GTCCGCCGGG CAGGACGCGT CCGCGCTGCT GCAAAAGCAG
CTCGACAAGC TCAAGGGCAC GGTGCGCAAG TTCATGATCG TGTCGGGCCT CGCGAACGCC
GGGTCCGCGG CCGCCGCCGC GGGGCTCATC AAGGGCGGCG GCAAGCTCGC CGATCTGCCG
TGGGCGGGCT TCGGCATATC CGCCGCGCAG TTCGCCGGCG CGACCGGCGT CAGCACGGCG
CTGATGGCGA CCTCGCGCAC GCTGCTCGCG AACGTCGCGA AGCTCCAGGA GGCGTTGCCG
CTCGTCGCCG ATCTGTCGCT CGGCAAGCAG GGCATCGCGC TCGCCGCGAA GAACCTCACG
CACGCGACGC GGATGTCGCT CACCGTCGAC GGCGTCTCGT GGTCGGCGCA CGCGAAGGGG
CCCGGCGCGG CGGGCGCCGC GATGAGCGTC GGCAAGGGCC GCTGGGGCGT CGAAGCGGCG
AAGCACGCGC ACGTCCACGC GAGCGACACG CTGCTGTTCG CGGTGCCGGC CGACCCGACG
ACCCAGTTCG ACCTGAAGGA CCTGATCGGG CTGCGCCGCG ATCTCGACGA ATGCATGAAG
GACATCGCCG ATCTCGAAGC CGACATTTCG GAAAACGAAG TGCTGTCGAC CGATCAGAAC
ACGTTCGGCG TCAGCGCGCT CATCCCCACG CCGCCGTCGC CCGCCGGCGC GCTCGCGGCG
GTCGCGATCA AGGTGAAGCA AGCGAAGCTC GTCGAGCTGA AGGCCAAGCA GAAGCTCGTC
GCGCTGAAGG TCGACAACCT GCAGCAGAAG TTCGCGAAGC ACGTGCAGCA CCTGAGCGCC
GTGCGGATGA GCGCTTCCGA CGCGCAGCTC GGCTTCAAGG GCAACCGGCT CGTCGCGACG
GCCGACGGCG TCACGCTCGC GCATGCGCAG GGCAAGGCGA AGCTCGACGT GCGCGAAGCG
AAGATCGGCG TCACGGCGGG CAAATCGAGC GTCGAGCTCG ACGAAGGCAA GATCGCGGCC
GGCTGCGGCA GCGCATCGCT GAAGCTCGGC AGCGACGGCG CGATCGACGT GAGCGCGACC
GACGTCAAGC TGAACGGCAC CAACGTCAAG CTGAACGGCA GCGCGTCGCT GAAGTTCGAC
GGCCAACTGA TCCAGCTCGG CTGA
 
Protein sequence
MPSSPRHDAP ASRANAAPSA NARRFTFASD AYDPATFDVV DINGRDAISQ PYRFEITLVS 
RQLRIDFAKM LSRGATLAIL PPFGEAGTTR YAGVLAEFEQ KKRFRDFTVY RATLVPRLWR
LSLYKASDVY LNEQTIPDIV KRVLRAASFG SRDFRMRHGG GYRKRSFVCQ YDESHLDFVS
RWMEKEGLYY YFEHDGRHET LVIVDDRRHQ PGPADDLALR YRPATSLDAG IEADRVQAFT
CRATPLPREV VLRDFNHRKA ELSLEVRERV ARDGVGERVS SDEHFHTKDE GRRYAKLRAE
ALVCEGRRFA GESTAAGLRA GRFFALSGHY REDFDGRYLV TALTHRGSQA HLMFPDLDAP
FGATPGEPIY RAEFEAIAAD LQYRPPRTTP KPRAAGVVSA IVDGEGGGKL AELDEYGQYK
VRFPFAHTAH PANKASARIR MATPYAGDDR GMHLPLVKRT EVKIAFDGGD PDRPVIVGAV
PNSSHRSVVT RRNPAEHRIL TEHNQLYMKD GSGAATWLHA PNNHIGIGAV GPGDGLALLT
SGNKFDFSLG NAYSFSGGLK CSVSMGGNTD VYVGVRNSLD VSANFLTTLQ GNLRWMLPGS
RSFEINDSAS TLLQTLHKQS ATGAIRLSAG QDASALLQKQ LDKLKGTVRK FMIVSGLANA
GSAAAAAGLI KGGGKLADLP WAGFGISAAQ FAGATGVSTA LMATSRTLLA NVAKLQEALP
LVADLSLGKQ GIALAAKNLT HATRMSLTVD GVSWSAHAKG PGAAGAAMSV GKGRWGVEAA
KHAHVHASDT LLFAVPADPT TQFDLKDLIG LRRDLDECMK DIADLEADIS ENEVLSTDQN
TFGVSALIPT PPSPAGALAA VAIKVKQAKL VELKAKQKLV ALKVDNLQQK FAKHVQHLSA
VRMSASDAQL GFKGNRLVAT ADGVTLAHAQ GKAKLDVREA KIGVTAGKSS VELDEGKIAA
GCGSASLKLG SDGAIDVSAT DVKLNGTNVK LNGSASLKFD GQLIQLG