Gene BURPS1106A_A0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0100 
Symbol 
ID4905828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp88347 
End bp91127 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content69% 
IMG OID640143207 
ProductRhs element Vgr protein 
Protein accessionYP_001074143 
Protein GI126457812 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCAATT TGAACGATAC GCTACGCAAT TTTGCGTCGG GGGCGGTCGA CTGGAATAAA 
CGTCCGGTCG CGTTGCACTT TGGCGCCGCG CAGGCCGCGC TGGGCCACCT CCTCGCGCTG
CAGCACGCCA GTGTTCAGGA AGGCCTGATG ACCGGGATCC ACGGCCGATT GACCTGCGTG
TCGACCCGCC GCGACCTTCC GCCCGGCGTG TTGCTCGGCA TTCCGGTTTC GATCCGGCTC
ATTACCGACC GCGGACAGCC GCACACGGTG AACGCGATCA TCAGCGGCGT CCAGATCGGC
CAAAGCGACG GCGAGCTCTG TGTGTACCAG CTGACGGTCT GCGACGCGCT GTCGCTGATG
GACAAGCGCA CCAATTCGCG GGTCTTCCGA AAGCGCAGCG TCATCGATGT GCTCGCCACG
CTGTTCAACG AATGGCAGCA GCGCAGCCCG GCCCTCGCGC GCGCGTTCGA ATTCGATCTG
TCCGGCTTGC GCGCCGATCG CTATCCGCCC CGCGAGCTGA CCCGGCAGGT CAACGAATCG
GATGCGCATT TCGTGCGCCG TCTGCTGCGC CGCGAAGGGA TCACCGTGTT CGCGAAGGCG
GGGCCGGCGA AGGGCGAACG GCCGTTGCAG GGCGACGCGC CCGTGCACAC GCTCGTGTGC
TGTGACGATC CGATGTCGTT GCCGCAAGCG CCGGCCGGCA CGGTCCGCTT GCATCCGCGC
GACGGCGGCG CCGCGCAGCG CGACACGGTC ACGCTGTTCG CGCTGCGTCG GCAATTGGCG
CCCGGCAAGG CCGGGCGCCC GTCGTGGGAC TACAAGAAGG CGCGGATCGA CGAATCGAGC
GTCGCTTCGA GCCTCGATCA GGGCGAGGCG GGCAACGATC TGGCGAAGCT GCTGACCGAC
ATCGCGATCG ACATTGCGCA CGCGGGCGAT TCATGGCGCG ATCACGAGCG GCTCACCCGC
GCGCGCATGC TCGCGCACGA GTTCGAAGCC GAGCGCCATG ACGGCGTCAG CAGCGTGCGG
GATCTCGCCG TGGGCACATG GATCACGCTG ACGGGCGATC CGCAATGGGA CAGGCAACGC
GCCGACAAGC GTCAGTTCGT GATCACGTCG ATCGATCACG ACATCTGGAA CAACCTGCCG
AAGGGGCTCA ACGAGCGCGT GCACGCGCTG TTCGCCGCGA GCCGCAATCT CGCGTGCGCG
CCCCGCGCGC TGCCGTCCGC GCTGGCGAAC GACGCGGATA CCCGCTACGA GAACACGTTC
GCGTGCGTGC GCCGCGGCGT GCCGCTTGCG CCCGCGTACG ATCCGCAAGC CGATTTGCCG
CCCGCGCATC TGCTCACGGG CACGATTGTC GGCGCGGAGG GCGAAGAAGT GTTCTGCGAC
GAAGACGGCC GGGTGCGCGT GCGGGTGCAC GGCCTCGATC CGGCGGATCA CGCGCACGCG
CAGGGCGCGG GCACCAACGG CAACGCGGGC GACAGCGCGC CGATCCGCGT GGCGTCGAGC
CTCGCCGGCG CCTATTTCGG CGCATCGTTT CTGCCGCGAG TCGGCATGGA AGTCCTTCTC
GGGTGTCTCG GCGGCGATCC GGACCGGCTG GTGATCATCG GCGTGCTCGG TAACGGCGCG
CATCCGCCGG CGACGTTCAG CCACGCGGGC GGGCTGCCGG GCAACCGCTA CCTGTCGGGC
ATCAAGACGA AGGAGATTCG TGGGCAACGG TACAACCAGC TGCGTCTCGA CGACACGCCG
AACCAGATCA GCGCGCAACT GGCGAGCGAG CACGCGCATT CGCAGCTCAA TCTCGGATAT
CTGACGCAAC CGCGCGAGAA CGGCCACGGG AACGACCGCG GCGAGGGCGT GGAGTTGCGT
ACCGACGCGG CGGCGGCGCT GCGGGCGGCG CAAGGCATGC TGCTGACGAC CTACGCGCGC
ACGCAGGCGA GCGGCGGGCA ACTGGACCGT GACGAGCTGA TTCGGTTGCT CGGCGAATGC
GCGGAGCTGT TCAAGGCGCT GGGCGACTAC GCGGGGCAGC ACGGCGGGCA GGCCGTGGAT
ACGGCCGGCC AGCACGCGGT GGCCGCCGCG TTCAAGCGCT GGGCGCCGGG CACGGACGGC
GCCGATGCGC CGTCCGACGG CGCAGCGCGC GCGCTGATGG CGTTCGGCGC GCAGGCCGGT
TCGGTGAACG TCACGCCGAA GACGCATGTG ACGTATGCCG GCGAGAACAT CGATCAGGTC
GCGCAGCAGC ACCTGCAACT GATGAGCGGC CAGCGGCTGA ACGCGACGGC CGGGCAGGGC
ATGCAGCTCT TCGCGCGGGG CGCGGGGGTG CAGGCCGTGG CGGGCGAAGG GCCGATGCTG
CTGCAGGCGC AAGCCGGCAC GCTGACGGCG AACGCGCAGA AGGGCGTCAA GATCACGACG
AACGAGCACG AGGTGTTCGT GAGCGCGCCG AAGATTCGGC TCGTTGCCGA GGACGGCAGC
TACCTCGAGC TCGGCGGCGG CATCACGCTC GGCACGAACG GCGACATCAA GCTGCTGTCG
GCGTCGCACC AGTGGGGCGG GCCGTCGACC GCGCAGGCGG CGAAGAGCGG GTTCGGCAAT
CAGCCGACGG ATCAGCGTTT CAAGCTGCAC TATCCGGGCG AGGACGGCGA TTTGCAGGCG
GCGGCGAACA AGCGGTTCCG GATCACGCTG GACGACGGGC GCGTCATCGA AGGCAAGACC
GACGCGAGCG GCCTGACGGA TCTGGTCAAG GACGACGCGA TGCGTATCGC GAAGATCGAC
TATCTGAAGC CGAAGCTCTG A
 
Protein sequence
MTNLNDTLRN FASGAVDWNK RPVALHFGAA QAALGHLLAL QHASVQEGLM TGIHGRLTCV 
STRRDLPPGV LLGIPVSIRL ITDRGQPHTV NAIISGVQIG QSDGELCVYQ LTVCDALSLM
DKRTNSRVFR KRSVIDVLAT LFNEWQQRSP ALARAFEFDL SGLRADRYPP RELTRQVNES
DAHFVRRLLR REGITVFAKA GPAKGERPLQ GDAPVHTLVC CDDPMSLPQA PAGTVRLHPR
DGGAAQRDTV TLFALRRQLA PGKAGRPSWD YKKARIDESS VASSLDQGEA GNDLAKLLTD
IAIDIAHAGD SWRDHERLTR ARMLAHEFEA ERHDGVSSVR DLAVGTWITL TGDPQWDRQR
ADKRQFVITS IDHDIWNNLP KGLNERVHAL FAASRNLACA PRALPSALAN DADTRYENTF
ACVRRGVPLA PAYDPQADLP PAHLLTGTIV GAEGEEVFCD EDGRVRVRVH GLDPADHAHA
QGAGTNGNAG DSAPIRVASS LAGAYFGASF LPRVGMEVLL GCLGGDPDRL VIIGVLGNGA
HPPATFSHAG GLPGNRYLSG IKTKEIRGQR YNQLRLDDTP NQISAQLASE HAHSQLNLGY
LTQPRENGHG NDRGEGVELR TDAAAALRAA QGMLLTTYAR TQASGGQLDR DELIRLLGEC
AELFKALGDY AGQHGGQAVD TAGQHAVAAA FKRWAPGTDG ADAPSDGAAR ALMAFGAQAG
SVNVTPKTHV TYAGENIDQV AQQHLQLMSG QRLNATAGQG MQLFARGAGV QAVAGEGPML
LQAQAGTLTA NAQKGVKITT NEHEVFVSAP KIRLVAEDGS YLELGGGITL GTNGDIKLLS
ASHQWGGPST AQAAKSGFGN QPTDQRFKLH YPGEDGDLQA AANKRFRITL DDGRVIEGKT
DASGLTDLVK DDAMRIAKID YLKPKL