Gene BURPS1710b_A0207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0207 
Symbol 
ID3692917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp310141 
End bp312906 
Gene Length2766 bp 
Protein Length921 aa 
Translation table11 
GC content69% 
IMG OID637730461 
ProductRhs element Vgr protein 
Protein accessionYP_335366 
Protein GI76818982 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0113608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCAATT TGAACGATAC GCTACGCAAT TTTGCGTCGG GGGCGGTCGA CTGGAATAAA 
CGTCCGGTCG CGTTGCACTT TGGCGCCGCG CAGGCCGCGC TGGGCCACCT CCTCGCGCTG
CAGCACGCCA GTGTTCAGGA AGGCCTGATG ACCGGGATCC ACGGCCGATT GACCTGCGTG
TCGACCCGCC GCGACCTTCC GCCCGGCGTG TTGCTCGGCA TTCCGGTTTC GATCCGGCTC
ATTACCGACC GCGGACAGCC GCACACGGTG AACGCGATCA TCAGCGGCGT CCAGATCGGC
CAAAGCGACG GCGAGCTCTG TGTGTACCAG CTGACGGTCT GCGACGCGCT GTCGCTGATG
GACAAGCGCA CCAATTCGCG GGTCTTCCGA AAGCGCAGCG TCATCGATGT GCTCGCCACG
CTGTTCAACG AATGGCAGCA GCGCAGCCCG GCCCTCGCGC GCGCGTTCGA ATTCGATCTG
TCCGGCTTGC GCGCCGATCG CTATCCGCCC CGCGAGCTGA CCCGGCAGGT CAACGAATCG
GATGCGCATT TCGTGCGCCG TCTGCTGCGC CGCGAAGGGA TCACCGTGTT CGCGAAGGCG
GGGCCGGCGA AGGGCGAACG GCCGTTGCAG GGCGACGCGC CCGTGCACAC GCTCGTGTGC
TGTGACGATC CGATGTCGTT GCCGCAAGCG CCGGCCGGCA CGGTCCGCCT GCATCCGCGC
GACGGCGGCG CCGCGCAGCG CGACACGGTC ACGCTGTTCG CGCTGCGTCG GCAATTGGCG
CCCGGCAAGG CCGGGCGCCC GTCGTGGGAC TACAAGAAGG CGCGGATCGA CGAATCGAGC
GTCGCTTCGG GCCTCGATCA GGGCGAGGCG GGCAACGATC TGGCGAAGCT GCTGACCGAC
ATCGCGATCG ACATTGCGCA CGCGGGCGAT TCATGGCGCG ATCACGAGCG GCTCACCCGC
GCGCGCATGC TCGCGCACGA GTTCGAAGCC GAGCGCCATG ACGGCGTCAG CAGCGTGCGG
GATCTCGCCG TGGGCACATG GATCACGCTG ACGGGCGATC CGCAATGGGA CAGGCAACGC
GCCGACAAGC GTCAGTTCGT GATCACGTCG ATCGATCACG ACATCTGGAA CAACCTGCCG
AAGGGGCTCA ACGAGCGCGT GCACGCGCTG TTCGCCGCGA GCCGCAATCT CGCGTGCGCG
CCCCGCGCGC TGCCGTCCGC GCTGGCGAAC GACGCGGATA CCCGCTACGA GAACACGTTC
GCGTGCGTGC GCCGCGGCGT GCCGCTTGCG CCCGCGTACG ATCCGCAAGC CGATTTGCCG
CCCGCGCATC TGCTCACGGG CACGATTGTC GGCGCGGAGG GCGAAGAAGT GTTCTGCGAC
GAAGACGGCC GGGTGCGCGT GCGGGTGCAC GGCCTCGATC CGGCGGATCA CGCGCACGCG
CAGGGCGCGG GCACCAACGG CAACGCGGGC GACAGCGCGC CGATCCGCGT GGCGTCGAGC
CTCGCCGGCG CCCATTTCGG CGCATCGTTT CTGCCGCGAG TCGGCATGGA AGTCCTTCTC
GGGTGTCTCG GCGGCGATCC GGACCGGCTG GTGATCATCG GCGTGCTCGG TAACGGCGCG
CATCCGCCGG CGACGTTCAG CCACGCGGGC GGGCTGCCGG GCAACCGCTA CCTGTCGGGC
ATCAAGACGA AGGAGATTCG TGGGCAACGG TACAACCAGC TGCGTCTCGA CGACACGCCG
AACCAGATCA GCGCGCAACT GGCGAGCGAG CACGCGCATT CGCAGCTCAA TCTCGGATAT
CTGACGCAAC CGCGCGAGAA CGGCCACGGG AACGACCGCG GCGAGGGCGT GGAGTTGCGT
ACCGACGCGG CGGCGGCGCT GCGGGCGGCG CAAGGCATGC TGCTGACGAC CTACGCGCGC
ACGCAGGCGA GCGGCGGGCA ACTGGACCGT GACGAGCTGA TTCGGTTGCT CGGCGAATGC
GCGGAGCTGT TCAAGGCGCT GGGCGACTAC GCGGGGCAGC ACGGCGGGCA GGCCGCGGAT
ACGGCCGGCC AGCACGCGGT GGCCGCCGCG TTCAAGCGCT GGGCGCCGGG CACGGGCACG
GACGGCGCAG CGCGCGCGCT GATGGCGTTC GGCGCGCAGG CCGGTTCGGT GAACGTCACG
CCGAAGACGC ATGTGACGTA TGCCGGCGAG AACATCGATC AGGTCGCGCA GCAGCACCTG
CAACTGATGA GCGGCCAGCG GCTGAACGCG ACGGCCGGGC AGGGCATGCA GCTCTTCGCG
CGGGGCGCGG GGGTGCAGGC CGTGGCGGGC GAAGGGCCGA TGCTGCTGCA GGCGCAAGCC
GGCACGCTGA CGGCGAACGC GCAGAAGGGC GTCAAGATCA CGACGAACGA GCACGAGGTG
TTCGTGAGTG CGCCGAAGAT TCGGCTCGTT GCCGAGGACG GCAGCTACCT CGAGCTCGGC
GGCGGCATCA CGCTCGGCAC GAACGGCGAC ATCAAGCTGC TGTCGGCGTC GCACCAGTGG
GGCGGGCCGT CGACCGCGCA GGCGGCGAAG AGCGGGTTCG GCAATCAGCC GACGGATCAG
CGTTTCAAGC TGCACTATCC GGGCGAGGAC GGCGATTTGC AGGCGGCGGC GAACAAGCGG
TTCCGGATCA CGCTGGACGA CGGGCGCGTC ATCGAAGGCA AGACCGACGC GAGCGGCCTG
ACGGATCTGG TCAAGGACGA CGCGATGCGT ATCGCGAAGA TCGACTATCT GAAGCCGAAG
CTCTGA
 
Protein sequence
MTNLNDTLRN FASGAVDWNK RPVALHFGAA QAALGHLLAL QHASVQEGLM TGIHGRLTCV 
STRRDLPPGV LLGIPVSIRL ITDRGQPHTV NAIISGVQIG QSDGELCVYQ LTVCDALSLM
DKRTNSRVFR KRSVIDVLAT LFNEWQQRSP ALARAFEFDL SGLRADRYPP RELTRQVNES
DAHFVRRLLR REGITVFAKA GPAKGERPLQ GDAPVHTLVC CDDPMSLPQA PAGTVRLHPR
DGGAAQRDTV TLFALRRQLA PGKAGRPSWD YKKARIDESS VASGLDQGEA GNDLAKLLTD
IAIDIAHAGD SWRDHERLTR ARMLAHEFEA ERHDGVSSVR DLAVGTWITL TGDPQWDRQR
ADKRQFVITS IDHDIWNNLP KGLNERVHAL FAASRNLACA PRALPSALAN DADTRYENTF
ACVRRGVPLA PAYDPQADLP PAHLLTGTIV GAEGEEVFCD EDGRVRVRVH GLDPADHAHA
QGAGTNGNAG DSAPIRVASS LAGAHFGASF LPRVGMEVLL GCLGGDPDRL VIIGVLGNGA
HPPATFSHAG GLPGNRYLSG IKTKEIRGQR YNQLRLDDTP NQISAQLASE HAHSQLNLGY
LTQPRENGHG NDRGEGVELR TDAAAALRAA QGMLLTTYAR TQASGGQLDR DELIRLLGEC
AELFKALGDY AGQHGGQAAD TAGQHAVAAA FKRWAPGTGT DGAARALMAF GAQAGSVNVT
PKTHVTYAGE NIDQVAQQHL QLMSGQRLNA TAGQGMQLFA RGAGVQAVAG EGPMLLQAQA
GTLTANAQKG VKITTNEHEV FVSAPKIRLV AEDGSYLELG GGITLGTNGD IKLLSASHQW
GGPSTAQAAK SGFGNQPTDQ RFKLHYPGED GDLQAAANKR FRITLDDGRV IEGKTDASGL
TDLVKDDAMR IAKIDYLKPK L