Gene BURPS1106A_A1615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1615 
Symbol 
ID4903595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1582879 
End bp1585665 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content69% 
IMG OID640144721 
Productputative Rhs element Vgr protein 
Protein accessionYP_001075649 
Protein GI126458074 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.491638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCAATT TGAACGATAC GCTACGCAAT TTTGCGTCGG GGGCGGTCGA CTGGAATAAA 
CGTCCGGTCG CGTTGCACTT TGGCGCCGCG CAGGCCGCGC TGGGCCACCT CCTCGCGCTG
CAGCACGCCA GTGTTCAGGA AGGCCTGATG ACCGGGATCC ACGGCCGATT GACCTGCGTG
TCGACCCGCC GCGACCTTCC GCCCGGCGTG TTGCTCGGCA TTCCGGTTTC GATCCGGCTC
ATTACCGACC GCGGACAGCC GCACACGGTG AACGCGATCA TCAGCGGCGT CCAGATCGGC
CAAAGCGACG GCGAGCTCTG TGTGTACCAG CTGACGGTCT GCGACGCGCT GTCGCTGATG
GACAAGCGCA CCAATTCGCG GGTCTTCCGA AAGCGCAGCG TCATCGAGGT GCTCGCCACG
CTGTTCAACG AATGGCAGCA GCGCAGCCCG GCCCTCGCGC GCGCGTTCGA ATTCGATCTG
TCCGGCTTGC GCGCCGATCG CTATCCGCCC CGCGAGCTGA CCCGGCAGGT CAACGAATCG
GATGCGCATT TCGTGCGCCG TCTGCTGCGC CGCGAAGGGA TCACCGTGTT CGCGAAGGCG
GGGCCGGCGA AGGGCGAACG GCCGTTGCAG GGCGATGCGC CCGTGCACAC GCTCGTGTGC
TGTGACGATC CGATGTCGTT GCCGCAAGCG CCGGCCGGCA CGGTCCGCTT GCATCCGCGC
GACGGCGGCG CCGCGCAGCG CGACACGGTC ACGCTGTTCG CGCTGCGTCG GCAATTGGCG
CCCGGCAAGG CCGGCCGCCC GTCGTGGGAC TACAAGAAGG CGCGGATCGA CGAATCGAGC
GTCGCTTCGA GCCTCGATCA GGGCGAGGCG GGCAACGACC TGGCGAAGCT GCTGACCGAC
ATCGCGATCG ACATTGCGCA CGCGGGCGAT TCATGGCGCG ATCACGAGCG GCTCACCCGC
GCGCGCATGC TCGCGCACGA GTTCGAAGCC GAGCGCCATG ACGGCGTCAG CAGCGTGCGG
GATCTCGCCG TGGGCACATG GATCACGCTG ACGGGCGATC CGCAATGGGA CAGGCAACGC
GCCGACAAGC GTCAGTTCGT GATCACGTCG ATCGATCACG ACATCTGGAA CAACCTGCCG
AAGGGGCTCA ACGAGCGCGT GCACGCGCTG TTCGCCGCAA GCCGCAATCT CGCGTGCGCG
CCCCGCGCGC TGCCGTCCGC GCTGGCGAAC GACGCGGATA CCCGCTACGA GAACACGTTC
GCGTGCGTGC GCCGCGGCGT GCCGCTTGCG CCCGCGTACG ATCCGCAAGC CGATTTGCCG
CCCGCGCATC TGCTCACGGG CACGATTGTC GGCGCGGAGG GCGAAGAAGT GTTCTGCGAC
GAAGACGGCC GGGTGCGCGT GCGGGTGCAC GGCCTCGATC CGGCGGATCA CGCGCACGCG
CAGGGCGCGG GCACCAACGG CAACGCGGGC GACAGCGCGC CGATCCGCGT GGCGTCGAGC
CTTGCCGGCG CCCATTTCGG CGCATCGTTC CTGCCGCGAG TCGGCATGGA AGTCCTCCTC
GGGTGTCTCG GCGGCGATCC GGACCGGCTG GTGATCATCG GCGTGCTCGG TAACGGCGCG
CATCCGCCGG CGACGTTCAG CCACGCGGGC GGGCTGCCGG GCAACCGCTA CCTGTCGGGC
ATCAAGACGA AGGAGATTCG TGGGCAACGG TACAACCAGC TGCGTCTCGA CGACACGCCG
AACCAGATCA GCGCGCAACT GGCGAGCGAG CACGCGCATT CGCAGCTCAA TCTCGGATAT
CTGACGCAAC CGCGCGAGAA CGGCCACGGG AACGACCGCG GCGAGGGCGT GGAGTTGCGT
ACCGACGCGG CGGCGGCGCT TCGGGCGGCG CAAGGCATGC TGCTGACGAC CTACGCGCGC
ACGCAGGCGA GCGGCGGGCA ACTGGACCGT GACGAGCTGA TTCGGTTGCT CGGCGAATGC
GCGGAGCTGT TCAAGGCGCT GGGCGACTAC GCGGGGCAGC ACGGCGGGCA GGCCGCGGAT
ACGGCCGGCC AGCACGCGGT GGCCGCCGCG TTCAAGCGCT GGGCGCCGGG CACGGGCACG
GACGGCGCCG ATGCGCCGTC CGACGGCGCA GCGCGCGCGC TGATGGCGTT CGGCGCGCAG
GCCGGTTCGG TGAACGTCAC GCCGAAGACG CATGTGACGT ATGCCGGCGA GAACATCGAT
CAGGTCGCGC AGCAGCACCT GCAACTGATG AGCGGCCAGC GGCTGAACGC GACGGCCGGG
CAGGGCATGC AGCTCTTCGC GCGGGGCGCG GGGGTGCAGG CCGTGGCGGG CGAAGGGCCG
ATGCTGCTGC AGGCGCAAGC CGGCACGCTG ACGGCGAACG CGCAGAAGGG CATCAAGATC
ACGACGAACG AGCACGAGGT GTTCGTGAGT GCGCCGAAGA TTCGGCTCGT TGCCGAGGAC
GGCAGCTACC TCGAGCTCGG CGGCGGCATC ACGCTCGGCA CGAACGGCGA CATCAAGCTG
CTGTCGGCGT CGCACCAGTG GGGCGGGCCG TCGACCGCGC AGGCGGCGAA GAGCGGGTTC
GGCAATCAGC CGACGGATCA GCGTTTCAAG CTGCACTATC CGGGCGAGGA CGGCGATTTG
CAGGCGGCGG CGAACAAGCG GTTCCGGATC ACGCTGGACG ACGGGCGCGT CATCGAAGGC
AAGACCGACG CGAGCGGCCT GACGGATCTG GTCAAGGACG ACGCGATGCG TATCGCGAAG
ATCGACTATC TGAAGCCGAA GCTCTGA
 
Protein sequence
MTNLNDTLRN FASGAVDWNK RPVALHFGAA QAALGHLLAL QHASVQEGLM TGIHGRLTCV 
STRRDLPPGV LLGIPVSIRL ITDRGQPHTV NAIISGVQIG QSDGELCVYQ LTVCDALSLM
DKRTNSRVFR KRSVIEVLAT LFNEWQQRSP ALARAFEFDL SGLRADRYPP RELTRQVNES
DAHFVRRLLR REGITVFAKA GPAKGERPLQ GDAPVHTLVC CDDPMSLPQA PAGTVRLHPR
DGGAAQRDTV TLFALRRQLA PGKAGRPSWD YKKARIDESS VASSLDQGEA GNDLAKLLTD
IAIDIAHAGD SWRDHERLTR ARMLAHEFEA ERHDGVSSVR DLAVGTWITL TGDPQWDRQR
ADKRQFVITS IDHDIWNNLP KGLNERVHAL FAASRNLACA PRALPSALAN DADTRYENTF
ACVRRGVPLA PAYDPQADLP PAHLLTGTIV GAEGEEVFCD EDGRVRVRVH GLDPADHAHA
QGAGTNGNAG DSAPIRVASS LAGAHFGASF LPRVGMEVLL GCLGGDPDRL VIIGVLGNGA
HPPATFSHAG GLPGNRYLSG IKTKEIRGQR YNQLRLDDTP NQISAQLASE HAHSQLNLGY
LTQPRENGHG NDRGEGVELR TDAAAALRAA QGMLLTTYAR TQASGGQLDR DELIRLLGEC
AELFKALGDY AGQHGGQAAD TAGQHAVAAA FKRWAPGTGT DGADAPSDGA ARALMAFGAQ
AGSVNVTPKT HVTYAGENID QVAQQHLQLM SGQRLNATAG QGMQLFARGA GVQAVAGEGP
MLLQAQAGTL TANAQKGIKI TTNEHEVFVS APKIRLVAED GSYLELGGGI TLGTNGDIKL
LSASHQWGGP STAQAAKSGF GNQPTDQRFK LHYPGEDGDL QAAANKRFRI TLDDGRVIEG
KTDASGLTDL VKDDAMRIAK IDYLKPKL