Gene BURPS668_A0127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0127 
Symbol 
ID4888440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp108223 
End bp111009 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content69% 
IMG OID640130068 
Productputative Rhs element Vgr protein 
Protein accessionYP_001061133 
Protein GI126444328 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCAATT TGAACGATAC GCTACGCAAT TTTGCGTCGG GGGCGGTCGA CTGGAATAAA 
CGTCCGGTCG CGTTGCACTT TGGCGCCGCG CAGGCCGCGC TGGGCCACCT CCTCGCGCTG
CAGCACGCCA GTGTTCAGGA AGGCCTGATG ACCGGGATCC ACGGCCGATT GACCTGCGTG
TCGACCCGCC GCGATCTTCC GCCCGGCGTG TTGCTCGGCA TTCCGGTTTC GATCCGGCTC
ATTACCGACC GCGGACAGCC GCACACGGTG AACGCGATCA TCAGCGACGT CCAGATCGGC
CAAAGCGACG GCGAGCTCTG TGTGTACCAG CTGACGGTCT GCGACGCGCT GTCGCTGATG
GACAAGCGCA CCAATTCGCG GGTCTTCCGA AAGCGCAGCG TCATCGAGGT GCTCGCCACG
CTGTTCAACG AATGGCAGCA GCGCAGCCCG GCCCTCGCGC GCGCGTTCGA ATTCGATCTG
TCCGGCTTGC GCGCCGATCG CTATCCGCCC CGCGAGCTGA CCCGGCAGGT CAACGAATCG
GATGCGCATT TCGTGCGCCG TCTGCTGCGC CGCGAAGGGA TCACCGTGTT CGCGAAGGCG
GGGCCGGCGA AGGGCGAACG GCCGTTGCAG GGCGACGCGC CCGTGCACAC GCTCGTGTGC
TGTGACGATC CGATGTCGTT GCCGCAAGCG CCGGCCGGCA CGGTCCGCTT GCATCCGCGC
GACGGCGGCG CCGCGCAGCG CGACACGGTC ACGCTGTTCG CGCTGCGTCG GCAATTGGCG
CCCGGCAAGG CCGGGCGCCC GTCGTGGGAC TACAAGAAGG CGCGGATCGA CGAATCGAGC
GTCGCTTCGA GCCTCGATCA GGGCGAGGCG GGCAACGATC TGGCGAAGCT GCTGACCGAC
ATCGCGATCG ACATTGCGCA CGCGGGCGAT TCATGGCGCG ATCACGAGCG GCTCACCCGC
GCGCGCATGC TCGCGCACGA GTTCGAAGCC GAGCGCCATG ACGGCGTCAG CAGCGTGCGG
GATCTGGCCG TGGGCGCGTG GATCACGCTG ACGGGCGATC CGGACTGGGA CAGGCAACTC
GCCGACAAGC GCCAGTTCGT GATCACGTCG ATCGATCACG ACATCTGGAA CAACCTGCCG
AAGGGGCTCA ACGAGCGCGT GCACGCGCTG TTCGCCGCGA GCCGCAATCT CGCGTGCGCG
CCCCGCGCGC TGCCGTCCGC GCTGGCGAAC GACGCGGATA CCCGCTACGA GAACACGTTC
ACGTGCGTGC GCCGCGGCGT GCCGCTTGCG CCCGCGTACG ATCCGCAAGC CGATTTGCCG
CCCGCGCATC TGCTCACGGG CACGATTGTC GGCGCGGAGG GCGAAGAAGT GTTCTGCGAC
GAAGACGGCC GGGTGCGCGT GCGGTTGCAC GGCCTCGATC CGGCGGATCA CGCGCACGCG
CAGGGCGCGG GCACCAACGG CAACGCGGGC GACAGCGCGC CGATCCGCGT GGCGTCGAGC
CTCGCCGGCG CCTATTTCGG CGCATCGTTT CTGCCGCGCG TCGGCATGGA AGTCCTCCTC
GGGTGTCTCG GCGGCGATCC GGACCGGCTG GTGATCATCG GCGTGCTCGG TAACGGCGCG
CATCCGCCGG CGACGTTCAG CCACGCGGGC GGGCTGCCGG GCAATCGCTA CCTGTCGGGC
ATCAAGACGA AGGAGATTCG TGGGCAACGG TACAACCAGC TGCGTCTCGA CGACACGCCG
AACCAGATCA GCGCGCAACT GGCGAGCGAG CACGCGCATT CGCAGCTCAA TCTCGGATAT
CTGACGCAAC CGCGCGAGAA CGGCCACGGG AACGACCGCG GCGAGGGCGT GGAGTTGCGT
ACCGACGCGG CGGCGGCGCT GCGGGCGGCG CAAGGCATGC TGCTGACGAC CTACGCGCGC
ACGCAGGCGA GCGGCGGGCA ACTGGACCGT GACGAGCTGA TTCGGTTGCT CGGCGAATGC
GCGGAGCTGT TCAAGGCGCT GGGCGACTAC GCGGGGCAGC ACGGCGGGCA GGCCGCGGAT
ACGGCCGGCC AGCACGCGGT GGCCGCCGCG TTCAAGCGCT GGGCGCCGGG CACGGGCACG
GACGGCGCCG ATGCGCCGTC CGACGGCGCA GCGCGCGCGC TGATGGCGTT CGGCGCGCAG
GCCGGTTCGG TGAACGTCAC GCCGAAGACG CATGTGACGT ATGCCGGCGA GAACATCGAT
CAGGTCGCGC AGCAGCACCT GCAACTGATG AGCGGCCAGC GGCTGAACGC GACGGCCGGG
CAGGGCATGC AGCTCTTCGC GCGGGGCGCG GGGGTGCAGG CCGTGGCGGG CGAAGGGCCG
ATGCTGCTGC AGGCGCAAGC CGGCACGCTG ACGGCGAACG CGCAGAAGGG CGTCAAGATC
ACGACGAACG AGCACGAGGT GTTCGTGAGC GCGCCGAAGA TTCGGCTCGT TGCCGAGGAC
GGCAGCTACC TCGAGCTCGG CGGCGGCATC ACGCTCGGCA CGAACGGCGA CATCAAGCTG
CTGTCGGCGT CGCACCAGTG GGGCGGGCCG TCGACCGCGC AGGCGGCGAA GAGCGGGTTC
GGCAATCAGC CGACGGATCA GCGTTTCAAG CTGCACTATC CGGGCGAGGA CGGCGATTTG
CAGGCGGCGG CGAACAAGCG GTTCCGGATC ACGCTGGACG ACGGGCGCGT CATCGAAGGC
AAGTCCGACG CGAGCGGCCT GACGGATCTG GTCAAGGACG ACGCGATGCG TATCGCGAAG
ATCGACTATC TGAAACCGAA GCTCTGA
 
Protein sequence
MTNLNDTLRN FASGAVDWNK RPVALHFGAA QAALGHLLAL QHASVQEGLM TGIHGRLTCV 
STRRDLPPGV LLGIPVSIRL ITDRGQPHTV NAIISDVQIG QSDGELCVYQ LTVCDALSLM
DKRTNSRVFR KRSVIEVLAT LFNEWQQRSP ALARAFEFDL SGLRADRYPP RELTRQVNES
DAHFVRRLLR REGITVFAKA GPAKGERPLQ GDAPVHTLVC CDDPMSLPQA PAGTVRLHPR
DGGAAQRDTV TLFALRRQLA PGKAGRPSWD YKKARIDESS VASSLDQGEA GNDLAKLLTD
IAIDIAHAGD SWRDHERLTR ARMLAHEFEA ERHDGVSSVR DLAVGAWITL TGDPDWDRQL
ADKRQFVITS IDHDIWNNLP KGLNERVHAL FAASRNLACA PRALPSALAN DADTRYENTF
TCVRRGVPLA PAYDPQADLP PAHLLTGTIV GAEGEEVFCD EDGRVRVRLH GLDPADHAHA
QGAGTNGNAG DSAPIRVASS LAGAYFGASF LPRVGMEVLL GCLGGDPDRL VIIGVLGNGA
HPPATFSHAG GLPGNRYLSG IKTKEIRGQR YNQLRLDDTP NQISAQLASE HAHSQLNLGY
LTQPRENGHG NDRGEGVELR TDAAAALRAA QGMLLTTYAR TQASGGQLDR DELIRLLGEC
AELFKALGDY AGQHGGQAAD TAGQHAVAAA FKRWAPGTGT DGADAPSDGA ARALMAFGAQ
AGSVNVTPKT HVTYAGENID QVAQQHLQLM SGQRLNATAG QGMQLFARGA GVQAVAGEGP
MLLQAQAGTL TANAQKGVKI TTNEHEVFVS APKIRLVAED GSYLELGGGI TLGTNGDIKL
LSASHQWGGP STAQAAKSGF GNQPTDQRFK LHYPGEDGDL QAAANKRFRI TLDDGRVIEG
KSDASGLTDL VKDDAMRIAK IDYLKPKL