Gene BURPS1106A_3680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3680 
Symbol 
ID4900504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3593469 
End bp3596606 
Gene Length3138 bp 
Protein Length1045 aa 
Translation table11 
GC content63% 
IMG OID640136906 
ProductSNF2-related:helicase, C-terminal 
Protein accessionYP_001067911 
Protein GI126452260 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTCC AGAGCGAACG GCAAACTCAG GCATACCGAG CCCGACCAGC ACCGGGACAA 
CTGGTCGAAG TGCGCCGCCG TCAGTGGGTT GTTGCGGATA TCGATGCGGC AGGATTGAGC
TTCGGACTTC CAACGCCCCA ACATTGCGTC ACGCTCTCAT CGATCGATGA AGACGGCCTT
GGCGAGGAAT TGGAAGTCAT CTGGGAGATC GAGCCGGGAG CCCAAGTTAT CGAGCGGGCA
GGTCTGCCTT CAATTACGGG GCAGGATGAT TCCGATGTGC TTGATGCCTT CCTTGATGCC
GTGCGCTGGG GCGCGGCCAC CAACGCCGAC AGGGGGTTCC TCCAGGCTCC GTTCCGAAGT
GGCGTCAGCA TCGAGGACTT TCAACTCGAT CCACTGGTCC GAGCCATCGA CATGGCTCGC
GTCAATCTTC TTATTGCCGA CGATGTCGGC CTCGGCAAGA CCGTCGAGGC TGGTTTGGTC
ATTCAGGAGT TACTGCTGCG GCATCGCGCC CGCACGGTAT TGATCGTTTG CCCGGCGTCG
CTTCAGGAGA AGTGGCGCGT TGAGATGCTG GAGAAATTCG GACTCGATTT CCGCGTTGTC
GATACTGACT ACATCAAGCG GTTGCGGCGC GAGCGTGGCA TCCATACCAA TCCATGGACG
TCGCATCCGC GCCTCATCAC GTCTATGGAT TGGGCCAAGG GCGGAGAAGG CTTGCGGGCC
ATGCGTGACG TGCTCCCGCC GCACGTCGGC CATCCGCGCA AGTTTGACCT GCTGGTCGTG
GACGAAGCGC ACAACGTCGC GCCCTCGGCA GGCGCGCACT ACGCGCTGGA GAGTCAGCGT
ACGCGCTTCG TCCGTGCCAT CGGCCCACAC TTCCAGCATC GTCTCTTCCT GACCGCGACG
CCGCACAACG GCTACACCGA GTCGTTCACC TCGCTGCTGG AATTGCTCGA CGACCAGCGT
TTCGCGCGCA ACATCCTCCC CGACGAAAAT CGTCTTAGTC AGGTGATGAT CCGCCGTCTG
AAGAGCGATC TGGTTGATGC GGACGGCAAT CCCCTGTACG CCCGGCGCAC CTTGCAGGCA
CTCGAAGTCC CATACACGGC GGAAGAGCGC GAGGTTCATC GCAAACTGGA CGATTACTGC
GCGAGCCGTG AAAAGGATGC CGAGAACGCA GGCAACGGCT TTGGCACGGC CTTCGTCAAT
CGTCTCCTCA AGAAACGTCT GCTCTCGTCG CCAGCGGCGT TCGCATCCAC GCTCGAAAAG
CACGTCACGT CACTGTCAGA AGCGCGGCCC GCGAAGCTGG ACACGATGGC CGAACGCATC
CTGCACAAGG CCATCCTGAA AGCCGACGAG GACTATGCCG ACGACGGGGA TGTCGAGAAC
GCTCAAGCCG AAGCCGTCGA GGAAGCCACG CGCCGCTCAA TACCGCTGAC GCCAGAGCAG
CGGGCGACGC TGGACGACTT GCGGGCATGG GCGCAGCGAG CCAGGAATCA GGCTGACTCC
AAGGCCCAAG CCATCCTCCG CTGGCTCTCG GCCTACCTCA AGCCAGATGG TCAGTGGAAC
GATCGCCGGG TGATCCTGTT CACGGAATAC CGCACCACGC ACCAGTGGAT GCATCAAATC
CTCGCCAGCC ACGGCTTTGG CGGCGAGCGT CTCGGTCTGC TCCACGGTGG CCTATCGCAA
GAAGAACGCG AACCCATCAA AGCGGCGTTC CAAGCTTCGC CGCAGGATTC GCCCGTGCGC
ATCCTGCTCG CCACCGACGC GGCCTCCGAA GGCATCGACT TGCAGAACCA CTGCAATCGG
CTCATCCACT TGGAGATTCC CTACAACCCC AACGTGATGG AGCAGCGTAA CGGGCGTATC
GACCGCCACG GCCAGCGCGA GAAGGAAGTG CTGATCTGGC ACCCGGTCGA TGGTGGCGGC
GCGAACGGCG CATCGGTCGG CGGCCTCGGC GAGGACATCC TTCGCGCCCT GCGGAAACTG
GACTCGATGC GCGCCGACAT GGGCAGTGTC AATCCGGTCA TCGCGCCGCA GATGTCCGGC
CTTATTGAAG GCTCCCTGAA GGACTTGGAC ACTCGCCTCG CCGAGGCCCG GATTGCCCGC
GCCAAAAACT TCGTGCGCGC TGAACGAGAG TTGAAGGAGC GCGTCGCCAA GCTGCACGAG
CGTCTGCTCA CCACCAAGCA GGATTTCCAC CTCACGCCCG ACCACGTCCT GATGGCCGTA
AAGACCGGCC TCTCGCTGGC GGGCCGTCCG CCGCTGGAAC CGGTCGAACT TGCGGGCGCG
CCTTCTGGCA GCGTCTTCCG GATGCCTGCG CTGTCCGGTT CGTGGGCGCG CTGTCTGCAA
GGGCTGCGCC ACCCGCACAC CCAAAAGATT CGGCCCATCA CCTTCGACCA CGCCATCGCC
AGTGGCCGCG ACGACGTCGT GCTCGTCCAC TTGAACCATC GCTTGGTGCA GATGTGCCTG
CGTCTGCTGC GCGCCGAAAT CTGGGCACGG GACGACGTGA AGAAGCTGCA TCGTGTCACC
ATCCGCACCA TGCCGGACGC GCTCGTCGAT GGCCCCGCCG TGGTCGTCGT TTCGCGGCTG
GTAGTCACCG GCGGCAACCA CCACCGGCTG CACGAAGAAC TGACGGTATC GGGCGGCTAC
CTACGCGACC AGTCCTTCCG CCGCGAAGAA GGTGTCACCC GCGTCCAGCA ATGGCTGGAT
GAATCGAAAC CGATCACGGC GGCCCCGCCG CTGTTCGACG CGCTGCGCGT CCGCTTCGAC
CGTCAGCAGG AAGCCATCCT GAAAGCCGTG GATGCCCGTT CCAAAGAACG CCTTCGTTAC
CTGACCAACA CGCTTCAGAC TCGCAAGCAG CAGGAAATCG AGGACATCGG TACCGTGCTC
GACGAATTGG AGAAGGCGAT CCAGTCCGAA TTGAAGAAAG GCGAGCAGCC CGAGCAGCTC
ACGCTCTTCA CCGAGGACGA ACGCACGCAG CTCCACCGCG ACATCGCCGC GCTGGAGGCC
CGCCTTGCAC GCATCCCCGG CGAGCGCCAG ATGGAGACTC AGGCCATCGA ATCCCGTTAC
GCCAAGCTCG ACGACCGCAC CTTTCCGGTC GCCGTGATCT TCGTCGTCCC CGAGTCTACG
TTAGAGGTGG CGATATGA
 
Protein sequence
MGVQSERQTQ AYRARPAPGQ LVEVRRRQWV VADIDAAGLS FGLPTPQHCV TLSSIDEDGL 
GEELEVIWEI EPGAQVIERA GLPSITGQDD SDVLDAFLDA VRWGAATNAD RGFLQAPFRS
GVSIEDFQLD PLVRAIDMAR VNLLIADDVG LGKTVEAGLV IQELLLRHRA RTVLIVCPAS
LQEKWRVEML EKFGLDFRVV DTDYIKRLRR ERGIHTNPWT SHPRLITSMD WAKGGEGLRA
MRDVLPPHVG HPRKFDLLVV DEAHNVAPSA GAHYALESQR TRFVRAIGPH FQHRLFLTAT
PHNGYTESFT SLLELLDDQR FARNILPDEN RLSQVMIRRL KSDLVDADGN PLYARRTLQA
LEVPYTAEER EVHRKLDDYC ASREKDAENA GNGFGTAFVN RLLKKRLLSS PAAFASTLEK
HVTSLSEARP AKLDTMAERI LHKAILKADE DYADDGDVEN AQAEAVEEAT RRSIPLTPEQ
RATLDDLRAW AQRARNQADS KAQAILRWLS AYLKPDGQWN DRRVILFTEY RTTHQWMHQI
LASHGFGGER LGLLHGGLSQ EEREPIKAAF QASPQDSPVR ILLATDAASE GIDLQNHCNR
LIHLEIPYNP NVMEQRNGRI DRHGQREKEV LIWHPVDGGG ANGASVGGLG EDILRALRKL
DSMRADMGSV NPVIAPQMSG LIEGSLKDLD TRLAEARIAR AKNFVRAERE LKERVAKLHE
RLLTTKQDFH LTPDHVLMAV KTGLSLAGRP PLEPVELAGA PSGSVFRMPA LSGSWARCLQ
GLRHPHTQKI RPITFDHAIA SGRDDVVLVH LNHRLVQMCL RLLRAEIWAR DDVKKLHRVT
IRTMPDALVD GPAVVVVSRL VVTGGNHHRL HEELTVSGGY LRDQSFRREE GVTRVQQWLD
ESKPITAAPP LFDALRVRFD RQQEAILKAV DARSKERLRY LTNTLQTRKQ QEIEDIGTVL
DELEKAIQSE LKKGEQPEQL TLFTEDERTQ LHRDIAALEA RLARIPGERQ METQAIESRY
AKLDDRTFPV AVIFVVPEST LEVAI