Gene BURPS1710b_3658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3658 
SymbolhepA 
ID3688422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3988044 
End bp3990926 
Gene Length2883 bp 
Protein Length960 aa 
Translation table11 
GC content56% 
IMG OID637730113 
ProductSNF2 family helicase 
Protein accessionYP_335023 
Protein GI76809589 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.041395 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAGC GACTTCTGAC ACCGCATCAG AGCCAATACT TCGCATGGCT GCTGACGCGG 
CGCGCGGCGG GCGACACAGT GGAGTCGCTG GCATCGGCGC TGGTCGACTC ACAGGTGGAC
CTCAATCCGC ATCAAGTCGA AGCGGCGCTC TTCGCATGCC GCAACCCCCT GTCTCGGGGC
GTGATCCTTG CGGACGAAGT CGGTCTAGGA AAGACGATCG AAGCGGGATT GGTAATTTCG
CAGCGATGGG CCGAGCGTCG CCGCCGCATT CTCATCATCG TCCCGGCAAA CCTGCGCAAG
CAATGGCACC AGGAGCTGCA GGACAAGTTC AGCTTGCAAG GACTCATCCT TGAGTCCAAG
AACTACAACG CGCTCCGCAA GAGCGAGCGT CAAAATCCGT TTCTGACATC GTTCGAGCCG
GTGATCTGTT CGTATCAGTT TGCGAAGTCC AAGGCTAAGG ACATCAAAGA GATCGAGTGG
GACCTGGTTG TATTTGATGA AGCTCACCGC CTTCGCAATG TCTATAAAAC AAGTAATGTC
ATAGCCAAAA CGCTGAAAGA TGCGCTGGTG CATGTCCATT CCAAGGTATT GTTAACCGCG
ACACCGTTGC AAAACTCGTT GCTGGAGCTC TACGGCCTTG TCAGCATGAT CGATGAACGG
GTCTTTGGCG ACATCGATAG CTTTCGCGTG CAATTCACGG CCCAAGGCAA AGACCAAGCC
TATGGGGGTC TCCGAGAACG GTTAGCCGCA GTTTGCAAGC GTACCTTGCG CCAGCAAGTA
CAGCCCTACG TTTCGTTTAC GGCGCGAAAG GCGATTGTCC ATGAGTTTAC GCCGTCGAGC
GAAGAGCAGG AACTGTCCCA ACTCGTCGCC AATTATCTAC GTCGCCCTAA CCTTAAAGCC
CTGCCCGAAG GACAGCGGCA ACTGATTTCG CTCGTATTGT GGAAGCTGCT TGCCTCCAGC
ACGCGCGCGA TAGCTGGTGC TCTTGAAACA ATGGCAAATC GCCTTCAGAG CGTGCTCGAC
AAAGCCGTCG CGGTGAACGA TTTGGCTGAA GAATTGGATG AAGACTACGA GGCGCTTGAC
GAGACCGCCG ACGAATGGAA CGGACAAAAT TCGAGCGCGA ATACGCACGA CGGCACCGAA
TCCGACGCCA TCGCGCAGGA GATTGAGGAG CTTCGGCACT TCAAGGCGTT GGCGACCAAT
ATAAGGGATG ATGCAAAGGG CAAGGCGTTG CTTACAGCAC TGGATGGCGC CTTCGCGGAG
CTGGACCGAT TGGGGGCAGA AAAGAAGGCG ATTATTTTCA CGGAGTCGAA GCGGACGCAG
GAGTACCTGC TCCATCTATT GGCTGACACA CCCCACGGTG ACGGTGTCGT GCTGTTCAAT
GGCACTAACA GTGATGCACG TGCCCAGGCT ATCTACAAGG ACTGGCTGAA GCGACATGCA
GGCACCGATC GCATCACTGA CTCAAAGACA GCCGACACCC GCGCCGCCCT AGTCGAGCAT
TTCAAGGAAC GCGGCAAGAT CATGATCGCC ACCGAGGCTG GGGCTGAAGG GATCAATCTC
CAGTTTTGCT CGTTGGTCAT CAACTACGAC TTGCCGTGGA ACCCGCAGCG CATCGAGCAA
CGCATTGGTC GCTGCCATCG TTACGGCCAG AAGTTCGATG TCGTCGTGGT GAACTTCGTC
GATCTCAGTA ACGAAGCTGA CAGGCGCGTC TACGATTTGT TGTCGCAAAA ATTCCAACTC
TTCGAGGGCG TATTTGGAGC CAGCGATGAA GTGTTGGGCG CGATCGGCTC GGGGGTCGAT
TTCGAGCGAC GCATCGCTGA TATTTATCGA AACTGCCGAG AGTCCGATGA GATCAAGGCC
AGCTTCGAGC AGCTCCAACT CGACCTCTCG AGCGAGATTA ACGAAGCGAT GATCAAGACG
CGCCAACTCC TGCTGGAAAA CTTCGATGAG GAGGTACAGG AGAAACTGCG CATCCGAGCA
GAAGACAGCC GCAGTACTCG CAGCCGCTTT GAGCGAATAT TGATGGACTT GACACGTTCG
GAACTAGGCG CATGCGGGGT ATTCGACGAC CATGGGTTTG TCCTCCACCA CTCGCCGGAG
GGCATCGAGA GCAGCTCCAT TGACGCAATC GAGCTTGGGC GCTACGAACT ACCTCGTCGC
TCGGGTGATG CGCATCTGTA TCGCGTCAAC CATCCGCTGG CCCTTTTGAT AACAGAGCGG
GCCAAAACTC GTGCTCTTGG CGGGGCTCGC CTCGTATTTG ACTACGACGC ACACGGGTCG
CAGGTCAGTA CGCTCAAAGC TTATCGCGGC AAGGCCGGGT GGCTCACCGT GAAGCTGATT
TCGGTCGAGG CGCTTGGCAA ACAGGAGCAG CATTTGCTCG TCGCCGCGAC CACGACCGAC
GGCCTCGTCC TCGCCGAGGA AGACCCGGAG AAGCTGCTGC GCCTGCCCGC CACGGTAGAA
GCAGAGGGAC TGTTCAGCAC CGCAGACAGT TCCCTGCTGG CCGACGCGGA AGTGCGGAAG
ACTGTGCTGC TGCGCGGAGT CAATGAGCGC AACCTCGGTT ACTTTGAACA GGAAGTTCAG
AAGCTGGATG CCTGGGCAGA CGACTTGAAG GTCGGGCTGG AGCAAGAAAT CAAGGAGATC
GACCGCGAGA TCAAGGAAGT ACGGCGTACT GCCGCTACCT CACCAACGTT GCAGGAAAAG
CTGTCGTGGC AAAAAAAACA ACGCGAATTG GAAGGCAAGC GAAGCAAGCT ACGGCGTGAG
CTGTTCGTTC GGCAAGACGA AATAGAAGCG CAGCGCAACG ACCTCATCAG CGAGCTGGAG
GCGAAGCTTC AGCAGCAAGT AGACGAACGC ACACTGTTTA CTGTCGAGTG GGAGTTGATT
TAA
 
Protein sequence
MAERLLTPHQ SQYFAWLLTR RAAGDTVESL ASALVDSQVD LNPHQVEAAL FACRNPLSRG 
VILADEVGLG KTIEAGLVIS QRWAERRRRI LIIVPANLRK QWHQELQDKF SLQGLILESK
NYNALRKSER QNPFLTSFEP VICSYQFAKS KAKDIKEIEW DLVVFDEAHR LRNVYKTSNV
IAKTLKDALV HVHSKVLLTA TPLQNSLLEL YGLVSMIDER VFGDIDSFRV QFTAQGKDQA
YGGLRERLAA VCKRTLRQQV QPYVSFTARK AIVHEFTPSS EEQELSQLVA NYLRRPNLKA
LPEGQRQLIS LVLWKLLASS TRAIAGALET MANRLQSVLD KAVAVNDLAE ELDEDYEALD
ETADEWNGQN SSANTHDGTE SDAIAQEIEE LRHFKALATN IRDDAKGKAL LTALDGAFAE
LDRLGAEKKA IIFTESKRTQ EYLLHLLADT PHGDGVVLFN GTNSDARAQA IYKDWLKRHA
GTDRITDSKT ADTRAALVEH FKERGKIMIA TEAGAEGINL QFCSLVINYD LPWNPQRIEQ
RIGRCHRYGQ KFDVVVVNFV DLSNEADRRV YDLLSQKFQL FEGVFGASDE VLGAIGSGVD
FERRIADIYR NCRESDEIKA SFEQLQLDLS SEINEAMIKT RQLLLENFDE EVQEKLRIRA
EDSRSTRSRF ERILMDLTRS ELGACGVFDD HGFVLHHSPE GIESSSIDAI ELGRYELPRR
SGDAHLYRVN HPLALLITER AKTRALGGAR LVFDYDAHGS QVSTLKAYRG KAGWLTVKLI
SVEALGKQEQ HLLVAATTTD GLVLAEEDPE KLLRLPATVE AEGLFSTADS SLLADAEVRK
TVLLRGVNER NLGYFEQEVQ KLDAWADDLK VGLEQEIKEI DREIKEVRRT AATSPTLQEK
LSWQKKQREL EGKRSKLRRE LFVRQDEIEA QRNDLISELE AKLQQQVDER TLFTVEWELI