Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_3658 |
Symbol | hepA |
ID | 3688422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 3988044 |
End bp | 3990926 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637730113 |
Product | SNF2 family helicase |
Protein accession | YP_335023 |
Protein GI | 76809589 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.041395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAGC GACTTCTGAC ACCGCATCAG AGCCAATACT TCGCATGGCT GCTGACGCGG CGCGCGGCGG GCGACACAGT GGAGTCGCTG GCATCGGCGC TGGTCGACTC ACAGGTGGAC CTCAATCCGC ATCAAGTCGA AGCGGCGCTC TTCGCATGCC GCAACCCCCT GTCTCGGGGC GTGATCCTTG CGGACGAAGT CGGTCTAGGA AAGACGATCG AAGCGGGATT GGTAATTTCG CAGCGATGGG CCGAGCGTCG CCGCCGCATT CTCATCATCG TCCCGGCAAA CCTGCGCAAG CAATGGCACC AGGAGCTGCA GGACAAGTTC AGCTTGCAAG GACTCATCCT TGAGTCCAAG AACTACAACG CGCTCCGCAA GAGCGAGCGT CAAAATCCGT TTCTGACATC GTTCGAGCCG GTGATCTGTT CGTATCAGTT TGCGAAGTCC AAGGCTAAGG ACATCAAAGA GATCGAGTGG GACCTGGTTG TATTTGATGA AGCTCACCGC CTTCGCAATG TCTATAAAAC AAGTAATGTC ATAGCCAAAA CGCTGAAAGA TGCGCTGGTG CATGTCCATT CCAAGGTATT GTTAACCGCG ACACCGTTGC AAAACTCGTT GCTGGAGCTC TACGGCCTTG TCAGCATGAT CGATGAACGG GTCTTTGGCG ACATCGATAG CTTTCGCGTG CAATTCACGG CCCAAGGCAA AGACCAAGCC TATGGGGGTC TCCGAGAACG GTTAGCCGCA GTTTGCAAGC GTACCTTGCG CCAGCAAGTA CAGCCCTACG TTTCGTTTAC GGCGCGAAAG GCGATTGTCC ATGAGTTTAC GCCGTCGAGC GAAGAGCAGG AACTGTCCCA ACTCGTCGCC AATTATCTAC GTCGCCCTAA CCTTAAAGCC CTGCCCGAAG GACAGCGGCA ACTGATTTCG CTCGTATTGT GGAAGCTGCT TGCCTCCAGC ACGCGCGCGA TAGCTGGTGC TCTTGAAACA ATGGCAAATC GCCTTCAGAG CGTGCTCGAC AAAGCCGTCG CGGTGAACGA TTTGGCTGAA GAATTGGATG AAGACTACGA GGCGCTTGAC GAGACCGCCG ACGAATGGAA CGGACAAAAT TCGAGCGCGA ATACGCACGA CGGCACCGAA TCCGACGCCA TCGCGCAGGA GATTGAGGAG CTTCGGCACT TCAAGGCGTT GGCGACCAAT ATAAGGGATG ATGCAAAGGG CAAGGCGTTG CTTACAGCAC TGGATGGCGC CTTCGCGGAG CTGGACCGAT TGGGGGCAGA AAAGAAGGCG ATTATTTTCA CGGAGTCGAA GCGGACGCAG GAGTACCTGC TCCATCTATT GGCTGACACA CCCCACGGTG ACGGTGTCGT GCTGTTCAAT GGCACTAACA GTGATGCACG TGCCCAGGCT ATCTACAAGG ACTGGCTGAA GCGACATGCA GGCACCGATC GCATCACTGA CTCAAAGACA GCCGACACCC GCGCCGCCCT AGTCGAGCAT TTCAAGGAAC GCGGCAAGAT CATGATCGCC ACCGAGGCTG GGGCTGAAGG GATCAATCTC CAGTTTTGCT CGTTGGTCAT CAACTACGAC TTGCCGTGGA ACCCGCAGCG CATCGAGCAA CGCATTGGTC GCTGCCATCG TTACGGCCAG AAGTTCGATG TCGTCGTGGT GAACTTCGTC GATCTCAGTA ACGAAGCTGA CAGGCGCGTC TACGATTTGT TGTCGCAAAA ATTCCAACTC TTCGAGGGCG TATTTGGAGC CAGCGATGAA GTGTTGGGCG CGATCGGCTC GGGGGTCGAT TTCGAGCGAC GCATCGCTGA TATTTATCGA AACTGCCGAG AGTCCGATGA GATCAAGGCC AGCTTCGAGC AGCTCCAACT CGACCTCTCG AGCGAGATTA ACGAAGCGAT GATCAAGACG CGCCAACTCC TGCTGGAAAA CTTCGATGAG GAGGTACAGG AGAAACTGCG CATCCGAGCA GAAGACAGCC GCAGTACTCG CAGCCGCTTT GAGCGAATAT TGATGGACTT GACACGTTCG GAACTAGGCG CATGCGGGGT ATTCGACGAC CATGGGTTTG TCCTCCACCA CTCGCCGGAG GGCATCGAGA GCAGCTCCAT TGACGCAATC GAGCTTGGGC GCTACGAACT ACCTCGTCGC TCGGGTGATG CGCATCTGTA TCGCGTCAAC CATCCGCTGG CCCTTTTGAT AACAGAGCGG GCCAAAACTC GTGCTCTTGG CGGGGCTCGC CTCGTATTTG ACTACGACGC ACACGGGTCG CAGGTCAGTA CGCTCAAAGC TTATCGCGGC AAGGCCGGGT GGCTCACCGT GAAGCTGATT TCGGTCGAGG CGCTTGGCAA ACAGGAGCAG CATTTGCTCG TCGCCGCGAC CACGACCGAC GGCCTCGTCC TCGCCGAGGA AGACCCGGAG AAGCTGCTGC GCCTGCCCGC CACGGTAGAA GCAGAGGGAC TGTTCAGCAC CGCAGACAGT TCCCTGCTGG CCGACGCGGA AGTGCGGAAG ACTGTGCTGC TGCGCGGAGT CAATGAGCGC AACCTCGGTT ACTTTGAACA GGAAGTTCAG AAGCTGGATG CCTGGGCAGA CGACTTGAAG GTCGGGCTGG AGCAAGAAAT CAAGGAGATC GACCGCGAGA TCAAGGAAGT ACGGCGTACT GCCGCTACCT CACCAACGTT GCAGGAAAAG CTGTCGTGGC AAAAAAAACA ACGCGAATTG GAAGGCAAGC GAAGCAAGCT ACGGCGTGAG CTGTTCGTTC GGCAAGACGA AATAGAAGCG CAGCGCAACG ACCTCATCAG CGAGCTGGAG GCGAAGCTTC AGCAGCAAGT AGACGAACGC ACACTGTTTA CTGTCGAGTG GGAGTTGATT TAA
|
Protein sequence | MAERLLTPHQ SQYFAWLLTR RAAGDTVESL ASALVDSQVD LNPHQVEAAL FACRNPLSRG VILADEVGLG KTIEAGLVIS QRWAERRRRI LIIVPANLRK QWHQELQDKF SLQGLILESK NYNALRKSER QNPFLTSFEP VICSYQFAKS KAKDIKEIEW DLVVFDEAHR LRNVYKTSNV IAKTLKDALV HVHSKVLLTA TPLQNSLLEL YGLVSMIDER VFGDIDSFRV QFTAQGKDQA YGGLRERLAA VCKRTLRQQV QPYVSFTARK AIVHEFTPSS EEQELSQLVA NYLRRPNLKA LPEGQRQLIS LVLWKLLASS TRAIAGALET MANRLQSVLD KAVAVNDLAE ELDEDYEALD ETADEWNGQN SSANTHDGTE SDAIAQEIEE LRHFKALATN IRDDAKGKAL LTALDGAFAE LDRLGAEKKA IIFTESKRTQ EYLLHLLADT PHGDGVVLFN GTNSDARAQA IYKDWLKRHA GTDRITDSKT ADTRAALVEH FKERGKIMIA TEAGAEGINL QFCSLVINYD LPWNPQRIEQ RIGRCHRYGQ KFDVVVVNFV DLSNEADRRV YDLLSQKFQL FEGVFGASDE VLGAIGSGVD FERRIADIYR NCRESDEIKA SFEQLQLDLS SEINEAMIKT RQLLLENFDE EVQEKLRIRA EDSRSTRSRF ERILMDLTRS ELGACGVFDD HGFVLHHSPE GIESSSIDAI ELGRYELPRR SGDAHLYRVN HPLALLITER AKTRALGGAR LVFDYDAHGS QVSTLKAYRG KAGWLTVKLI SVEALGKQEQ HLLVAATTTD GLVLAEEDPE KLLRLPATVE AEGLFSTADS SLLADAEVRK TVLLRGVNER NLGYFEQEVQ KLDAWADDLK VGLEQEIKEI DREIKEVRRT AATSPTLQEK LSWQKKQREL EGKRSKLRRE LFVRQDEIEA QRNDLISELE AKLQQQVDER TLFTVEWELI
|
| |