Gene EcHS_A4046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4046 
SymbolrecQ 
ID5593227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4036341 
End bp4038170 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content55% 
IMG OID640923150 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001460616 
Protein GI157163298 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGCAGG CGGAAGTGTT GAATCTGGAG TCCGGAGCTA AACAGGTTTT ACAAGAAACC 
TTTGGCTACC AACAGTTTCG CCCCGGCCAG GAAGAAATTA TCGACACTGT GCTTTCCGGT
CGCGATTGCC TGGTCGTCAT GCCCACCGGT GGCGGAAAAT CCCTTTGCTA TCAAATTCCT
GCCTTATTGC TAAACGGCCT TACCGTGGTT GTTTCACCGC TGATTTCGTT GATGAAAGAT
CAGGTGGATC AACTGCAAGC CAACGGCGTG GCGGCGGCGT GCCTTAACTC GACGCAAACC
CGCGAACAGC AACTTGAAGT GATGACAGGC TGCCGCACCG GGCAAATTCG CTTACTGTAT
ATCGCCCCGG AACGCCTGAT GCTGGATAAC TTTCTTGAGC ATCTGGCGCA CTGGAATCCG
GTGTTATTAG CCGTCGATGA AGCGCACTGT ATCTCCCAAT GGGGCCACGA TTTCCGCCCG
GAATATGCCG CGCTCGGTCA GTTGCGCCAG CGGTTCCCGA CGCTGCCGTT TATGGCGCTG
ACCGCCACAG CCGACGACAC CACGCGCCAG GATATCGTGC GCCTGCTGGG GCTGAACGAT
CCGCTGATTC AAATCAGCAG TTTTGACCGT CCGAATATTC GCTACATGCT GATGGAGAAG
TTCAAACCGC TCGATCAGTT GATGCGCTAC GTGCAGGAAC AGCGCGGTAA GTCAGGCATT
ATCTACTGCA ACAGCCGCGC GAAAGTAGAA GACACCGCTG CGCGCCTGCA AAGCAAGGGA
ATTAGCGCGG CGGCCTATCA TGCCGGGCTG GAAAATAATG TTCGCGCCGA TGTGCAGGAA
AAATTCCAGC GCGATGACCT GCAAATTGTG GTGGCGACGG TGGCGTTCGG CATGGGCATC
AATAAACCAA ACGTTCGCTT CGTGGTCCAC TTTGATATTC CGCGCAATAT CGAATCCTAT
TATCAGGAAA CCGGACGCGC CGGGCGTGAT GGCCTGCCCG CGGAAGCGAT GCTGTTTTAC
GATCCGGCTG ATATGGCGTG GCTGCGCCGT TGTCTGGAAG AGAAGCCGCA GGGGCAGTTG
CAGGATATCG AGCGCCACAA ACTCAATGCG ATGGGCGCGT TTGCCGAAGC GCAAACTTGC
CGTCGTCTGG TATTGCTGAA CTATTTTGGC GAAGGGCGTC AGGAGCCGTG CGGGAACTGC
GATATCTGCC TCGATCCGCC GAAACAGTAC GACGGTTCAA CCGATGCTCA GATTGCCCTT
TCCACCATTG GTCGTGTGAA TCAGCGGTTT GGGATGGGTT ATGTGGTGGA AGTGATTCGT
GGTGCTAATA ACCAGCGTAT CCGCGACTAT GGTCATGACA AACTGAAAGT CTATGGCATG
GGCCGTGATA AAAGCCATGA ACATTGGGTG AGCGTGATCC GCCAGCTGAT TCACCTCGGC
CTGGTGACGC AAAATATTGC CCAGCATTCT GCCCTACAAC TGACAGAGGC CGCGCGCCCG
GTGCTGCGCG GCGAATCCTC TTTGCAACTT GCCGTGCCGC GTATCGTGGC GCTCAAACCG
AAAGCGATGC AGAAATCGTT CGGCGGCAAC TATGATCGCA AACTGTTCGC CAAATTACGC
AAACTGCGTA AATCGATTGC CGATGAAAGC AATGTCCCGC CGTACGTGGT GTTTAACGAC
GCAACCTTGA TTGAGATGGC TGAACAGATG CCGATCACCG CCAGCGAAAT GCTCAGCGTT
AACGGCGTTG GGATGCGCAA GCTGGAACGC TTTGGTAAAC CGTTTATGGC GCTTATCCGC
GCGCATGTTG ACGGCGACGA CGAAGAGTAG
 
Protein sequence
MAQAEVLNLE SGAKQVLQET FGYQQFRPGQ EEIIDTVLSG RDCLVVMPTG GGKSLCYQIP 
ALLLNGLTVV VSPLISLMKD QVDQLQANGV AAACLNSTQT REQQLEVMTG CRTGQIRLLY
IAPERLMLDN FLEHLAHWNP VLLAVDEAHC ISQWGHDFRP EYAALGQLRQ RFPTLPFMAL
TATADDTTRQ DIVRLLGLND PLIQISSFDR PNIRYMLMEK FKPLDQLMRY VQEQRGKSGI
IYCNSRAKVE DTAARLQSKG ISAAAYHAGL ENNVRADVQE KFQRDDLQIV VATVAFGMGI
NKPNVRFVVH FDIPRNIESY YQETGRAGRD GLPAEAMLFY DPADMAWLRR CLEEKPQGQL
QDIERHKLNA MGAFAEAQTC RRLVLLNYFG EGRQEPCGNC DICLDPPKQY DGSTDAQIAL
STIGRVNQRF GMGYVVEVIR GANNQRIRDY GHDKLKVYGM GRDKSHEHWV SVIRQLIHLG
LVTQNIAQHS ALQLTEAARP VLRGESSLQL AVPRIVALKP KAMQKSFGGN YDRKLFAKLR
KLRKSIADES NVPPYVVFND ATLIEMAEQM PITASEMLSV NGVGMRKLER FGKPFMALIR
AHVDGDDEE