Gene ECH74115_5263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5263 
SymbolrecQ 
ID6970470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4905283 
End bp4907118 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content55% 
IMG OID643388927 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_002273341 
Protein GI209400089 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.695622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGTGG CGCAGGCGGA AGTGTTGAAT CTGGAGTCCG GAGCTAAACA GGTTTTACAA 
GAAACCTTTG GCTACCAACA GTTTCGCCCC GGCCAGGAAG AAATTATCGA CACTGTGCTT
TCCGGTCGCG ATTGCCTGGT CGTCATGCCC ACCGGTGGCG GAAAATCCCT TTGCTATCAA
ATTCCTGCCT TATTGCTAAA CGGCCTTACC GTGGTTGTTT CACCGCTGAT TTCGTTGATG
AAAGATCAGG TGGATCAACT GCAAGCCAAC GGCGTGGCGG CGGCGTGCCT TAACTCGACG
CAAACCCGCG AACAGCAACT TGAAGTGATG ACAGGCTGCC GCACCGGGCA AATTCGTCTG
CTTTATATCG CCCCGGAACG CCTGATGCTG GATAACTTTC TTGAGCATCT GGCGCACTGG
AATCCGGTGT TATTAGCCGT TGATGAAGCG CACTGTATCT CCCAATGGGG CCACGATTTC
CGCCCGGAAT ATGCCGCGCT CGGTCAGTTG CGCCAGCGGT TCCCGACGCT GCCGTTTATG
GCGCTGACCG CCACAGCCGA CGACACCACG CGCCAGGATA TCGTGCGCCT GCTGGGGCTG
AACGATCCGC TGATTCAAAT CAGCAGTTTT GACCGTCCGA ATATTCGCTA CATGCTGATG
GAGAAATTCA AACCGCTCGA TCAGTTGATG CGCTACGTGC AGGAACAGCG CGGTAAGTCC
GGCATTATCT ACTGCAACAG CCGGGCGAAA GTAGAAGACA CCGCTGCGCG CCTGCAAAGC
AAGGGTATTA GCGCGGCGGC CTATCATGCC GGGCTGGAAA ATAATGTCCG CGCCGACGTG
CAGGAGAAAT TCCAGCGCGA TGACCTGCAA ATTGTGGTGG CGACGGTGGC GTTCGGCATG
GGCATCAATA AACCAAACGT TCGCTTCGTG GTCCACTTTG ATATTCCGCG CAATATCGAA
TCCTATTATC AGGAAACCGG TCGCGCCGGG CGTGATGGCC TGCCTGCGGA AGCGATGTTG
TTTTACGATC CAGCTGATAT GGCGTGGCTG CGCCGTTGTC TGGAAGAGAA GCCGCAGGGG
CAGTTGCAGG ATATCGAGCG CCACAAACTC AATGCGATGG GCGCGTTTGC CGAAGCGCAA
ACTTGCCGTC GTCTGGTATT GCTGAACTAT TTTGGCGAAG GGCGTCAGGA GCCGTGCGGG
AACTGCGATA TCTGCCTCGA TCCGCCGAAA CAGTACGACG GTTCAACCGA TGCTCAGATT
GCCCTTTCCA CCATTGGTCG TGTGAATCAG CGGTTTGGGA TGGGTTATGT GGTGGAAGTG
ATTCGTGGTG CTAATAACCA GCGTATCCGC GACTATGGTC ATGACAAACT GAAAGTCTAT
GGCATGGGCC GTGATAAAAG CCATGAACAT TGGGTGAGCG TGATCCGCCA GCTGATTCAC
CTCGGCCTGG TGACGCAAAA TATTGCCCAG CATTCTGCCC TACAACTGAC AGAGGCCGCG
CGCCCGGTGC TGCGCGGCGA ATCCTCTTTA CAACTTGCCG TGCCGCGTAT CGTGGCGCTC
AAACCGAAAG CGATGCAGAA ATCGTTCGGC GGCAACTATG ATCGCAAACT GTTCGCCAAA
TTACGCAAAC TGCGTAAATC GATTGCCGAT GAAAGTAATG TCCCGCCGTA CGTGGTGTTT
AACGACGCAA CCTTGATTGA GATGGCTGAA CAGATGCCGA TCACCGCCAG CGAAATGCTC
AGCGTTAACG GCGTTGGGAT GCGCAAGCTG GAACGCTTTG GTAAACCGTT TATGGCGCTG
ATTCGTGCGC ATGTCGATGG CGACGACGAA GAGTAG
 
Protein sequence
MNVAQAEVLN LESGAKQVLQ ETFGYQQFRP GQEEIIDTVL SGRDCLVVMP TGGGKSLCYQ 
IPALLLNGLT VVVSPLISLM KDQVDQLQAN GVAAACLNST QTREQQLEVM TGCRTGQIRL
LYIAPERLML DNFLEHLAHW NPVLLAVDEA HCISQWGHDF RPEYAALGQL RQRFPTLPFM
ALTATADDTT RQDIVRLLGL NDPLIQISSF DRPNIRYMLM EKFKPLDQLM RYVQEQRGKS
GIIYCNSRAK VEDTAARLQS KGISAAAYHA GLENNVRADV QEKFQRDDLQ IVVATVAFGM
GINKPNVRFV VHFDIPRNIE SYYQETGRAG RDGLPAEAML FYDPADMAWL RRCLEEKPQG
QLQDIERHKL NAMGAFAEAQ TCRRLVLLNY FGEGRQEPCG NCDICLDPPK QYDGSTDAQI
ALSTIGRVNQ RFGMGYVVEV IRGANNQRIR DYGHDKLKVY GMGRDKSHEH WVSVIRQLIH
LGLVTQNIAQ HSALQLTEAA RPVLRGESSL QLAVPRIVAL KPKAMQKSFG GNYDRKLFAK
LRKLRKSIAD ESNVPPYVVF NDATLIEMAE QMPITASEML SVNGVGMRKL ERFGKPFMAL
IRAHVDGDDE E