Gene EcSMS35_4189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4189 
SymbolrecQ 
ID6145557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4287169 
End bp4289004 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content55% 
IMG OID641619012 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001746140 
Protein GI170680861 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00948478 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATGTGG CGCAGGCGGA AGTGTTGAAT CTGGAGTCCG GGGCTAAACA GGTTTTACAA 
GAAACCTTTG GCTACCAACA GTTTCGCCCC GGCCAGGAAG AAATTATCGA CACTGTGCTT
TCCGGCCGCG ATTGCCTCGT TGTCATGCCC ACTGGTGGCG GAAAATCCCT TTGCTATCAA
ATCCCTGCCT TATTGCTAAA CGGCCTTACC GTGGTTGTTT CACCGCTGAT TTCGTTGATG
AAAGATCAGG TGGATCAACT GCAAGCCAAC GGCGTGGCGG CGGTGTGCCT TAACTCGACG
CAAACCCGCG AGCAGCAACT TGAAGTGATG ACAGGCTGTC GCACCGGGCA AATCCGCCTG
CTTTATATCG CGCCGGAACG CCTGATGCTG GATAACTTTC TTGAGCATCT GGCGCACTGG
AATCCGGTGT TATTAGCCGT TGATGAAGCG CACTGTATCT CCCAATGGGG TCACGATTTC
CGCCCGGAAT ACGCTGCCCT CGGTCAGCTG CGTCAGCGTT TTCCGACGCT GCCGTTTATG
GCGCTGACTG CCACAGCCGA TGACACCACG CGCCAGGATA TCGTGCGCCT GCTGGGGCTG
AACGATCCGC TGATTCAAAT CAGCAGTTTT GATCGTCCGA ATATTCGCTA CATGCTGATG
GAGAAGTTCA AACCGCTCGA TCAGTTGATG CGCTACGTGC AGGAACAGCG CGGTAAGTCC
GGCATTATCT ACTGCAACAG CCGGGCGAAA GTAGAAGACA CCGCTGCGCG CCTGCAAAGC
AAGGGTATTA GCGCGGCGGC CTATCATGCC GGGCTGGAAA ATAATGTCCG CGCCGACGTG
CAGGAAAAAT TTCAGCGCGA TGACCTGCAA ATTGTGGTGG CGACCGTGGC GTTTGGCATG
GGCATCAACA AGCCTAACGT CCGCTTCGTG GTCCACTTTG ATATTCCGCG CAATATCGAA
TCCTATTATC AGGAAACCGG TCGCGCCGGG CGTGATGGCC TGCCCGCGGA AGCGATGCTG
TTTTACGATC CGGCCGATAT GGCGTGGCTA CGCCGTTGTC TGGAAGAGAA GCCGCAGGGG
CAATTGCAGG ATATCGAGCG CCATAAACTC AATGCGATGG GCGCGTTTGC CGAAGCGCAA
ACTTGCCGTC GTCTGGTGCT GCTGAACTAC TTTGGCGAAG GGCGGCAGGA GCCGTGCGGG
AACTGCGATA TCTGCCTCGA TCCGCCGAAA CAATACGATG GTTCAACCGA TGCTCAGATT
GCCCTTTCCA CCATTGGTCG TGTGAATCAG CGGTTTGGGA TGGGCTATGT GGTGGAAGTT
ATTCGCGGTG CTAATAATCA GCGTATCCGC GACTATGGTC ATGACAAACT GAAAGTCTAT
GGCATGGGCC GTGATAAAAG CCATGAACAT TGGGTGAGCG TGATCCGCCA GCTGATTCAC
CTCGGCCTGG TGACGCAAAA TATTGCCCAG CATTCTGCCC TACAACTGAC AGAGGCTGCG
CGTCCGGTGC TGCGCGGCGA ATCCTCTTTG CAACTTGCCG TGCCGCGTAT CGTGGCGCTC
AAACCGAAAG CGATGCAGAA ATCCTTCGGC GGCAACTATG ATCGCAAACT GTTCGCCAAA
TTACGCAAAC TGCGTAAATC GATTGCCGAT GAAAGCAATG TCCCGCCGTA CGTGGTGTTT
AACGACGCAA CCTTGATTGA GATGGCTGAA CAGATGCCGA TCACCGCCAG CGAAATGCTC
AGCGTTAACG GCGTTGGGAT GCGCAAGCTG GAACGCTTTG GTAAACCGTT TATGGCGCTT
ATCCGCGCGC ATGTCGATGG TGACGACGAA GAGTAG
 
Protein sequence
MNVAQAEVLN LESGAKQVLQ ETFGYQQFRP GQEEIIDTVL SGRDCLVVMP TGGGKSLCYQ 
IPALLLNGLT VVVSPLISLM KDQVDQLQAN GVAAVCLNST QTREQQLEVM TGCRTGQIRL
LYIAPERLML DNFLEHLAHW NPVLLAVDEA HCISQWGHDF RPEYAALGQL RQRFPTLPFM
ALTATADDTT RQDIVRLLGL NDPLIQISSF DRPNIRYMLM EKFKPLDQLM RYVQEQRGKS
GIIYCNSRAK VEDTAARLQS KGISAAAYHA GLENNVRADV QEKFQRDDLQ IVVATVAFGM
GINKPNVRFV VHFDIPRNIE SYYQETGRAG RDGLPAEAML FYDPADMAWL RRCLEEKPQG
QLQDIERHKL NAMGAFAEAQ TCRRLVLLNY FGEGRQEPCG NCDICLDPPK QYDGSTDAQI
ALSTIGRVNQ RFGMGYVVEV IRGANNQRIR DYGHDKLKVY GMGRDKSHEH WVSVIRQLIH
LGLVTQNIAQ HSALQLTEAA RPVLRGESSL QLAVPRIVAL KPKAMQKSFG GNYDRKLFAK
LRKLRKSIAD ESNVPPYVVF NDATLIEMAE QMPITASEML SVNGVGMRKL ERFGKPFMAL
IRAHVDGDDE E