Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4189 |
Symbol | recQ |
ID | 6145557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4287169 |
End bp | 4289004 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619012 |
Product | ATP-dependent DNA helicase RecQ |
Protein accession | YP_001746140 |
Protein GI | 170680861 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0514] Superfamily II DNA helicase |
TIGRFAM ID | [TIGR00614] ATP-dependent DNA helicase, RecQ family [TIGR01389] ATP-dependent DNA helicase RecQ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.00948478 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAATGTGG CGCAGGCGGA AGTGTTGAAT CTGGAGTCCG GGGCTAAACA GGTTTTACAA GAAACCTTTG GCTACCAACA GTTTCGCCCC GGCCAGGAAG AAATTATCGA CACTGTGCTT TCCGGCCGCG ATTGCCTCGT TGTCATGCCC ACTGGTGGCG GAAAATCCCT TTGCTATCAA ATCCCTGCCT TATTGCTAAA CGGCCTTACC GTGGTTGTTT CACCGCTGAT TTCGTTGATG AAAGATCAGG TGGATCAACT GCAAGCCAAC GGCGTGGCGG CGGTGTGCCT TAACTCGACG CAAACCCGCG AGCAGCAACT TGAAGTGATG ACAGGCTGTC GCACCGGGCA AATCCGCCTG CTTTATATCG CGCCGGAACG CCTGATGCTG GATAACTTTC TTGAGCATCT GGCGCACTGG AATCCGGTGT TATTAGCCGT TGATGAAGCG CACTGTATCT CCCAATGGGG TCACGATTTC CGCCCGGAAT ACGCTGCCCT CGGTCAGCTG CGTCAGCGTT TTCCGACGCT GCCGTTTATG GCGCTGACTG CCACAGCCGA TGACACCACG CGCCAGGATA TCGTGCGCCT GCTGGGGCTG AACGATCCGC TGATTCAAAT CAGCAGTTTT GATCGTCCGA ATATTCGCTA CATGCTGATG GAGAAGTTCA AACCGCTCGA TCAGTTGATG CGCTACGTGC AGGAACAGCG CGGTAAGTCC GGCATTATCT ACTGCAACAG CCGGGCGAAA GTAGAAGACA CCGCTGCGCG CCTGCAAAGC AAGGGTATTA GCGCGGCGGC CTATCATGCC GGGCTGGAAA ATAATGTCCG CGCCGACGTG CAGGAAAAAT TTCAGCGCGA TGACCTGCAA ATTGTGGTGG CGACCGTGGC GTTTGGCATG GGCATCAACA AGCCTAACGT CCGCTTCGTG GTCCACTTTG ATATTCCGCG CAATATCGAA TCCTATTATC AGGAAACCGG TCGCGCCGGG CGTGATGGCC TGCCCGCGGA AGCGATGCTG TTTTACGATC CGGCCGATAT GGCGTGGCTA CGCCGTTGTC TGGAAGAGAA GCCGCAGGGG CAATTGCAGG ATATCGAGCG CCATAAACTC AATGCGATGG GCGCGTTTGC CGAAGCGCAA ACTTGCCGTC GTCTGGTGCT GCTGAACTAC TTTGGCGAAG GGCGGCAGGA GCCGTGCGGG AACTGCGATA TCTGCCTCGA TCCGCCGAAA CAATACGATG GTTCAACCGA TGCTCAGATT GCCCTTTCCA CCATTGGTCG TGTGAATCAG CGGTTTGGGA TGGGCTATGT GGTGGAAGTT ATTCGCGGTG CTAATAATCA GCGTATCCGC GACTATGGTC ATGACAAACT GAAAGTCTAT GGCATGGGCC GTGATAAAAG CCATGAACAT TGGGTGAGCG TGATCCGCCA GCTGATTCAC CTCGGCCTGG TGACGCAAAA TATTGCCCAG CATTCTGCCC TACAACTGAC AGAGGCTGCG CGTCCGGTGC TGCGCGGCGA ATCCTCTTTG CAACTTGCCG TGCCGCGTAT CGTGGCGCTC AAACCGAAAG CGATGCAGAA ATCCTTCGGC GGCAACTATG ATCGCAAACT GTTCGCCAAA TTACGCAAAC TGCGTAAATC GATTGCCGAT GAAAGCAATG TCCCGCCGTA CGTGGTGTTT AACGACGCAA CCTTGATTGA GATGGCTGAA CAGATGCCGA TCACCGCCAG CGAAATGCTC AGCGTTAACG GCGTTGGGAT GCGCAAGCTG GAACGCTTTG GTAAACCGTT TATGGCGCTT ATCCGCGCGC ATGTCGATGG TGACGACGAA GAGTAG
|
Protein sequence | MNVAQAEVLN LESGAKQVLQ ETFGYQQFRP GQEEIIDTVL SGRDCLVVMP TGGGKSLCYQ IPALLLNGLT VVVSPLISLM KDQVDQLQAN GVAAVCLNST QTREQQLEVM TGCRTGQIRL LYIAPERLML DNFLEHLAHW NPVLLAVDEA HCISQWGHDF RPEYAALGQL RQRFPTLPFM ALTATADDTT RQDIVRLLGL NDPLIQISSF DRPNIRYMLM EKFKPLDQLM RYVQEQRGKS GIIYCNSRAK VEDTAARLQS KGISAAAYHA GLENNVRADV QEKFQRDDLQ IVVATVAFGM GINKPNVRFV VHFDIPRNIE SYYQETGRAG RDGLPAEAML FYDPADMAWL RRCLEEKPQG QLQDIERHKL NAMGAFAEAQ TCRRLVLLNY FGEGRQEPCG NCDICLDPPK QYDGSTDAQI ALSTIGRVNQ RFGMGYVVEV IRGANNQRIR DYGHDKLKVY GMGRDKSHEH WVSVIRQLIH LGLVTQNIAQ HSALQLTEAA RPVLRGESSL QLAVPRIVAL KPKAMQKSFG GNYDRKLFAK LRKLRKSIAD ESNVPPYVVF NDATLIEMAE QMPITASEML SVNGVGMRKL ERFGKPFMAL IRAHVDGDDE E
|
| |