Gene EcSMS35_3718 details


Gene Information

Locus tag: EcSMS35_3718
Symbol:
ID: 6142785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3787297
End bp: 3789393
Gene Length: 2097 bp
Protein Length: 698 aa
Translation table: 11
GC content: 49%
IMG OID: 641618544
Product: ATP-dependent DNA helicase RecQ
Protein accession: YP_001745684
Protein GI: 170684175
COG category: [L] Replication, recombination and repair
COG ID: [COG0514] Superfamily II DNA helicase
TIGRFAM ID: [TIGR00614] ATP-dependent DNA helicase, RecQ family
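
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3787297
end_bp = 3789393

# Assumption: coordinates are 1-based and inclusive at both ends,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not
# contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2097
print(protein_length)  # 698
```

Both results match the listed Gene Length (2097 bp) and Protein Length (698 aa).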


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGC ACGGAGCTGA ATTGCTTTTA CAGCGGATGT TAGGAAATAC CACCGCTACT 
TTCAGGGAAG GGCAATGGGA AGCCATAGAT GCAGTGGTTA ATCAGCGGCG AAAACTGCTG
GTTGTACAGC GCACCGGCTG GGGGAAAAGT GCCGTGTACT TCATCGCCAG TAAAATCTTC
CGCGATCGCG GCGCTGGCAC GACAATTATC ATCTCTCCTT TGCTTGCCCT GATGCGTAAC
CAGGTTGCCG CCGCTGAACG GTTAGGTATT ACGGCTGAAA CGTTGAATTC TACAAACAGG
GAAGAATGGC AGCGCATTAG CGATAAGTTG CTGCAAGGTG AGGTCGATTG CTTGCTGGTT
TCCCCGGAAC GTTTGGCAAA CCAGGACTTT ATCGAAACGG TTTTATATCC TATAGCCGAT
CGTATTGGCT TGCTTGTGGT CGACGAGGCG CACTGCATCT CTGATTGGGG CCACGATTTT
CGCCCGGATT ACCGACGTAT ATTAGATATT TTGCGCCAAC TGCCTGCGAA TACCCCTGTT
CTGGGTACAA CCGCGACAGC GAATAACCGT GTTGTTGAGG ATATCCGTCA GCAATTGGGT
GACATTGTGA TTCAGCGTGG AACGCTGGCT CGCGAAAGTT TGGCGCTTGA TGCCTTAGTC
TTAGGAGAAC AGTCATCCCG TCTGGCATGG TTAGCAACGG TTATCCCTCA GTTTTCCAAA
TCGGGAATTG TTTATACCCT GACGACTCGC GATGCTGAAC TTGTCGCCGA GTGGTTAAGG
AAAAATGGTA TCAGTGCATT TGCTTACTAC AGCGGCGTGA CTTGTGAAGG CGCGGAAGAT
TCTAATACTG CCAGGGAATA TCTGGAGCAG GCACTGCTGG CAAATAAAAT CAAAGTGCTG
GTCGCGACGA CGGCATTAGG TATGGGGTTT GATAAACCTG ATTTAGGTTT CGTCATTCAT
TATCAGATGC CGGGGTCTAT TGTCGGTTAC TACCAACAGG TGGGGCGTGC CGGGCGTGCT
ATAGATTCCG CAGTTGGCAT ATTGCTTTGT GGTGGTGAAG ACCGCGCTAT TCATCAATTC
TTCCGTGAAA GCGCCTTTCC TGCGGAGGCG CAAATTCATG AAATACTCAA CGTACTTAGC
GAGAATGACG GTCTTACCCT ACGAGGCATT GAACAACGGA CGAATCTTCG TTATGGGCAA
ATAGAAAAAG CACTGAAATT ATTGGTAGCG GAAAATCCAT CGCCTGTGGT GTATACCGAG
AAATTATGGC GCAGAACTAT CGTCAGTTTT TCTCCTGATC ATGAACGAAT TAACCATTTG
ATGAATCAGA GAAAAAGTGA ACTGGCAGAC GTTGAAAGCT ATATCACGAC CAAGGAGTGC
AAAATGCAAT TTCTGCGCCG TGCACTCGAT GAGCCAAGTG CCGAACGTTG TGGTAAATGT
AGCAGTTGTC TTCAGCATCC GTTATTGTCG CCCGACATTG ATAGCGGCTT ACTCCATGCG
GCAAATTTAT TTATTAAACA CGCTGACCTG CCGTTAAATC TCAATAAACA GATGGCAGCT
GGGGCTTTTA CTCAATACGG ATTTAAAGGG AACTTGCCTG CGGGTTTACA AGGATCTACG
GGGCGAATCC TTTCTCGTTG GGGAGATTCC GGGTGGGGAA AGCAGGTAGC ACAGGAGAAA
AAAACGGGGC GCTTAAGTGA TGAGCTGGTA GAAGCATGTG CGGAAATGGT TTGCCAACGC
TGGAATCCGC ATCCTGAACC AACCTGGGTA TGCTGCGTTC CTTCATTAAG GCACCTCGAC
CTGGTTCCTG ATTTTGCCCG GCGACTGGCG GCGAAACTTG GCTTACCTTT TATTGATGCC
ATTGAAAAAG TCGTGGACAA TCCACCGCAG AAAATGCAGC AAAACCGTTT TCACCAGTGT
CAAAATCTCG ACGGGGCGTT TGTGATTATC CCTCCTTTGA TGCCAGGTCC GGCGTTGCTG
GTTGACGATA TCGTGGATTC TGCATGGACG CTGACAGTTC TGACAGCACT GTTACGCCAG
GCAGGTTGCC CGACGGTTTA TCCTCTTGCC CTTGCGTCTA CCTCGGTAAA AAATTGA
 
Protein sequence
MEKHGAELLL QRMLGNTTAT FREGQWEAID AVVNQRRKLL VVQRTGWGKS AVYFIASKIF 
RDRGAGTTII ISPLLALMRN QVAAAERLGI TAETLNSTNR EEWQRISDKL LQGEVDCLLV
SPERLANQDF IETVLYPIAD RIGLLVVDEA HCISDWGHDF RPDYRRILDI LRQLPANTPV
LGTTATANNR VVEDIRQQLG DIVIQRGTLA RESLALDALV LGEQSSRLAW LATVIPQFSK
SGIVYTLTTR DAELVAEWLR KNGISAFAYY SGVTCEGAED SNTAREYLEQ ALLANKIKVL
VATTALGMGF DKPDLGFVIH YQMPGSIVGY YQQVGRAGRA IDSAVGILLC GGEDRAIHQF
FRESAFPAEA QIHEILNVLS ENDGLTLRGI EQRTNLRYGQ IEKALKLLVA ENPSPVVYTE
KLWRRTIVSF SPDHERINHL MNQRKSELAD VESYITTKEC KMQFLRRALD EPSAERCGKC
SSCLQHPLLS PDIDSGLLHA ANLFIKHADL PLNLNKQMAA GAFTQYGFKG NLPAGLQGST
GRILSRWGDS GWGKQVAQEK KTGRLSDELV EACAEMVCQR WNPHPEPTWV CCVPSLRHLD
LVPDFARRLA AKLGLPFIDA IEKVVDNPPQ KMQQNRFHQC QNLDGAFVII PPLMPGPALL
VDDIVDSAWT LTVLTALLRQ AGCPTVYPLA LASTSVKN
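
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record: the `translate` helper and the constant names are hypothetical, and the codon table is the standard one in NCBI base order (T, C, A, G), on the understanding that translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons it allows. Whitespace is stripped because the sequence is displayed in blocks of ten bases.

```python
# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGGAAAAGC ACGGAGCTGA ATTGCTTTTA"))  # MEKHGAELLL
```

The output matches the first ten residues of the protein sequence. Applied to the last twelve bases of the gene (GTAAAAAATTGA), the same function returns VKN and stops at the TGA codon, in agreement with the end of the protein sequence.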