Gene EcSMS35_2157 details

Gene Information

Locus tag: EcSMS35_2157
Symbol: helD
ID: 6146618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2161058
End bp: 2163112
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 53%
IMG OID: 641617033
Product: DNA helicase IV
Protein accession: YP_001744207
Protein GI: 170681833
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases
TIGRFAM ID:
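
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the record above.
START_BP = 2161058
END_BP = 2163112


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2055 684
```

Under those assumptions the result matches the listed Gene Length (2055 bp) and Protein Length (684 aa).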


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00794949
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.209046
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTGA AAGCGACAAC GCTTGGAAAA CGTCTGGCAC AGCACCCTTA CGATCGGGCG 
GTGATCCTCA ATGCCGGGAT TAAAGTCTCC GGCGATCGCC ACGAATACCT TATTCCTTTT
AATCAATTAC TGGCGATTCA CTGTAAGCGC GGTCTGGTAT GGGGCGAGCT GGAATTTGTA
CTGCCGGACG AAAAAGTGGT GCGTCTGCAC GGCACCGAAT GGGGCGAGAC GCAGCGTTTT
TACCATCATC TTGATACTCA CTGGCGGCGG TGGAGTGGGG AGATGAGCGA AATTGCGTCT
GGCGTTTTAC GTCAGCAACT GGATTTGATT GCCACGCGCA CCGGAGAAAA TAAATGGCTG
ACGCGTGAGC AAACCTCTGG TGTGCAGCAA CAAATCCGCC AGGCTTTGTC GGCGTTGCCG
TTGCCGGTTA ACCGACTGGA AGAATTCGAT AACTGCCGTG AGGCGTGGCG TAAATGTCAG
GCCTGGTTGA AGGACATTGA AGGCGCTCGG TTGCAGCATA ATCAGGCGTA TACCGAAGCC
ATGCTTACCG AGTATGCGGA TTTTTTCCGC CAGGTCGAGT CTTCACCGCT GAATCCGGCG
CAGGCCCGGG CAGTCGTTAA TGGCGAGCAT TCTTTGTTAG TGCTGGCTGG TGCAGGAAGC
GGAAAAACGT CGGTGCTGGT GGCCCGTGCA GGCTGGTTGC TGGCACGTGG TGAAGCGTCC
CCTGAGCAAA TTTTATTGCT GGCGTTTGGT CGCAAAGCCG CTGAAGAGAT GGACGAGCGT
ATTCGCGAAC GGCTACATAC CGAAGACATT ACCGCACGCA CATTTCATGC GCTGGCGCTG
CATATTATTC AGCAGGGCAG CAAAAAAGTT CCGATAGTCA GCAAACTGGA AAATGATACC
GCTGCCCGTC ATGAACTTTT TATTGCTGAG TGGCGCAAGC AATGCAGCGA AAAGAAAGCG
CAGGCGAAGG GCTGGCGGCA ATGGCTGACG GAAGAAATGC AGTGGTCAGT GCCAGAAGGT
AACTTCTGGG ATGATGAAAA ATTACAGCGT CGTCTTGCCT CACGCCTCGA TCGTTGGGTA
AGTCTGATGC GGATGCACGG TGGTGCACAG GCAGAAATGA TTGCCAGTGC ACCCGAAGAG
ATTCGCGATC TGTTCAGTAA ACGTATCAAG TTGATGGCTC CGTTATTAAA AGCCTGGAAA
GGTGCGCTGA AGGCAGAAAA CGCTGTCGAT TTTTCGGGCC TTATTCACCA GGCGATTGTG
ATTCTGGAGA AAGGTCGCTT TATCAGCCCG TGGAAGCATA TTCTGGTTGA TGAATTTCAG
GATATCTCGC CGCAGCGCGC AGCGTTGTTA GCGGCATTAC GCAAGCAAAA CAGTCAGACG
ACGTTGTTCG CCGTTGGTGA TGACTGGCAG GCGATTTATC GCTTCAGCGG TGCGCAAATG
TCGCTCACCA CCGCTTTCCA TGAAAACTTT GGTGAAGGTG ATCGCTGCGA TTTAGACACG
ACTTACCGTT TTAACAGTCG TATCGGTGAG GTGGCAAACC GGTTTATTCA GCAGAACCCA
GGCCAGCTGA AAAAGCCGCT AAACAGCTTA ACCAATGGAG ACAAAAAAGC CGTCACGTTA
TTGGATGAGA GTCAACTTGA CGCTTTGCTG GATAAGCTCT CTGGTTATGC CAAACCGGAA
GAGCGCATTC TGATCCTGGC GCGTTACCAT CACATGAGGC CTGCCAGCCT GGAAAAAGCG
GCAACACGCT GGCCGAAGTT GCAAATCGAC TTTATGACCA TTCATGCCAG CAAAGGGCAA
CAGGCGGATT ACGTCATCAT CGTTGGCTTG CAGGAGGGGA GTGATGGTTT TCCGGCTGCG
GCGCGGGAGT CGATTATGGA AGAGGCGCTA CTGCCACCGG TTGAGGATTT CCCGGACGCT
GAAGAACGGC GGTTAATGTA CGTGGCGCTG ACCCGGGCAC GCCATCGGGT ATGGGCACTG
TTTAACAAAG AGAATCCCTC TCCCTTTGTG GAAATACTGA AAAATCTGGA TGTGCCGGTG
GCGAGAAAAC CGTAA
 
Protein sequence
MELKATTLGK RLAQHPYDRA VILNAGIKVS GDRHEYLIPF NQLLAIHCKR GLVWGELEFV 
LPDEKVVRLH GTEWGETQRF YHHLDTHWRR WSGEMSEIAS GVLRQQLDLI ATRTGENKWL
TREQTSGVQQ QIRQALSALP LPVNRLEEFD NCREAWRKCQ AWLKDIEGAR LQHNQAYTEA
MLTEYADFFR QVESSPLNPA QARAVVNGEH SLLVLAGAGS GKTSVLVARA GWLLARGEAS
PEQILLLAFG RKAAEEMDER IRERLHTEDI TARTFHALAL HIIQQGSKKV PIVSKLENDT
AARHELFIAE WRKQCSEKKA QAKGWRQWLT EEMQWSVPEG NFWDDEKLQR RLASRLDRWV
SLMRMHGGAQ AEMIASAPEE IRDLFSKRIK LMAPLLKAWK GALKAENAVD FSGLIHQAIV
ILEKGRFISP WKHILVDEFQ DISPQRAALL AALRKQNSQT TLFAVGDDWQ AIYRFSGAQM
SLTTAFHENF GEGDRCDLDT TYRFNSRIGE VANRFIQQNP GQLKKPLNSL TNGDKKAVTL
LDESQLDALL DKLSGYAKPE ERILILARYH HMRPASLEKA ATRWPKLQID FMTIHASKGQ
QADYVIIVGL QEGSDGFPAA ARESIMEEAL LPPVEDFPDA EERRLMYVAL TRARHRVWAL
FNKENPSPFV EILKNLDVPV ARKP
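
As a consistency check between the gene and protein sequences, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the method used to produce the record: `translate` and `CODON_TABLE` are hypothetical names, and the table is the standard codon-to-amino-acid assignment, which translation table 11 shares (table 11 differs in the set of permitted start codons, not in how internal codons are read).

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGAACTGA AAGCGACAAC GCTTGGAAAA"))  # MELKATTLGK
print(translate("GCGAGAAAAC CGTAA"))                  # ARKP*
```

The first ten residues and the last four residues agree with the protein sequence above; the trailing `*` is the TAA stop codon, which is not part of the 684 aa protein.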