Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5808 |
Symbol | |
ID | 6969836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5447429 |
End bp | 5450287 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643389436 |
Product | helicase family protein |
Protein accession | YP_002273828 |
Protein GI | 209398792 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAGC AAAATTATGC ACCTGGTATG CGGGTGGTTA TTCGTGATGC CGAATGGCGT ATTCGCCGGG CGGATGACAG TGGTGATGGT GGGTATTTGC TGACCTGCGA TGGTATTTCA GAGCTTGTGC GCGGTAAAGA AGGGTTATTT CTGACCAAAC TGGAACAAAA AGTAGAGATC CTGGATCCCG CAAAAACGCA TTTGGTGGAA GATGAATCCG CCAATTATCA AGCGGCACAG TTGTATATTG AAAGTCAGTT ACGCCAACGC GTACCGACAG ACAGTAAAGT CCATTTTGGT CATCTGGCGG CGATGGATTC CATGCCGTTT CAGCTCGATC CTACGCGGAT GGCGCTGGCA CAACCGCGTC AGCGGATACT CATTGCCGAC GCTGTTGGTC TGGGTAAAAC GCTGGAAGCC GGTATTCTGG TTTCGGAACT GATCCGTCGT GGGCGCGGAA AACGTATTCT GGTTCTGGCG GTTAAATCGA TGCTGACACA ATTCCAGAAG GAGTTCTGGA GCCGTTTTGC GATCCCATTG ACCCGCCTGG ATTCTGCTGG TTTACAAAAG GTGCGTAACC GCATCCCAAC CAACCACAAC CCGTTCCACT ATTTTGACAA GACCATTATC TCCATCGATA CGCTGAAACA AGACATTGAA TATCGTCACC ATCTGGAAAA TGCCTGGTGG GATATTATTG TCATTGATGA GGCCCATAAC GTCGCAGAAC GTGGAACCAG TTCGTTACGC AGTAAACTGG CAAAACTGCT CGCCGGGCGC AGCGATACGC TGATTATGCT TTCAGCAACA CCGCATGATG GTAAGGCTGA AAGTTTTGCC AGCCTGATGA ACATGCTTGA CCCGACGGCG ATCGCCAACC CGAAAGAGTA TGAATACGCC GATTTTGCCG ATAAAAATCT GGTTGTCCGT CGCTTCAAAA AAGACGTGAA AGATCAGATG TCCGGCGAGT TCCCCGAGCG CAATATTGTC AAATTGACTC GCCTGGCATC CGGTGCCGAA GAAGAAGCAT ACCGTCGTCT GGTTGAGAGC CAGTTCCGTG ATGATGACGA TGAACAGGCC CAGTCTAACA AAGGTCGTCT GTTCAAGATA ACCCTCGAAA AAGCGCTATT TTCCAGCCCG ATGGCGTGTG CCAGCGTCGT GGCGAATCGT CTGAAGCGTC TTGAGAGCCG TAAAGATCAT AATAGCCAGA GCCAGATCAA CGAACTGGAG TCATTGCTGT TGGCGCTGAA TAACATTGAT GCCAGCCAGT TTAGCAAATA CCAGTTGCTG CTCGACACCA TCCGAAAAGA CCTCGCCTGG AAAGCCAATA ACACGGAAGA TCGCCTGGTG ATCTTCACCG AAAGTATTAA AACGCTGGAG TTCCTCGAAC AGCAACTGCG AGCGGATCTG AAATTGAAAG ATGACCAGAT CGCCACGCTG CGCGGCGATC AGGGCGATAC GGTATTGATG GAAACCGTAG AAGCCTTTGG TAAAACGCAG TCGCCGTTGC GTCTTCTGGT TTGTTCGGAT GTGGCATCTG AAGGGATCAA CCTGCATCAC CTTAGCCACA AAATGATTCA CTTTGATATT CCGTGGTCGT TGATGGTATT CCAGCAACGT AACGGGCGTA TTGACCGCTA TGGGCAAAAA CATCAGCCCC AAATCCGCTA TCTGCTTACC GAGGCCAGCG AACCACAGAT CAATGGCGAT ATGCGCGTAC TGGAAGTGTT GATCAACAAA GACGAACAGG CGCAGAAGAA CATTGGGGAC AGTTCGGAGT TTACCGGCAA ATTTACCCAG GAAGAGGAAG AAGAGCAGGT TGCAGAGTTC ATGATGCAGG ATGATGGTGC CAGCCTGTTT GATCAACTGC TGAACAGCAA CGTCTCGGAA AGCGCCGAAC ACGATCTGTT TGGCGAAATA TGCAGTGCGG TTTCCAGCGA TGCCTCAATG GTCACCGAAA CAGACACCAG CTTGTTTGCC AGCGAACAGG CGTACTGCGA AAGAGCATTG GGTTACCTGA AAGCCAGTGG TCAAACTATC CAGTATGAAA CCTTGCCTGA TAATACCTTG TCGCTGGTGG CACCGGAGGA GTTACGCCGC CGCTTCAACC AGCTACCGCC TGAAATTGCC CCGGAGAACT GGCAGCTCTA TTTAAGTCAG GATAAAACCG TCATCACCGA CGCGATTGCG CGCGCTCGCG GTGAGCAACA TGCCTGGCCC GATGTGCAGT ATCTCTGGCA AATCAACCCG GTAGTGCAGT GGCTGGACGA TAAAATCTCT TCGGCTTTTG GTCGTCATCA GGCTCCGGTT ATCCGGCTGC CATACCTGCT TGAACCCGAT GAAGATCACT TCATTCTTTC CGGGTTATTC CCGAACCGTA AATCACATCC GATGGTGAAC CCGTGGATAG TGGTGAGCTT TAACCGTGAA TCGCTAATCG GCAGTCAGCC TTTCGCTGAG TTTTTACAAC GTCATCCGCA GTTGAGTAAC AAGCTGACTA ACAGCGGCGG TAAAGATCGT AATCACCAAC GCCAGCAGGA TTTACTGGAA GCGGCTATTG CTCATGCCCG GGAGGTATTC ATCCATGATC GGAATGCGTT TGAAACGCAC ATTAACCAGC AACTGAATGA GCATCTGCAA AAGCTGGACG TTTTGCGTGG GCGGCAGTTG AGCCAACTTG AGCTGGATTT TGCCGATAAC AAACAGCAAT TGTCAGTCAA GCAGAGCCGT AAAGAGCAGA GACAACGCGA AATCGAACAC AATTTTGACA GCTATATCGA ATGGATTGAG GACACCATGA CGACTGAAAA AGAACCCTAC ATTCAGGTAA TTGCTGTTAT CACCGGAGCG GAGGGTTAA
|
Protein sequence | MNKQNYAPGM RVVIRDAEWR IRRADDSGDG GYLLTCDGIS ELVRGKEGLF LTKLEQKVEI LDPAKTHLVE DESANYQAAQ LYIESQLRQR VPTDSKVHFG HLAAMDSMPF QLDPTRMALA QPRQRILIAD AVGLGKTLEA GILVSELIRR GRGKRILVLA VKSMLTQFQK EFWSRFAIPL TRLDSAGLQK VRNRIPTNHN PFHYFDKTII SIDTLKQDIE YRHHLENAWW DIIVIDEAHN VAERGTSSLR SKLAKLLAGR SDTLIMLSAT PHDGKAESFA SLMNMLDPTA IANPKEYEYA DFADKNLVVR RFKKDVKDQM SGEFPERNIV KLTRLASGAE EEAYRRLVES QFRDDDDEQA QSNKGRLFKI TLEKALFSSP MACASVVANR LKRLESRKDH NSQSQINELE SLLLALNNID ASQFSKYQLL LDTIRKDLAW KANNTEDRLV IFTESIKTLE FLEQQLRADL KLKDDQIATL RGDQGDTVLM ETVEAFGKTQ SPLRLLVCSD VASEGINLHH LSHKMIHFDI PWSLMVFQQR NGRIDRYGQK HQPQIRYLLT EASEPQINGD MRVLEVLINK DEQAQKNIGD SSEFTGKFTQ EEEEEQVAEF MMQDDGASLF DQLLNSNVSE SAEHDLFGEI CSAVSSDASM VTETDTSLFA SEQAYCERAL GYLKASGQTI QYETLPDNTL SLVAPEELRR RFNQLPPEIA PENWQLYLSQ DKTVITDAIA RARGEQHAWP DVQYLWQINP VVQWLDDKIS SAFGRHQAPV IRLPYLLEPD EDHFILSGLF PNRKSHPMVN PWIVVSFNRE SLIGSQPFAE FLQRHPQLSN KLTNSGGKDR NHQRQQDLLE AAIAHAREVF IHDRNAFETH INQQLNEHLQ KLDVLRGRQL SQLELDFADN KQQLSVKQSR KEQRQREIEH NFDSYIEWIE DTMTTEKEPY IQVIAVITGA EG
|
| |