Gene ECH74115_5808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5808 
Symbol 
ID6969836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5447429 
End bp5450287 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content50% 
IMG OID643389436 
Producthelicase family protein 
Protein accessionYP_002273828 
Protein GI209398792 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGC AAAATTATGC ACCTGGTATG CGGGTGGTTA TTCGTGATGC CGAATGGCGT 
ATTCGCCGGG CGGATGACAG TGGTGATGGT GGGTATTTGC TGACCTGCGA TGGTATTTCA
GAGCTTGTGC GCGGTAAAGA AGGGTTATTT CTGACCAAAC TGGAACAAAA AGTAGAGATC
CTGGATCCCG CAAAAACGCA TTTGGTGGAA GATGAATCCG CCAATTATCA AGCGGCACAG
TTGTATATTG AAAGTCAGTT ACGCCAACGC GTACCGACAG ACAGTAAAGT CCATTTTGGT
CATCTGGCGG CGATGGATTC CATGCCGTTT CAGCTCGATC CTACGCGGAT GGCGCTGGCA
CAACCGCGTC AGCGGATACT CATTGCCGAC GCTGTTGGTC TGGGTAAAAC GCTGGAAGCC
GGTATTCTGG TTTCGGAACT GATCCGTCGT GGGCGCGGAA AACGTATTCT GGTTCTGGCG
GTTAAATCGA TGCTGACACA ATTCCAGAAG GAGTTCTGGA GCCGTTTTGC GATCCCATTG
ACCCGCCTGG ATTCTGCTGG TTTACAAAAG GTGCGTAACC GCATCCCAAC CAACCACAAC
CCGTTCCACT ATTTTGACAA GACCATTATC TCCATCGATA CGCTGAAACA AGACATTGAA
TATCGTCACC ATCTGGAAAA TGCCTGGTGG GATATTATTG TCATTGATGA GGCCCATAAC
GTCGCAGAAC GTGGAACCAG TTCGTTACGC AGTAAACTGG CAAAACTGCT CGCCGGGCGC
AGCGATACGC TGATTATGCT TTCAGCAACA CCGCATGATG GTAAGGCTGA AAGTTTTGCC
AGCCTGATGA ACATGCTTGA CCCGACGGCG ATCGCCAACC CGAAAGAGTA TGAATACGCC
GATTTTGCCG ATAAAAATCT GGTTGTCCGT CGCTTCAAAA AAGACGTGAA AGATCAGATG
TCCGGCGAGT TCCCCGAGCG CAATATTGTC AAATTGACTC GCCTGGCATC CGGTGCCGAA
GAAGAAGCAT ACCGTCGTCT GGTTGAGAGC CAGTTCCGTG ATGATGACGA TGAACAGGCC
CAGTCTAACA AAGGTCGTCT GTTCAAGATA ACCCTCGAAA AAGCGCTATT TTCCAGCCCG
ATGGCGTGTG CCAGCGTCGT GGCGAATCGT CTGAAGCGTC TTGAGAGCCG TAAAGATCAT
AATAGCCAGA GCCAGATCAA CGAACTGGAG TCATTGCTGT TGGCGCTGAA TAACATTGAT
GCCAGCCAGT TTAGCAAATA CCAGTTGCTG CTCGACACCA TCCGAAAAGA CCTCGCCTGG
AAAGCCAATA ACACGGAAGA TCGCCTGGTG ATCTTCACCG AAAGTATTAA AACGCTGGAG
TTCCTCGAAC AGCAACTGCG AGCGGATCTG AAATTGAAAG ATGACCAGAT CGCCACGCTG
CGCGGCGATC AGGGCGATAC GGTATTGATG GAAACCGTAG AAGCCTTTGG TAAAACGCAG
TCGCCGTTGC GTCTTCTGGT TTGTTCGGAT GTGGCATCTG AAGGGATCAA CCTGCATCAC
CTTAGCCACA AAATGATTCA CTTTGATATT CCGTGGTCGT TGATGGTATT CCAGCAACGT
AACGGGCGTA TTGACCGCTA TGGGCAAAAA CATCAGCCCC AAATCCGCTA TCTGCTTACC
GAGGCCAGCG AACCACAGAT CAATGGCGAT ATGCGCGTAC TGGAAGTGTT GATCAACAAA
GACGAACAGG CGCAGAAGAA CATTGGGGAC AGTTCGGAGT TTACCGGCAA ATTTACCCAG
GAAGAGGAAG AAGAGCAGGT TGCAGAGTTC ATGATGCAGG ATGATGGTGC CAGCCTGTTT
GATCAACTGC TGAACAGCAA CGTCTCGGAA AGCGCCGAAC ACGATCTGTT TGGCGAAATA
TGCAGTGCGG TTTCCAGCGA TGCCTCAATG GTCACCGAAA CAGACACCAG CTTGTTTGCC
AGCGAACAGG CGTACTGCGA AAGAGCATTG GGTTACCTGA AAGCCAGTGG TCAAACTATC
CAGTATGAAA CCTTGCCTGA TAATACCTTG TCGCTGGTGG CACCGGAGGA GTTACGCCGC
CGCTTCAACC AGCTACCGCC TGAAATTGCC CCGGAGAACT GGCAGCTCTA TTTAAGTCAG
GATAAAACCG TCATCACCGA CGCGATTGCG CGCGCTCGCG GTGAGCAACA TGCCTGGCCC
GATGTGCAGT ATCTCTGGCA AATCAACCCG GTAGTGCAGT GGCTGGACGA TAAAATCTCT
TCGGCTTTTG GTCGTCATCA GGCTCCGGTT ATCCGGCTGC CATACCTGCT TGAACCCGAT
GAAGATCACT TCATTCTTTC CGGGTTATTC CCGAACCGTA AATCACATCC GATGGTGAAC
CCGTGGATAG TGGTGAGCTT TAACCGTGAA TCGCTAATCG GCAGTCAGCC TTTCGCTGAG
TTTTTACAAC GTCATCCGCA GTTGAGTAAC AAGCTGACTA ACAGCGGCGG TAAAGATCGT
AATCACCAAC GCCAGCAGGA TTTACTGGAA GCGGCTATTG CTCATGCCCG GGAGGTATTC
ATCCATGATC GGAATGCGTT TGAAACGCAC ATTAACCAGC AACTGAATGA GCATCTGCAA
AAGCTGGACG TTTTGCGTGG GCGGCAGTTG AGCCAACTTG AGCTGGATTT TGCCGATAAC
AAACAGCAAT TGTCAGTCAA GCAGAGCCGT AAAGAGCAGA GACAACGCGA AATCGAACAC
AATTTTGACA GCTATATCGA ATGGATTGAG GACACCATGA CGACTGAAAA AGAACCCTAC
ATTCAGGTAA TTGCTGTTAT CACCGGAGCG GAGGGTTAA
 
Protein sequence
MNKQNYAPGM RVVIRDAEWR IRRADDSGDG GYLLTCDGIS ELVRGKEGLF LTKLEQKVEI 
LDPAKTHLVE DESANYQAAQ LYIESQLRQR VPTDSKVHFG HLAAMDSMPF QLDPTRMALA
QPRQRILIAD AVGLGKTLEA GILVSELIRR GRGKRILVLA VKSMLTQFQK EFWSRFAIPL
TRLDSAGLQK VRNRIPTNHN PFHYFDKTII SIDTLKQDIE YRHHLENAWW DIIVIDEAHN
VAERGTSSLR SKLAKLLAGR SDTLIMLSAT PHDGKAESFA SLMNMLDPTA IANPKEYEYA
DFADKNLVVR RFKKDVKDQM SGEFPERNIV KLTRLASGAE EEAYRRLVES QFRDDDDEQA
QSNKGRLFKI TLEKALFSSP MACASVVANR LKRLESRKDH NSQSQINELE SLLLALNNID
ASQFSKYQLL LDTIRKDLAW KANNTEDRLV IFTESIKTLE FLEQQLRADL KLKDDQIATL
RGDQGDTVLM ETVEAFGKTQ SPLRLLVCSD VASEGINLHH LSHKMIHFDI PWSLMVFQQR
NGRIDRYGQK HQPQIRYLLT EASEPQINGD MRVLEVLINK DEQAQKNIGD SSEFTGKFTQ
EEEEEQVAEF MMQDDGASLF DQLLNSNVSE SAEHDLFGEI CSAVSSDASM VTETDTSLFA
SEQAYCERAL GYLKASGQTI QYETLPDNTL SLVAPEELRR RFNQLPPEIA PENWQLYLSQ
DKTVITDAIA RARGEQHAWP DVQYLWQINP VVQWLDDKIS SAFGRHQAPV IRLPYLLEPD
EDHFILSGLF PNRKSHPMVN PWIVVSFNRE SLIGSQPFAE FLQRHPQLSN KLTNSGGKDR
NHQRQQDLLE AAIAHAREVF IHDRNAFETH INQQLNEHLQ KLDVLRGRQL SQLELDFADN
KQQLSVKQSR KEQRQREIEH NFDSYIEWIE DTMTTEKEPY IQVIAVITGA EG