Gene ECH74115_3008 details


Gene Information

Locus tag: ECH74115_3008
Symbol:
ID: 6968785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2789714
End bp: 2791654
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 52%
IMG OID: 643386846
Product: hypothetical protein
Protein accession: YP_002271314
Protein GI: 209397213
COG category: [R] General function prediction only
COG ID: [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains
TIGRFAM ID:
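
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
gene_length = span_length(2789714, 2791654)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1941 646
```

Under these assumptions the computed values agree with the listed Gene Length (1941 bp) and Protein Length (646 aa).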


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.351476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCA CTTTATATAC TGCTACTGGT GAGTGCGTTA CGCCAGGCCG TGAACTGGGC 
AAAGGTGGCG AAGGCGCGGT TTATGATATC AATGAGTTTG TCGATAGCGT CGCCAAGATT
TATCACACGC CGCCACCCGC CTTAAAACAG GACAAACTTG CCTTTATGGC TGCGACAGCT
GACGCGCAGT TGTTGAATTA TGTCGCCTGG CCGCAGGCAA CGCTTCACGG TGGACGAGGC
GGAAAAGTTA TCGGTTTTAT GATGCCAAAA GTTTCTGGTA AAGAACCGAT TCATATGATC
TATAGCCCGG CACATCGTCG CCAGAGTTAT CCTCATTGTG CGTGGGATTT TCTACTCTAT
GTTGCGCGCA ATATTGCTTC ATCTTTTGCT ACGGTTCACG AGCACGGGCA CGTCGTGGGG
GACGTAAACC AGAACAGCTT TATGGTAGGT CGCGACAGCA AAGTGGTGTT GATCGATAGT
GACTCCTTTC AGATTAACGC CAATGGCACA CTGCATTTAT GCGAAGTCGG CGTGTCGCAT
TTTACGCCGC CAGAGCTACA AACCTTGCCA TCATTTGTCG GTTTTGAACG TACCGCGAAT
CACGATAATT TTGGCCTTGC GTTGCTGATT TTTCACGTCT TGTTTGGTGG GCGGCATCCT
TATTCTGGTG TGCCGCTTAT CTCTGATGCG GGTAATGCGC TGGAGACGGA TATTGCCCAT
TTCCGTTATG CCTACGCGTC AGACAACCAG CGACGTGGTT TAAAACCGCC GCCACGATCG
ATTCCGCTGT CGATGTTACC GGGCGATGTT GAAGCCATGT TTCAGCAGGC ATTCACGGAA
AGTGGCGTGG CAACCGCGCG TCCGACGGCA AAAGCGTGGG TAGCGGCACT GGATTTACTA
CGCCAACAGT TAAAGAAATG TACCGTTTCG GCAATGCATG TTTACCCCGG TCATTTGACT
GACTGCCCGT GGTGTGCGCT GGATAATCAA GGCGTTATCT ATTTTATTGA TCTCGGCGAA
GAGGTCATTA CCACCAGTGG TGATTTTGTG CTGGCGAAAG TCTGGGCGAT GGTGATGGCG
TCAGTAGCAC CGCCAGCATT GCAATTGCCA TTACCCGATC ATTTCCAACC GACTGGCAGG
CCGCTTCCTT TAGGCCTGTT ACGGCGCGAA TACATCATTC TGATTGAGAT CGCACTGTCA
GCGTTATCGC TGTTGCTTTG CGGCCTTCAG GCAGAACCGC GTTATATTAT TTTGGTTCCT
GTGCTGGCGG CTATCTGGAT TATTGGCAGT CTGACAAGCA AAGTTTATAA AGCAGAAATC
CAGCAACGCC GTGAGGCATT TAATCGCGCG AAAATGGACT ATGACCATTT AGTCAGCCAG
ATCCAACAGT TGGGCGGGCT GGAAGGTTTT ATCGCCAAAC GGGCGATGCT CGAAAAAATG
AAGGACGAAA TTCTCGGGTT ACCGGAAGAG GAGAAACGCG CTCTGGCAGC ACTTCATGAC
ACTGCAAGGG AACGGCAGAA GCAGAAGTTT CTGGAGGGAT TTTTTATTGA TGCTGCCTCT
ATTCCCGGCG TTGGCCCTGC GCGTAAAGCG GCGTTACGGT CTTTTGGTAT TGAAACAGCC
GCGGATGTTA CCCGTCGTGG GGTTAAGCAA GTTAAAGGGT TTGGTGATCA TCTGACCCAG
GCGGTTATCG ACTGGAAAGC GAGCTGTGAA CGCCGTTTTG TGTTCAGGCC GAACGAAGCG
GTAACGCCTG CAGAAAGACA AGCGGTAATG GCGAAAATGG CCGCCAAACG ACATCGGCTG
GAATCGGCGT TGACTGTCGG CGCGACAGAG TTGCAGCGAT TCCGCCTTCA TGCTCCAGCA
CGGACCATGC CGTTGATGGA ACCGTTACGT CAGGCGGCAG AAAAACTGGC TCAGGCGCAG
GCAGATTTAA GCCGCTGCTG A
 
Protein sequence
MKPTLYTATG ECVTPGRELG KGGEGAVYDI NEFVDSVAKI YHTPPPALKQ DKLAFMAATA 
DAQLLNYVAW PQATLHGGRG GKVIGFMMPK VSGKEPIHMI YSPAHRRQSY PHCAWDFLLY
VARNIASSFA TVHEHGHVVG DVNQNSFMVG RDSKVVLIDS DSFQINANGT LHLCEVGVSH
FTPPELQTLP SFVGFERTAN HDNFGLALLI FHVLFGGRHP YSGVPLISDA GNALETDIAH
FRYAYASDNQ RRGLKPPPRS IPLSMLPGDV EAMFQQAFTE SGVATARPTA KAWVAALDLL
RQQLKKCTVS AMHVYPGHLT DCPWCALDNQ GVIYFIDLGE EVITTSGDFV LAKVWAMVMA
SVAPPALQLP LPDHFQPTGR PLPLGLLRRE YIILIEIALS ALSLLLCGLQ AEPRYIILVP
VLAAIWIIGS LTSKVYKAEI QQRREAFNRA KMDYDHLVSQ IQQLGGLEGF IAKRAMLEKM
KDEILGLPEE EKRALAALHD TARERQKQKF LEGFFIDAAS IPGVGPARKA ALRSFGIETA
ADVTRRGVKQ VKGFGDHLTQ AVIDWKASCE RRFVFRPNEA VTPAERQAVM AKMAAKRHRL
ESALTVGATE LQRFRLHAPA RTMPLMEPLR QAAEKLAQAQ ADLSRC
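
The relationship between the two sequences can be spot-checked by translating a few codons by hand. The following is a minimal sketch, not the pipeline that produced this record. It builds a codon table from the usual TCAG ordering, whose amino acid assignments are the same for the standard code and for translation table 11 (the tables differ in which start codons are permitted, and this sketch does not handle alternative starts). The `translate` helper is a hypothetical name introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGAAACCCA CTTTATATAC TGCTACTGGT"))  # MKPTLYTATG
print(translate("GCAGATTTAA GCCGCTGCTG A"))           # ADLSRC*
```

The two outputs match the first ten and last six residues of the protein sequence listed above, with the trailing `*` corresponding to the terminal TGA stop codon.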