Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3008 |
Symbol | |
ID | 6968785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2789714 |
End bp | 2791654 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386846 |
Product | hypothetical protein |
Protein accession | YP_002271314 |
Protein GI | 209397213 |
COG category | [R] General function prediction only |
COG ID | [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.351476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCA CTTTATATAC TGCTACTGGT GAGTGCGTTA CGCCAGGCCG TGAACTGGGC AAAGGTGGCG AAGGCGCGGT TTATGATATC AATGAGTTTG TCGATAGCGT CGCCAAGATT TATCACACGC CGCCACCCGC CTTAAAACAG GACAAACTTG CCTTTATGGC TGCGACAGCT GACGCGCAGT TGTTGAATTA TGTCGCCTGG CCGCAGGCAA CGCTTCACGG TGGACGAGGC GGAAAAGTTA TCGGTTTTAT GATGCCAAAA GTTTCTGGTA AAGAACCGAT TCATATGATC TATAGCCCGG CACATCGTCG CCAGAGTTAT CCTCATTGTG CGTGGGATTT TCTACTCTAT GTTGCGCGCA ATATTGCTTC ATCTTTTGCT ACGGTTCACG AGCACGGGCA CGTCGTGGGG GACGTAAACC AGAACAGCTT TATGGTAGGT CGCGACAGCA AAGTGGTGTT GATCGATAGT GACTCCTTTC AGATTAACGC CAATGGCACA CTGCATTTAT GCGAAGTCGG CGTGTCGCAT TTTACGCCGC CAGAGCTACA AACCTTGCCA TCATTTGTCG GTTTTGAACG TACCGCGAAT CACGATAATT TTGGCCTTGC GTTGCTGATT TTTCACGTCT TGTTTGGTGG GCGGCATCCT TATTCTGGTG TGCCGCTTAT CTCTGATGCG GGTAATGCGC TGGAGACGGA TATTGCCCAT TTCCGTTATG CCTACGCGTC AGACAACCAG CGACGTGGTT TAAAACCGCC GCCACGATCG ATTCCGCTGT CGATGTTACC GGGCGATGTT GAAGCCATGT TTCAGCAGGC ATTCACGGAA AGTGGCGTGG CAACCGCGCG TCCGACGGCA AAAGCGTGGG TAGCGGCACT GGATTTACTA CGCCAACAGT TAAAGAAATG TACCGTTTCG GCAATGCATG TTTACCCCGG TCATTTGACT GACTGCCCGT GGTGTGCGCT GGATAATCAA GGCGTTATCT ATTTTATTGA TCTCGGCGAA GAGGTCATTA CCACCAGTGG TGATTTTGTG CTGGCGAAAG TCTGGGCGAT GGTGATGGCG TCAGTAGCAC CGCCAGCATT GCAATTGCCA TTACCCGATC ATTTCCAACC GACTGGCAGG CCGCTTCCTT TAGGCCTGTT ACGGCGCGAA TACATCATTC TGATTGAGAT CGCACTGTCA GCGTTATCGC TGTTGCTTTG CGGCCTTCAG GCAGAACCGC GTTATATTAT TTTGGTTCCT GTGCTGGCGG CTATCTGGAT TATTGGCAGT CTGACAAGCA AAGTTTATAA AGCAGAAATC CAGCAACGCC GTGAGGCATT TAATCGCGCG AAAATGGACT ATGACCATTT AGTCAGCCAG ATCCAACAGT TGGGCGGGCT GGAAGGTTTT ATCGCCAAAC GGGCGATGCT CGAAAAAATG AAGGACGAAA TTCTCGGGTT ACCGGAAGAG GAGAAACGCG CTCTGGCAGC ACTTCATGAC ACTGCAAGGG AACGGCAGAA GCAGAAGTTT CTGGAGGGAT TTTTTATTGA TGCTGCCTCT ATTCCCGGCG TTGGCCCTGC GCGTAAAGCG GCGTTACGGT CTTTTGGTAT TGAAACAGCC GCGGATGTTA CCCGTCGTGG GGTTAAGCAA GTTAAAGGGT TTGGTGATCA TCTGACCCAG GCGGTTATCG ACTGGAAAGC GAGCTGTGAA CGCCGTTTTG TGTTCAGGCC GAACGAAGCG GTAACGCCTG CAGAAAGACA AGCGGTAATG GCGAAAATGG CCGCCAAACG ACATCGGCTG GAATCGGCGT TGACTGTCGG CGCGACAGAG TTGCAGCGAT TCCGCCTTCA TGCTCCAGCA CGGACCATGC CGTTGATGGA ACCGTTACGT CAGGCGGCAG AAAAACTGGC TCAGGCGCAG GCAGATTTAA GCCGCTGCTG A
|
Protein sequence | MKPTLYTATG ECVTPGRELG KGGEGAVYDI NEFVDSVAKI YHTPPPALKQ DKLAFMAATA DAQLLNYVAW PQATLHGGRG GKVIGFMMPK VSGKEPIHMI YSPAHRRQSY PHCAWDFLLY VARNIASSFA TVHEHGHVVG DVNQNSFMVG RDSKVVLIDS DSFQINANGT LHLCEVGVSH FTPPELQTLP SFVGFERTAN HDNFGLALLI FHVLFGGRHP YSGVPLISDA GNALETDIAH FRYAYASDNQ RRGLKPPPRS IPLSMLPGDV EAMFQQAFTE SGVATARPTA KAWVAALDLL RQQLKKCTVS AMHVYPGHLT DCPWCALDNQ GVIYFIDLGE EVITTSGDFV LAKVWAMVMA SVAPPALQLP LPDHFQPTGR PLPLGLLRRE YIILIEIALS ALSLLLCGLQ AEPRYIILVP VLAAIWIIGS LTSKVYKAEI QQRREAFNRA KMDYDHLVSQ IQQLGGLEGF IAKRAMLEKM KDEILGLPEE EKRALAALHD TARERQKQKF LEGFFIDAAS IPGVGPARKA ALRSFGIETA ADVTRRGVKQ VKGFGDHLTQ AVIDWKASCE RRFVFRPNEA VTPAERQAVM AKMAAKRHRL ESALTVGATE LQRFRLHAPA RTMPLMEPLR QAAEKLAQAQ ADLSRC
|
| |