Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4059 |
Symbol | |
ID | 6970975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3753105 |
End bp | 3754469 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387818 |
Product | hypothetical protein |
Protein accession | YP_002272261 |
Protein GI | 209399193 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0267999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATTACAC ATATTAGCCC GCTTGGCTCC ATGGATATGT TGTCGCAGCT GGAAGTGGAT ATGCTTAAAC GCACCGCCAG CAGCGACCTC TATCAACTGT TTCGCAACTG TTCACTTGCC GTACTGAACT CCGGTAGTTT GACCGATAAC AGCAAAGAAT TGCTGTCTCG TTTTGAAAAT TTCGATATTA ACGTCTTGCG CCGTGAACGC GGCGTAAAGC TGGAACTGAT TAATCCCCCG GAAGAGGCTT TTGTCGATGG GCGAATTATT CGCGCTTTGC AGGCCAACTT GTTCGCGGTT CTGCGAGACA TTCTCTTCGT TTACGGGCAA ATCCATAATA CCGTTCGTTT TCCCAACCTG AATCTCGACA ACTCCGTCCA CATCACTAAC CTGGTCTTTT CCATCTTGCG TAACGCTCGC GCGCTGCATG TGGGTGAAGC GCCAAATATG GTGGTCTGCT GGGGCGGTCA CTCAATTAAC GAAAATGAGT ATTTGTATGC CCGTCGCGTC GGAAACCAGC TGGGCCTGCG TGAGCTGAAT ATCTGCACCG GCTGTGGTCC GGGAGCGATG GAAGCGCCGA TGAAAGGTGC TGCGGTCGGA CACGCGCAGC AGCGTTACAA AGACAGTCGT TTTATTGGTA TGACAGAGCC GTCGATTATC GCCGCTGAAC CGCCTAACCC GCTGGTCAAC GAATTGATCA TCATGCCGGA TATCGAAAAA CGTCTGGAAG CGTTTGTCCG TATCGCTCAC GGCATCATTA TCTTCCCTGG CGGTGTGGGT ACGGCAGAAG AGTTGCTGTA TTTGCTGGGA ATTTTAATGA ACCCGGCCAA CAAAGATCAG GTTTTACCAT TGATCCTCAC CGGCCCGAAA GAGAGCGCCG ACTACTTCCG CGTACTGGAC GAGTTTGTCG TACATACGCT GGGCGAAAAC GCGCGCCGCC ATTACCGCAT AATCATTGAT GACGCCGCTG AAGTCGCCCG TCAGATGAAA AAATCGATGC CGCTGGTGAA AGAAAATCGC CGTGATACAG GCGATGCCTA CAGCTTTAAC TGGTCAATGC GCATTGCGCC AGATTTGCAA ATGCCATTTG AGCCGTCTCA CGAGAATATG GCTAATCTGA AGCTTTACCC GGATCAACCT GTTGAAGTGC TGGCTGCCGA TCTGCGCCGT GCGTTCTCCG GTATTGTGGC GGGTAACGTA AAAGAAGTCG GTATTCGCGC CATTGAAGAG TTTGGTCCTT ACAAAATCAA CGGCGATAAA GAGATTATGC GTCGTATGGA CGACCTGCTA CAGGGTTTTG TTGCCCAGCA TCGTATGAAG TTGCCAGGCT CAGCCTACAT CCCTTGCTAC GAAATCTGCA CGTAA
|
Protein sequence | MITHISPLGS MDMLSQLEVD MLKRTASSDL YQLFRNCSLA VLNSGSLTDN SKELLSRFEN FDINVLRRER GVKLELINPP EEAFVDGRII RALQANLFAV LRDILFVYGQ IHNTVRFPNL NLDNSVHITN LVFSILRNAR ALHVGEAPNM VVCWGGHSIN ENEYLYARRV GNQLGLRELN ICTGCGPGAM EAPMKGAAVG HAQQRYKDSR FIGMTEPSII AAEPPNPLVN ELIIMPDIEK RLEAFVRIAH GIIIFPGGVG TAEELLYLLG ILMNPANKDQ VLPLILTGPK ESADYFRVLD EFVVHTLGEN ARRHYRIIID DAAEVARQMK KSMPLVKENR RDTGDAYSFN WSMRIAPDLQ MPFEPSHENM ANLKLYPDQP VEVLAADLRR AFSGIVAGNV KEVGIRAIEE FGPYKINGDK EIMRRMDDLL QGFVAQHRMK LPGSAYIPCY EICT
|
| |