Gene Information |
Locus tag | ECH74115_1966 |
Symbol | |
ID | 6967381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1858946 |
End bp | 1860343 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385892 |
Product | hypothetical protein |
Protein accession | YP_002270381 |
Protein GI | 209397259 |
COG category | [R] General function prediction only |
COG ID | [COG3106] Predicted ATPase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.575033 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGCGAC TTAAAAATGA ACTTAATGCG CTGGTGAATC GGGGTGTCGA CAGACATCTG CGCCTCGCCG TAACCGGACT TAGCCGCAGC GGCAAAACGG CGTTTATCAC TGCGATGGTC AATCAGTTGC TCAATATTCA TGCCGGAGCA CGTTTGCCGC TATTAAGTGC GGTGCGTGAA GAGCGTCTGC TGGGCGTAAA ACGCATTCCT CAGCGTGACT TTGGCATTCC GCGTTTTACC TACGACGAAG GGTTGGCGCA GCTGTATGGC GATCCTCCCG CCTGGCCGAC GCCAACGCGC GGCGTCAGCG AAATTCGCCT GGCACTACGC TATAAATCGA ACGATTCGCT GCTGCGCCAC TTTAAGGATA CCTCCACGCT GTATCTGGAG ATTGTGGATT ACCCTGGCGA ATGGTTGCTC GACCTGCCGA TGCTGGCGCA GGACTATTTA AGCTGGTCGC GCCAGATGAC GGGCTTACTC AATGGTCAGC GCGGCGAATG GTCGGCGAAA TGGCGAATGA TGTGCGAAGG GCTGGACCCG CTAGCGCCTG CCGACGAAAA CCGGCTGGCA GACATTGCCG CCGCGTGGAC CGATTATCTC CACCACTGTA AACAGCAGGG GCTGCACTTT ATTCAGCCTG GGCGCTTTGT CTTGCCGGGG GATATGGCAG GTGCGCCCGC GCTGCAATTC TTCCCGTGGC CGGATGTCGA TGCCTGGGGC GAGTCCAAAC TGGCGCAGGC CGATAAGCAC ACCAATGCCG GAATGCTGCG CGAGCGGTTT AATTATTACT GCGAGAAGGT GGTGAAGGGG TTCTATAAGA ATCATTTTCT GCGCTTTGAC CGCCAGATTG TGCTGGTGGA TTGCCTGCAA CCTCTCAACA GTGGGCCACA GGCATTTAAT GATATGCGTC TGGCGCTGAC GCAGCTGATG CAAAGTTTCC ACTATGGGCA GCGTACCCTG TTCAGGCGTT TGTTTTCGCC GGTTATCGAT AAGCTATTGT TTGCTGCCAC TAAAGCGGAC CATGTGACCA TCGATCAGCA CGCCAATATG GTTTCATTGC TGCAACAACT AATTCAGGAT GCCTGGCAAA ATGCGGCGTT CGAAGGGATC AGCATGGATT GCCTGGGGCT GGCGTCAGTT CAGGCGACCA CCAGTGGCAT TATTGATGTT AACGGTGAGA AAATCCCGGC GCTGCGCGGT AATCGACTTA GCGATGGCGC ACCGCTCACT GTTTATCCTG GCGAAGTTCC CGCACGTTTG CCTGGTCAGG CGTTCTGGGA TAAGCAAGGG TTCCAGTTTG AAGCGTTTCG CCCGCAGGTG ATGGATGTCG ACAAACCGCT GCCGCATATT CGTCTTGATG CCGCGCTGGA ATTTTTAATA GGAGATAAAT TGCGATGA
Protein sequence | MKRLKNELNA LVNRGVDRHL RLAVTGLSRS GKTAFITAMV NQLLNIHAGA RLPLLSAVRE ERLLGVKRIP QRDFGIPRFT YDEGLAQLYG DPPAWPTPTR GVSEIRLALR YKSNDSLLRH FKDTSTLYLE IVDYPGEWLL DLPMLAQDYL SWSRQMTGLL NGQRGEWSAK WRMMCEGLDP LAPADENRLA DIAAAWTDYL HHCKQQGLHF IQPGRFVLPG DMAGAPALQF FPWPDVDAWG ESKLAQADKH TNAGMLRERF NYYCEKVVKG FYKNHFLRFD RQIVLVDCLQ PLNSGPQAFN DMRLALTQLM QSFHYGQRTL FRRLFSPVID KLLFAATKAD HVTIDQHANM VSLLQQLIQD AWQNAAFEGI SMDCLGLASV QATTSGIIDV NGEKIPALRG NRLSDGAPLT VYPGEVPARL PGQAFWDKQG FQFEAFRPQV MDVDKPLPHI RLDAALEFLI GDKLR
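
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon, which is consistent with the record (Is gene spliced: No; gene sequence ending in TGA). The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon (the stop codon) does not encode an amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record: Start bp 1858946, End bp 1860343.
gene_length, protein_length = cds_lengths(1858946, 1860343)
print(gene_length, protein_length)  # 1398 465
```

The result matches the Gene Length (1398 bp) and Protein Length (465 aa) fields listed above.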