Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5519 |
Symbol | |
ID | 6967208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5168239 |
End bp | 5169819 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643389162 |
Product | hypothetical protein |
Protein accession | YP_002273559 |
Protein GI | 209398240 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0795524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAGGTC ATATCTCAAA GTTTGACGGC AATAACTCTT TGATAAAACA TGGTGTGGTG CAAGGAAATA ATATAGTAGA TTTTGATTTA CTACGTAATT TTAATGGGGG GCCAGGGTTA AATCGAGAAA ACTTTATTTA TATCAGCAAT ATTTTTTTAA ATATAAAACA ACGGAACGAA AAAAATCATT CAATAAATAT GTTTCGTGAA GTCTCAATCA GTGGTGATAT TGTAAGCGTA AAATTTTATA GAAATGAAAA AATAGAATGC GCTTGTGATT TTATGATGGC TAAAGATGCG CAGGGGTATA TCGACCTGTC TGAATTGGAT TTAACAAGTT GTCATTTTAA AGGTGACGTT ATTTCGAAGG TGTCTTTCAT ATCATCAAAT CTACAACATG TAACATTCGA ATGTAAAGAA ATTGGGGATT GCAATTTTAC TACTGCAATA GTTGATAATG TCATATTTAA ATGTCGACGT TTACACAATG TAATTTTTAT CAAAGCGAGT GGTGATTATG TCGATTTTAG CAAAAATATT CTTGATACAG TTGACTTCTC GCAGAGTCAA CTTACTCATA GTAATTTTTG TGAATGTCAG ATTAGAAATT CAAACTTCGA TCATTGTTAT CTTTATGCTT CGCACTTCAC CAGAGCAGAA TTTCTTACTG ACAAAGAAAT ATCATTTATT AAATCGAATT TAACAGCTGT TATGTTTGAT CATGTGCGAA TATCGACAGG GAATTTTAAA GACAGCGTTA CACAACTAAT GGTATTATCT ATTGATTACT CAGATATATT TGGAAATGAA TATCTCGATG GTTATATCAA TAACATTATA AAAATGATTG ATTCGTTGCC AGATGATCCA GCGATATTGA AATCCGTTCT GGCAGTAAAA CTGGTGATGC AATTAAAAAT TCTTAATATT GTTAATAAAA ACTTTATTGA GAATATGAAG AAAATATTTA GCCATGGTCC TTATATAAAA GATCCCATTA TACGTAGTTA TATCCATCCT GATGAAGATA ACAAGTTCGA TAATTTTATG CGTCAAAATC GATTCAGTAA GGTGAATTTC GATACCCAAC AGATGATCGA TTTTATTAAC AGATTTAATA TGAATAAATG GCTGATTGAT CGAAATAACA ATTTTTTTAT CCAACTTATC GATCAGGCTC TACGATCAAC GAATGATACG ATCAAAGAAA ATGCCTGGCA TCTTTATAAA GAGTGGATTC GTAGTGATGA TGTTTCACCT TTATTTATAG AAATTGAAGA TAATTTAAGA ACCTTTAACA CGAATGAATT AACACGAAAC GATAATATCT TTATCTTGTT CTCCTCTGTC GATGATGGGC CAGTTATGGT GGTAAGCTCC CAGCGCTTAC ATGATATGTT GAATCCTACA AAAGATACCA ATTGGAATTC CACGTATATC TATAAATCCA GACATGAGAT GTTGCCTGTT AATCTTACTC CGGAAACACT TTTCGGCTCC AAATCTTATG ATAAACATGC GCTTTTCCCC ATTTTTACTG CGAGTTGGCG AGCTAATCGT ATAAAGAATA AAGGTATTTA A
|
Protein sequence | MLGHISKFDG NNSLIKHGVV QGNNIVDFDL LRNFNGGPGL NRENFIYISN IFLNIKQRNE KNHSINMFRE VSISGDIVSV KFYRNEKIEC ACDFMMAKDA QGYIDLSELD LTSCHFKGDV ISKVSFISSN LQHVTFECKE IGDCNFTTAI VDNVIFKCRR LHNVIFIKAS GDYVDFSKNI LDTVDFSQSQ LTHSNFCECQ IRNSNFDHCY LYASHFTRAE FLTDKEISFI KSNLTAVMFD HVRISTGNFK DSVTQLMVLS IDYSDIFGNE YLDGYINNII KMIDSLPDDP AILKSVLAVK LVMQLKILNI VNKNFIENMK KIFSHGPYIK DPIIRSYIHP DEDNKFDNFM RQNRFSKVNF DTQQMIDFIN RFNMNKWLID RNNNFFIQLI DQALRSTNDT IKENAWHLYK EWIRSDDVSP LFIEIEDNLR TFNTNELTRN DNIFILFSSV DDGPVMVVSS QRLHDMLNPT KDTNWNSTYI YKSRHEMLPV NLTPETLFGS KSYDKHALFP IFTASWRANR IKNKGI
|
| |