Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4904 |
Symbol | |
ID | 6969223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4543465 |
End bp | 4545144 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388590 |
Product | hypothetical protein |
Protein accession | YP_002273018 |
Protein GI | 209398532 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03368] cellulose synthase operon protein YhjU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAT TTACGCAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC CTTTCCGGCT GGAACTTCTA TTTTCTGGTT AAGTTCGGCC TGTTGTGGGC GGGATATCTT AACTTCCATC CGCTCCTCAA TTTGGTGTTT GCCGCGTTTC TGCTGATGCC CATTCCGCGC TACAGCCTGC ATCGCTTGCG CCACTGGATT GCCCTGCCGA TCGGCTTTGC TTTGTTCTGG CATGACACCT GGTTGCCTGG CCCGGAAAGC ATAATGAGCC AGGGTTCGCA GGTGGCGGGG TTCAGTACCG ATTATTTAAT CGACCTTGTC ACACGCTTTA TTAACTGGCA GATGATTGGG GCCATTTTTG TTTTATTAGT GGCCTGGTTA TTCCTGTCAC AATGGATTCG CATTACCGTT TTTGTGGTTG CCATACTGCT ATGGCTGAAC GTACTTACCC TGGCGGGACC AAGTTTCTCC TTGTGGCCAG CCGGACAACC GACGACCACT GTAACAACGA CGGGTGGTAA CGCAGCGGCA ACCGTTGCGG CGACGGGTGG CGCACCGGTA GTGGGTGATA TGCCCGCACA AACTGCACCG CCAACAACGG CGAACCTTAA CGCCTGGCTG AATAATTTCT ATAACGCGGA GGCGAAACGT AAATCGACCT TCCCGTCTTC GCTGCCCGCT GATGCTCAGC CATTTGAACT ACTGGTGATT AACATCTGTT CGCTTTCCTG GTCGGATATA GAAGCCGCTG GGTTGATGTC GCATCCACTG TGGTCGCATT TCGATATTGA GTTCAAGAAC TTTAACTCCG CCACCTCCTA CAGTGGCCCG GCGGCGATCC GTTTACTGCG CGCCAGCTGC GGGCAGACTT CGCACACTAA TCTGTATCAA CCGGCAAATA ACGACTGCTA TCTGTTTGAT AACCTGTCGA AACTGGGCTT TACCCAGCAC CTGATGATGG GACATAACGG CCAGTTCGGC GGTTTTTTGA AAGAAGTTCG CGAAAATGGC GGCATGCAGA CTGAATTGAT GGATCAAACA AATCTGCCGG TTATTTTGCT GGGCTTTGAT GGTTCGCCGG TTTATGACGA TACCGCCGTG CTTAACCGCT GGCTGGACGT TACCGAAAAA GATAAAAACA GCCGTAGTGC CACGTTCTAC AACACGCTTC CACTGCATGA CGGCAACCAT TATCCGGGCG TCAGCAAAAC AGCGGATTAC AAAGCGCGGG CGCAGAAATT CTTTGATGAA CTGGACGCCT TCTTTACTGA ACTGGAGAAA TCGGGTCGTA AAGTGATGGT GGTCGTGGTG CCGGAACACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGG TATCTGGCCT ACGTGATATC CCTAGCCCGT CTATCACCGA CGTCCCCGTT GGGGTGAAAT TCTTCGGCAT GAAGGCACCA CATCAGGGGG CACCGATTGT CATCGACCAA CCGAGCAGCT TCCTGGCTAT CTCCGATCTG GTGGTTCGCG TTCTTGATGG CAAGATTTTC ACCGAAGACA ATGTTGACTG GAAAAAACTC ACCAGTGGGT TGCCACAAAC AGCACCGGTC TCAGAGAACT CAAATGCAGT AGTTATTCAA TACCAGGATA AACCGTACGT TCGCCTGAAC GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
|
Protein sequence | MTQFTQNTAM PSSLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF AAFLLMPIPR YSLHRLRHWI ALPIGFALFW HDTWLPGPES IMSQGSQVAG FSTDYLIDLV TRFINWQMIG AIFVLLVAWL FLSQWIRITV FVVAILLWLN VLTLAGPSFS LWPAGQPTTT VTTTGGNAAA TVAATGGAPV VGDMPAQTAP PTTANLNAWL NNFYNAEAKR KSTFPSSLPA DAQPFELLVI NICSLSWSDI EAAGLMSHPL WSHFDIEFKN FNSATSYSGP AAIRLLRASC GQTSHTNLYQ PANNDCYLFD NLSKLGFTQH LMMGHNGQFG GFLKEVRENG GMQTELMDQT NLPVILLGFD GSPVYDDTAV LNRWLDVTEK DKNSRSATFY NTLPLHDGNH YPGVSKTADY KARAQKFFDE LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITDVPV GVKFFGMKAP HQGAPIVIDQ PSSFLAISDL VVRVLDGKIF TEDNVDWKKL TSGLPQTAPV SENSNAVVIQ YQDKPYVRLN GGDWVPYPQ
|
| |