Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2104 |
Symbol | |
ID | 6967012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2003350 |
End bp | 2004564 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643386003 |
Product | putative lipoprotein |
Protein accession | YP_002270492 |
Protein GI | 209396682 |
COG category | [S] Function unknown |
COG ID | [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.50348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACAC CACCAGCGGG TTCAAAGCCA CCAGCCACGA CGCAACAATC GTCACAACCG ATGCGTGGCA TCTGGCTGGC CACGGTGTCT CGCCTCGACT GGCCACCGGT TTCCTCGGTT AACATTAGTA ACCCCACCAG CCGGGCCCGT GTACAACAAC AGGCGATGAT CGACAAACTG GATCATTTGC AACGTCTCGG TATAAACACG GTCTTTTTCC AGGTCAAGCC GGACGGTACC GCCCTGTGGC CATCGAAAAT TTTGCCGTGG TCCGATCTTA TGACCGGTAA GATTGGTGAA AATCCGGGTT ACGATCCGCT GCAATTTATG CTCGACGAAG CCCACAAGCG TGGGATGAAA GTACACGCCT GGTTTAACCC CTATCGCGTA TCGGTTAATA CGAAGCCCGG TACTATCAGG GAACTGAATA GCACTCTGTC TCAACAACCG GCGAGCGTCT ATGTGCAACA CCGTGACTGG ATCAGAACGT CTGGCGATCG CTTTGTCCTC GACCCGGGCA TCCCTGAGGT TCAGGACTGG ATCACATCAA TAGTCGCAGA AGTGGTTTCC CGCTATCCGG TAGATGGCGT GCAGTTTGAC GACTATTTCT ATACTGAGTC ACCGGGTTCA CGGCTAAATG ATAACGAAAC GTACCGTAAA TACGGAGGCG CATTTGCGTC AAAAGCAGAC TGGCGGCGCA ACAATACTCA GCAGTTAATT GCAAAGGTAT CGCACACCAT TAAAAGCATT AAGCCGGGAG TCGAATTTGG TGTTAGCCCG GCAGGCGTGT GGCGTAACCG ATCACACGAT CCGCTCGGTT CCGATACCCG AGGCGCGGCA GCCTATGACG AATCCTACGC TGACACTCGT CGATGGGTGG AACAAGGATT GCTGGATTAC ATTGCTCCCC AAATTTACTG GCCGTTCTCA CGGAGTGCCG CGCGTTATGA CGTGTTGGCA AAATGGTGGG CGGATGTCGT TAAACCGACC AGGACCCGCC TGTATATCGG TATCGCCTTC TATAAAGTGG GTGAACCTTC AAAGATAGAG CCAGACTGGA TGATTAACGG CGGCGTACCG GAGCTGAAAA AGCAGCTCGA TCTTAACGAT GCGGTACCAG AAATTAGCGG CACCATCTTG TTCCGTGAGG ACTATCTGAA TAAACCGCAG ACTCAACAAG CGGTCAGCTA TCTGCAAAGT CGTTGGGGCA GTTAA
|
Protein sequence | MVTPPAGSKP PATTQQSSQP MRGIWLATVS RLDWPPVSSV NISNPTSRAR VQQQAMIDKL DHLQRLGINT VFFQVKPDGT ALWPSKILPW SDLMTGKIGE NPGYDPLQFM LDEAHKRGMK VHAWFNPYRV SVNTKPGTIR ELNSTLSQQP ASVYVQHRDW IRTSGDRFVL DPGIPEVQDW ITSIVAEVVS RYPVDGVQFD DYFYTESPGS RLNDNETYRK YGGAFASKAD WRRNNTQQLI AKVSHTIKSI KPGVEFGVSP AGVWRNRSHD PLGSDTRGAA AYDESYADTR RWVEQGLLDY IAPQIYWPFS RSAARYDVLA KWWADVVKPT RTRLYIGIAF YKVGEPSKIE PDWMINGGVP ELKKQLDLND AVPEISGTIL FREDYLNKPQ TQQAVSYLQS RWGS
|
| |