Gene Information |
Locus tag | ECH74115_0233 |
Symbol | |
ID | 6968407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 243962 |
End bp | 245371 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384305 |
Product | ImpA domain protein |
Protein accession | YP_002268821 |
Protein GI | 209395773 |
COG category | [S] Function unknown |
COG ID | [COG3515] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
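The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Both assumptions agree with the values in this record, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(243962, 245371)
print(length, protein_length(length))  # 1410 469
```

The strand does not affect this calculation; for a minus-strand gene such as this one, the start and end still bound the same interval on the replicon.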
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.845449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGTA ACGTACTGAC ACAAACTATC GTTACCGGCA GTGACCCGCG CGGGCTGCCG GAATTCAGCG CCATCCGCGA GGAAATAAAC AAAGCCAGCC ACCCGTCACA GCCTGAGCTG AACTGGAAAC TGGTGGAGTC GCTGGCGCTG GCGATTTTTA AAGCCAACGG TGTGGATTTA CACACCGCCA CCTACTATAC GCTTGCCCGG ACACGGACAC AGGGACTGGC GGGATTCTGC GAAGGTGCGG AACTGCTGGC GGCAATGGTA AGCCACGACT GGGATAAGTT CTGGCCGCAG GGCGGCCCGG CGCGTACTGA AATGCTGGAC TGGTTTAACT CCCGAACCGG CAATATTCTG CGTCAGCAAA TCTCCTTTGC GGAATCCGAC CTACCACTGA TCTACCGCAC AGAGCGGGCA TTGCAGCTTA TCTGCGATAA GCTCCAGCAG GTGGAACTGA AGCGCGTTCC GCGCGTGGAG AATCTGCTCT ATTTTATGCA GAACACGCGT AAACGGCTTG AACCCCAGCT GAAGAGTAAC ACTGAGAACG CCGCACAGAC CACGGTCAGA ACGCTGATTT ATGCCCCGGA AACACAGGCA TCTTCCACAC CAGAAGCGGT AGTGCCTCCC CTGCCCGGCC TGCCTGAGAT GAAAGTGGAA GTGCGCAGTC TGACAGAGAA TCCCCCACAG GCCAGTGTGA TAAAGCAAGG CAGTACGGTA AGAGGGTTTA TCGCAGGGAT CGCCTGTTCA GTGGCTGTCG CCTCAGCATT GTGGTGGTGG CAGGTCTATC CGGTGCAGCA GCAACTGTTA CAGGTTAACG ACACCGCTCA GGGCGCAGCA ACGGTGTGGA TGGCCTCACC TGAACTCGAA AACTATGAGC GCAGGCTGCA ACAACTTCTT GATACCTCCC CGGTACAGCC GCTGGAAACC GGGATGCAGA TGATGCGTGT TGCCGACAGT CGCTGGCCGG AAAGCCTGCA ACAGCAACAG GCCTCGACAC AATGGAATGA GGCACTCAAA ACCCGCGCAC AGAGTAGCCC GCAGTTGCGT GGCTGGTTGC AGACCCGCCA GGACTTACAT GCTTTTGCAG ATCTAGTGAT GCAGCGCGAG AAAGAGGGAC TAACCCTTTC CTATATCAAA AATGTCATCT GGCAGGCGGA GCGGGGACTG GGGCAGGAAA CACCCGTTGA GTCTCTGTTG ACGCAGTACC AGGATGCCCG TGCGCAGAAG CAGAATACAG ATGCGCTGGA AAAACAAATT AATGAGCGAC TCGAAGGCGT GTTAAGCCGC TGGCTGCTGC TGAAGAATAA TACGATACCG ACGATAAAAA AAGCATTGAA TTTCAATAAC ATTCATGAAT ACAAAGGAGT TTTAAATGGC GAATTTAATT TATTTAACAC TAAATGGTGA
|
Protein sequence | MNSNVLTQTI VTGSDPRGLP EFSAIREEIN KASHPSQPEL NWKLVESLAL AIFKANGVDL HTATYYTLAR TRTQGLAGFC EGAELLAAMV SHDWDKFWPQ GGPARTEMLD WFNSRTGNIL RQQISFAESD LPLIYRTERA LQLICDKLQQ VELKRVPRVE NLLYFMQNTR KRLEPQLKSN TENAAQTTVR TLIYAPETQA SSTPEAVVPP LPGLPEMKVE VRSLTENPPQ ASVIKQGSTV RGFIAGIACS VAVASALWWW QVYPVQQQLL QVNDTAQGAA TVWMASPELE NYERRLQQLL DTSPVQPLET GMQMMRVADS RWPESLQQQQ ASTQWNEALK TRAQSSPQLR GWLQTRQDLH AFADLVMQRE KEGLTLSYIK NVIWQAERGL GQETPVESLL TQYQDARAQK QNTDALEKQI NERLEGVLSR WLLLKNNTIP TIKKALNFNN IHEYKGVLNG EFNLFNTKW
|
| |
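The gene and protein sequences above can be related by a short script. This is a minimal sketch, not the database's own method: it strips the spacing between 10-base blocks, computes GC content, and translates codons with the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in the start codons it permits). The helper names `clean`, `gc_content`, and `translate` are hypothetical. The example translates only the first three blocks of the gene sequence.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGAATAGTA ACGTACTGAC ACAAACTATC"
print(translate(first_blocks))  # MNSNVLTQTI
```

Passing the full gene sequence to `gc_content` and `translate` should give values comparable to the GC content and protein sequence fields in this record. The sequence is already listed in coding orientation, so no reverse complement is needed despite the minus strand.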