Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2058 |
Symbol | |
ID | 6969379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1953661 |
End bp | 1954722 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385970 |
Product | hypothetical protein |
Protein accession | YP_002270459 |
Protein GI | 209399084 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.132417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.00041859 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATTTAC GTCATCTGTT TTCATTGCGC CTGCGTGGTT CATTACTGTT AGGTTCATTG CTTGTTGCTT CATCATTCAG TACGCAGGCC GCAGAAGAAA TGCTGCGTAA AGCGGTAGGT AAAGGTGCCT ACGAAATGGC TTATAGCCAG CAAGAAAACG CGCTGTGGCT CGCCACTTCG CAAAGCCGCA AACTGGATAA AGGCGGCGTG GTTTATCGTC TTGATCCGGT CACTCTGGAA GTGACGCAGG CGATCCATAA CGATCTCAAG CCGTTTGGTG CCACCATCAA TAACACGACT CAGACGTTGT GGTTTGGTAA CACCGTAAAC AGCGCGGTCA CGGCGATAGA TGCCAAAACT GGCGAGGTGA AAGGCCGTCT GGTGCTGGAT GATCGTAAGC GCACGGAAGA GGTGCGCCCG CTGCAACCGC GTGAGCTGGT AGCTGATGAT GCCACGAACA CCGTTTACAT CAGTGGTATT GGTAAAGATA GCGTGATTTG GGTCGTTGAT GGCGAGAATA TCAAACTGAA AACCGCCATC CAGAACACCG GTAAAATGAG TACCGGTCTG GCACTGGATA GCAAAGGCAA ACGTCTTTAC ACCACTAACG CTGACGGCGA ATTGATTACC ATCGACACCG CCGACAATAA AATCCTCAGC CGTAAAAAGC TGCTGGATGA CGGCAAAGAG CACTTCTTTA TCAACATCAG CCTTGATACC GCCAGGCAGC GTGCATTTAT CACCGATTCT AAAGCGGCAG AAGTGTTAGT GGTCGATACC CGTAATGGCA ATATTCTGGC GAAGGTTGCG GCACCGGAAT CACTGGCTGT GCTGTTTAAC CCAGCGCGTA ATGAAGCCTA CGTAACGCAT CGTCAGGCAG GTAAAGTCAG TGTGATTGAC GCGAAAAGCT ATAAAGTGGT GAAAACGTTC GATACGCCGA CTCATCCGAA CAGCCTGGCG CTGTCTGCCG ATGGCAAAAC GCTGTATGTC AGTGTGAAAC AAAAATCCAC TAAACAGCAG GAAGCTACCC AGCCAGACGA TGTGATTCGT ATTGCGCTGT AA
|
Protein sequence | MHLRHLFSLR LRGSLLLGSL LVASSFSTQA AEEMLRKAVG KGAYEMAYSQ QENALWLATS QSRKLDKGGV VYRLDPVTLE VTQAIHNDLK PFGATINNTT QTLWFGNTVN SAVTAIDAKT GEVKGRLVLD DRKRTEEVRP LQPRELVADD ATNTVYISGI GKDSVIWVVD GENIKLKTAI QNTGKMSTGL ALDSKGKRLY TTNADGELIT IDTADNKILS RKKLLDDGKE HFFINISLDT ARQRAFITDS KAAEVLVVDT RNGNILAKVA APESLAVLFN PARNEAYVTH RQAGKVSVID AKSYKVVKTF DTPTHPNSLA LSADGKTLYV SVKQKSTKQQ EATQPDDVIR IAL
|
| |