Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1132 |
Symbol | |
ID | 6969020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1161605 |
End bp | 1162795 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385137 |
Product | hypothetical protein |
Protein accession | YP_002269636 |
Protein GI | 209400981 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.250632 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG TGGGTCTTTT CCGGGGCCGT TGCCCGTATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTTCC CGCCGTTTGC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCGAC AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGCGCAG AATATCAGCG CGCGGCATTA ATTAGTGCCC TGCAAACGCT GTACCCGGAA TGTGCGATTT ACGATCGCAG CGATGTTGCG GTACGTAAAA AAGAAGGGAT GGAGCTGACC CAGGGCCTCG TCACCGGTGA GTTGCCGCCT GCCCTGCTGC CGATTGAAGA ACATGGCATG AAGCTGCTGG TGGACATACA GCACGGACAC AAAACGGGCT ACTACCTGGA CCAGCGAGAC AGCCGCCTGG CTACCCGCCG CTACGTTGAA AATAAACGCG TACTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCACTGGA TATTGCACGG CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC TTTAAATTGC TGCGTACCTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGTGG CTATAAAGAT ATCAACATGC TGGCGATTCA GCTGCTGAAT GAAGGCGGTA TTCTCCTGAC TTTCTCCTGT TCCGGTCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CGGATGCCGC AATTGATGCC GGTCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCGG CCGATCATCC GGTGATCGCT ACCTATCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
|
Protein sequence | MSVRLVLAKG REKSLLRRHP WVFSGAVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ IRARVWTFDP SESIDIAFFS RRLQQAQKWR DWLAQKDGLD SYRLIAGESD GLPGITIDRF GNFLVLQLLS AGAEYQRAAL ISALQTLYPE CAIYDRSDVA VRKKEGMELT QGLVTGELPP ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGILLTFSC SGLMTSDLFQ KIIADAAIDA GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM
|
| |