Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4718 |
Symbol | |
ID | 6967083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4358422 |
End bp | 4359300 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643388419 |
Product | hypothetical protein |
Protein accession | YP_002272847 |
Protein GI | 209396746 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAA AGCAGAGTTC CACCCCACAC GATGCGCTGT TCAAACTCTT TTTACGCCAA CCGGAGACGG CACGCGATTT TCTTGCGTTT CATTTACCAG CACCCATTCA CGCGCTCTGT GATATGAAAA CCCTCAAGCT GGAGTCGAGC AGCTTTATTG ATGACGATCT GCGTGAAAGC TATCCCGATG TGCTGTGGTC GGTGAAAACA GAACAAGGAC CAGGATACAT CTATTGTCTG ATTGAACATC AAAGCACCTC AAACAAACTG ATCGCATTTC GCATGATGCG TTACGCTATT GCCGCAATGC AAAATCACCT TGATGCCGGA TACAAAACGT TGCCGATGGT GGTGCCATTG TTGTTTTACC ACGGTATTGA AAGCCCCTAT CCCTATTCGC TGTGTTGGCT GGATTGTTTC GCCGATCCCA AACTGGCAAG GCAGCTTTAT GCCTCCGCAT TTCCGCTGAT TGATATCACC GTCATGCCTG ATGATGAAAT CATGCAGCAC CGACGCATGA CGCTGCTGGA GTTAATTCAA AAACATATTC GTCAACGCGA CCTGATGGGG CTGGTAGAGC AAATGGCCTG CTTATTAAGT AGTGGATACG CTAATGACAG ACAAATCAAA GGGCTGTTTA ATTACATACT GCAAACCGGC GATGCGGTAC GTTTTAACGA TTTTATCGAC GGCGTTGCCG AACGATCACC GAAACACAAG GAGAGTTTAA TGACTATTGC GGAAAGATTG CGGCAGGAGG GGGAACAATC CAAAGCCCTG CATATAGCCA AAATAATGCT TGAATCCGGA GTCCCTCTTG CAGACATCAT GCGCTTTACC GGGCTGTCAG AAGAAGAGTT GGCTGCGGCG AGTCAGTAA
|
Protein sequence | MSKKQSSTPH DALFKLFLRQ PETARDFLAF HLPAPIHALC DMKTLKLESS SFIDDDLRES YPDVLWSVKT EQGPGYIYCL IEHQSTSNKL IAFRMMRYAI AAMQNHLDAG YKTLPMVVPL LFYHGIESPY PYSLCWLDCF ADPKLARQLY ASAFPLIDIT VMPDDEIMQH RRMTLLELIQ KHIRQRDLMG LVEQMACLLS SGYANDRQIK GLFNYILQTG DAVRFNDFID GVAERSPKHK ESLMTIAERL RQEGEQSKAL HIAKIMLESG VPLADIMRFT GLSEEELAAA SQ
|
| |