Gene Information |
Locus tag | ECH74115_3382 |
Symbol | |
ID | 6967971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3127638 |
End bp | 3128564 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643387191 |
Product | hypothetical protein |
Protein accession | YP_002271654 |
Protein GI | 209399653 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCGAAT CAACAACCTC CTCCCCGCAT GATGCAGTAT TTAAAACCTT TATGTTCACA CCCGAAACCG CACGGGATTT TCTCGAAATA CATTTACCAG AACCACTGCG CAAGCTTTGC AACCTGCAAA CCTTACGCCT GGAACCCACT AGTTTTATTG AAAAAAGTTT ACGCGCTTAC TACTCGGATG TTTTGTGGTC CGTGGAAACC AGCGACGGTG ACGGCTATAT CTACTGCGTG ATTGAACATC AAAGCTCTGC AGAAAAGAAT ATGGCTTTTC GGCTAATGCG CTATGCCACT GCCGCCATGC AGCGTCACCT GGACAAAGGC TATGACAGAG TCCCGCTGGT GGTGCCGTTG CTGTTTTATC ATGGCGAAAC CTCGCCCTAC CCGTACTCAC TCAACTGGCT GGATGAGTTT GACGATCCGC AACTTGCCCG GCAGTTGTAC ACCGAAGCTT TTCCGTTGGT GGATATTACC ATCGTACCTG ACGATGAGAT CATGCAACAT CGGCGTATAG CTCTGCTTGA ACTGATTCAA AAGCATATTC GCGACCGCGA TTTAATCGGT ATGGTCGACA GGATCACCAC GCTTTTGGTT AGAGGCTTCA CTAATGACAG CCAGCTACAA ACACTGTTTA ATTATCTGCT GCAATGCGGC GATACCTCCC GTTTCACCCG TTTTATTCAG GAGATTGCCG AACGTTCACC ACTACAAAAG GAGAGATTAA TGACTATTGC TGAACGGCTA CGGCAGGAAG GACATCAAAT TGGTTGGCAG GAAGGTAAAT TAGAAGGTTT GCAGGAAGGC ATGCATGAAC AAGCTATTAA AATTGCCTTG CGCATGCTGG AACAGGGCTT TGATCGTGAC CTGGTGCTCG CGGCCACCCA GCTAAGCGAA GCCGATCTGG CAGCGAATAA CCACTAA
Protein sequence | MTESTTSSPH DAVFKTFMFT PETARDFLEI HLPEPLRKLC NLQTLRLEPT SFIEKSLRAY YSDVLWSVET SDGDGYIYCV IEHQSSAEKN MAFRLMRYAT AAMQRHLDKG YDRVPLVVPL LFYHGETSPY PYSLNWLDEF DDPQLARQLY TEAFPLVDIT IVPDDEIMQH RRIALLELIQ KHIRDRDLIG MVDRITTLLV RGFTNDSQLQ TLFNYLLQCG DTSRFTRFIQ EIAERSPLQK ERLMTIAERL RQEGHQIGWQ EGKLEGLQEG MHEQAIKIAL RMLEQGFDRD LVLAATQLSE ADLAANNH
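The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is not part of the source record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3127638
end_bp = 3128564
protein_length_aa = 308

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 927, matching "Gene Length"
print(gene_length_bp % 3)       # 0, the length is a whole number of codons
print(expected_protein_length)  # 308, matching "Protein Length"
```

Under these assumptions the reported gene length (927 bp) and protein length (308 aa) are consistent with the reported coordinates.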