Gene Information |
Locus tag | ECH74115_4423 |
Symbol | |
ID | 6971477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4098053 |
End bp | 4099363 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643388144 |
Product | hypothetical protein |
Protein accession | YP_002272581 |
Protein GI | 209399521 |
COG category | [S] Function unknown |
COG ID | [COG3681] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
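The coordinate and length fields above can be cross-checked against one another. The sketch below is a minimal illustration, assuming 1-based inclusive coordinates and a protein length that excludes the stop codon. The helper names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the reported protein length excludes the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4098053, 4099363)
print(length_bp)                  # 1311
print(protein_length(length_bp))  # 436
```

Both values agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields in the table.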
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.0648812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGATT CGACTTTAAA TCCGTTATGG CAGCGTTACA TCCTCGCCGT TCAGGAGGAA GTAAAACCGG CGCTGGGATG TACTGAACCG ATTTCACTGG CGCTGGCGGC GGCGGTTGCT GCGGCAGAAC TGGAAGGTCC GGTTGAACGT GTAGAAGCCT GGGTTTCGCC AAATCTGATG AAGAACGGTC TGGGCGTCAC CGTTCCCGGC ACGGGAATGG TGGGGCTGCC GATTGCGGCG GCGCTGGGGG CGTTAGGTGG AAATGCCAAC GCCGGGCTGG AAGTGCTGAA AGACGCAACT GCGCAGGCAA TTGCCGATGC CAAAGCACTG CTGGCGGCGG GGAAAGTCTC CGTTAAGATC CAGGAACCTT GCAATGAAAT CCTCTTCTCA CGCGCCAAAG TCTGGAACGG TGAGAAGTGG GCGTGTGTCA CTATCGTCGG CGGGCATACC AACATTGTGC ATATTGAGAC GCACGATGGT GTGGTGTTTA CCCAGCAGGC GTGTGTGGCA GAGGGCGAGC AAGAGTCTCC GCTATCGGTG CTTTCCAGAA CGACGCTGGC TGAGATCCTG AAGTTCGTCA ATGAAGTCCC GTTTGCGGCG ATCCGCTTTA TTCTCGATTC CGCGAAGCTA AATTGTGCGT TATCGCAGGA AGGTTTGAGC GGTAAGTGGG GGCTGCATAT TGGCGCGACG CTGGAAAAAC AGTGCGAGCG CGGTTTGCTG GCGAAAGATC TCTCTTCATC CATTGTGATT CGTACCAGCG CGGCATCCGA TGCGCGTATG GGCGGCGCCA CGCTTCCGGC AATGAGTAAC TCCGGCTCGG GTAACCAGGG GATCACCGCA ACAATGCCCG TGGTGGTGGT AGCAGAACAC TTCGGAGCGG ATGATGAACG GCTGGCGCGT GCGCTGATGC TTTCTCATTT GAGCGCAATT TACATCCATA ACCAGTTACC GCGTTTGTCT GCGCTGTGTG CCGCAACGAC CGCAGCAATG GGGGCCGCCG CCGGGATGGC ATGGCTGGTG GATGGGCGTT ATGAAACCAT CTCGATGGCG ATCAGCAGTA TGATCGGCGA TGTCAGCGGC ATGATTTGCG ATGGTGCGTC GAACAGCTGC GCGATGAAGG TTTCGACCAG TGCTTCGGCT GCGTGGAAAG CGGTGTTAAT GGCGCTGGAT GATACCGCCG TGACCGGCAA TGAAGGGATC GTGGCGCATG ATGTTGAGCA GTCGATTGCC AACCTGTGTG CGTTAGCAAG CCATTCGATG CAGCAAACGG ATCGGCAGAT TATCGAGATT ATGGCGAGCA AGGCCAGATA A
|
Protein sequence | MFDSTLNPLW QRYILAVQEE VKPALGCTEP ISLALAAAVA AAELEGPVER VEAWVSPNLM KNGLGVTVPG TGMVGLPIAA ALGALGGNAN AGLEVLKDAT AQAIADAKAL LAAGKVSVKI QEPCNEILFS RAKVWNGEKW ACVTIVGGHT NIVHIETHDG VVFTQQACVA EGEQESPLSV LSRTTLAEIL KFVNEVPFAA IRFILDSAKL NCALSQEGLS GKWGLHIGAT LEKQCERGLL AKDLSSSIVI RTSAASDARM GGATLPAMSN SGSGNQGITA TMPVVVVAEH FGADDERLAR ALMLSHLSAI YIHNQLPRLS ALCAATTAAM GAAAGMAWLV DGRYETISMA ISSMIGDVSG MICDGASNSC AMKVSTSASA AWKAVLMALD DTAVTGNEGI VAHDVEQSIA NLCALASHSM QQTDRQIIEI MASKAR
|
| |
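The gene sequence as displayed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the sketch below shows one way to strip the 10-base spacing, compute GC content, and translate the sequence. The amino acid assignments are those of the standard code, which translation table 11 shares (table 11 differs in its permitted start codons). The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon table in the NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last blocks of the gene sequence shown above.
print(translate("ATGTTTGATT CGACTTTAAA TCCGTTATGG"))  # MFDSTLNPLW
print(translate("ATGGCGAGCA AGGCCAGATA A"))           # MASKAR
```

The two printed fragments match the first ten and last six residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.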