Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1924 |
Symbol | |
ID | 6966873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1816694 |
End bp | 1817821 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643385855 |
Product | hypothetical protein |
Protein accession | YP_002270344 |
Protein GI | 209400889 |
COG category | [S] Function unknown |
COG ID | [COG4950] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0400808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00000000100313 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACAAC GCCACATCAC CGGCAAAAGC CACTGGTATC ATGAAACGCA ATCCAGTACT ACGGAGTATG ACGTTCTGCC TCTGGTCCCG GAAGCCGCAA AGGTCAGCGA TCCCTTTCTA CTCGACGTGA TCCTTGAAAA AGAAACGCTG GCCCCCTTCC TTTCATGGCT GGACCCTGCG CGTGTTCTTG CAGTGGAGTT GTTCCCTGAC CAGCTTACCG TGACCCGTTC ACAGACCTTC ACCGCTTATG AACGCTTGTC GACGGCCCTG ACGGTTGCTC AGGTTTGCGG CGTCCAGCGG TTATGTAACT ACTATTCGGC GCGACTTACG CCGCTCCCCG GGCCTGATTC CACCAGGGAA AGTAATCATC GGTTGGCACA AATCACGCAA TATGCCCGCC AACTGGCTAG CTCGCCTTCT ATTATCGACA ACCGATCGCG CCAGCATCTG AATGACGTCG GTCTTACTGC CTGGGACTGT GTGATCATTA ACCAAATCAT TGGTTTTATT GGCTTTCAGG CGCGGACAAT TGCGACATTT CAGGCTTATC TCGGGCATCC GGTACGCTGG TTACCCGGGC TGGAGATACA AAACTACGCC GACGCGTCAC TGTTTGCTGA TGAATCATTA CGCTGGCGAA GCAGCTATGA GGTGGAAAAA CTACCTGAAG AACACACAAA AAGTTCAACT GCAGAACTTT GCCAACTGGC CGAAATACTC TCTCTCCACC CTATTTCACT TTCCCTTCTC GAAAGGTTGT TAAACAGCAC ACGGGTTAAT ACACAGCCGG ATAATCAGCT TGCGGCGTTG TTATGCGCGC GGATAAATGG CAGTCCTGCT TGTTTTGCCG CCTGTATGGA TTCATCAAAT GAATATAAAA AAATCAGCCC CCTTCTGCGC AAGGGCGAAA ATGAAATTAA CCAATGGGCT GACCGTCATT CTGTTGAGCG CGCTACCGTT CAGGCGATAC AATGGCTGAC CCGAGCACCC GATCGCTTTA GCGCCGCCCA GTTCAGCCCT TTACTCGAAC ACGAAAAATC ATCAACGCAG ATTATTAATC TGCTGGTATG GAGCGGGCTG TGTGGCTGGA TAAATCGCTT AAAAATCGCG TTGGGTGAGA CATATTAA
|
Protein sequence | MEQRHITGKS HWYHETQSST TEYDVLPLVP EAAKVSDPFL LDVILEKETL APFLSWLDPA RVLAVELFPD QLTVTRSQTF TAYERLSTAL TVAQVCGVQR LCNYYSARLT PLPGPDSTRE SNHRLAQITQ YARQLASSPS IIDNRSRQHL NDVGLTAWDC VIINQIIGFI GFQARTIATF QAYLGHPVRW LPGLEIQNYA DASLFADESL RWRSSYEVEK LPEEHTKSST AELCQLAEIL SLHPISLSLL ERLLNSTRVN TQPDNQLAAL LCARINGSPA CFAACMDSSN EYKKISPLLR KGENEINQWA DRHSVERATV QAIQWLTRAP DRFSAAQFSP LLEHEKSSTQ IINLLVWSGL CGWINRLKIA LGETY
|
| |