Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3891 |
Symbol | |
ID | 6967650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3598505 |
End bp | 3600136 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643387669 |
Product | conserved DNA-binding protein |
Protein accession | YP_002272118 |
Protein GI | 209396348 |
COG category | [S] Function unknown |
COG ID | [COG4688] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATA TATTTATTTT TGAACCAAGC AATAAAAACA ACCCTCTCGA TAATGTTATT AAGTTTATCG AGTTTTGTAA GAGTACTATT TCTAATAATA ATTTAACAAC TTCATGGGAA AGCAATAAAT GGAAAGGTTT ATATAGGTTT ACTAAGTTTA ACTCAAAAAA CAACCTAAAC AGCAAGGAGT GCTTAGATGA TAGCTTTATT AATTTTGCTA AAGCATATAT GTTGCATGTG CATTCATTCA ATAAATCTAA GACGAAACAC TCAACATTAT CAATGTTGAA GATTGTCGAA TTCGTTTTAC TTAAAATCAA TATGGAAGCT AATGTAAACT ATTGCAACAA TTCAATCTAT GATGAATGCA TAAGGATAGC CTCTGAAAAG TATTCTAAAG CACATGCATT TGCTATTGGG AAAGAACTTG AGAAATTGAG TTCATTTTTA AATGATAATA GGATGACTAA CTCATTTTAT TTATTTTGGG TAAATCCAAT TAGGTATAGG ATTACTCAGT CTTGGACTGG TTATGATTCT TCACTGGAAG GCCATTCTAG ATTGCCTGAT ATCAAATCAG TTATTGCGAT TGCTGAGATT TTTTCAAAGC GGGATGAACA ATTATCGTCA AGAGATATAT TTACTACATC TGTGCTTGCT TTACTTATGT GTGCACCGAG TAGGATATCG GAAATTTTAG CTTTGCCAGC GGATTGTGAA ATCACAGAAT GTGATGGAAA GGGCATTCAA AGATACGGTT TAAGATTTTT TTCAGCAAAA GGGTATGAAG GCAATATAAA ATGGATTCCA ACTTTAATGA TACCTGTAGC TAAAAAGGCT ATTAGCAGAT TAAAAGAATT ATCAAGTCAA GCGAGGTTAT TGGCTGCTGA AATTCAAAAG AATTACTCTA ATTCAACGAA GGGAACCCTT AAAGAAAATA TACCTCCTGA TCTCTTTTGG TATGATAGAG AGAAGAAAAT CAAATATTCT AATGCGCTTT GCTTGTTAAC TGAAGGACAG TTAAATCAAA ATAAAAAGGA AATGTCAGAT AAATTATTCA GACCTACAAC GAATTTTTTT AAAACTGATA TCATTGATTC TGATTATATA AAAGGGTATT TTAATGTTTT TAAAAGACAT GGTTATATAA ATGAAGATGG TAGCCCATAT TTGCTAAGAA CACATCAACT AAGGCATCTT CTCAACACAT TTGCTCAAAT AAATGGTATG GATGAATTTA GTATTGCTCG CTGGTCTGGA CGTAAGCTTA TTTCTCAAAA TGTTTCTTAT GACCACAGAT CGCATCTTCA AATGTCTAAA GCAATAAGAG AAAAAAAGTT ATCAGTATGT GTTAATGAGC ACAGAATAAA GGATATTCCA GTAGTGGATC TTAATGAGTT TGACTCACTT AGTAGTGGTG CAGTACTTGT ATCAAAACAT GGCTACTGCA AGCACTCATA TGCGTTTAAG CCGTGTGATA ATTATCCAAT TAAGAACTCT GGTTTAGATA ACGAAACGAT TTCAAATATC CACGATAAAA TTTTAAAAAG AACACTGTAT GATAAAAATG ATGGGAACAT AAATGCTGAT AAATGGTATG AATTCCATAA AAAAATAAAA AAAGGAGAAT AA
|
Protein sequence | MNNIFIFEPS NKNNPLDNVI KFIEFCKSTI SNNNLTTSWE SNKWKGLYRF TKFNSKNNLN SKECLDDSFI NFAKAYMLHV HSFNKSKTKH STLSMLKIVE FVLLKINMEA NVNYCNNSIY DECIRIASEK YSKAHAFAIG KELEKLSSFL NDNRMTNSFY LFWVNPIRYR ITQSWTGYDS SLEGHSRLPD IKSVIAIAEI FSKRDEQLSS RDIFTTSVLA LLMCAPSRIS EILALPADCE ITECDGKGIQ RYGLRFFSAK GYEGNIKWIP TLMIPVAKKA ISRLKELSSQ ARLLAAEIQK NYSNSTKGTL KENIPPDLFW YDREKKIKYS NALCLLTEGQ LNQNKKEMSD KLFRPTTNFF KTDIIDSDYI KGYFNVFKRH GYINEDGSPY LLRTHQLRHL LNTFAQINGM DEFSIARWSG RKLISQNVSY DHRSHLQMSK AIREKKLSVC VNEHRIKDIP VVDLNEFDSL SSGAVLVSKH GYCKHSYAFK PCDNYPIKNS GLDNETISNI HDKILKRTLY DKNDGNINAD KWYEFHKKIK KGE
|
| |