Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4328 |
Symbol | sufI |
ID | 6971013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4004690 |
End bp | 4006102 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643388055 |
Product | repressor protein for FtsI |
Protein accession | YP_002272493 |
Protein GI | 209400459 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2132] Putative multicopper oxidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTCA GTCGGCGTCA GTTCATTCAG GCATCGGGGA TTGCACTTTG TGCAGGCGCT GTTCCCCTGA AGGCCAGCGC AGCCGGGCAA CAGCAACCGC TACCCGTTCC GCCGCTGCTT GAATCTCGCC GTGGGCAACC GCTGTTTATG ACTGTACAAC GTGCGCACTG GTCATTTACG CCAGGGACAC GCGCGTCGGT CTGGGGAATC AATGGTCGTT ACCTGGGGCC GACTATCCGC GTCTGGAAGG GCGACGATGT TAAGCTTATT TACAGCAACC GCCTGACAGA AAATGTCTCA ATGACGGTGG CCGGGCTACA GGTACCAGGC CCGCTGATGG GCGGTCCGGC ACGGATGATG TCGCCAAACG CTGACTGGGC ACCCGTACTG CCTATTCGCC AGAACGCAGC TACTCTGTGG TATCACGCCA ATACTCCCAA CCGCACGGCT CAGCAGGTCT ATAACGGCCT TGCCGGAATG TGGCTGGTGG AAGATGAAGT CAGCAAGTCG CTGCCTATCC CCAACCATTA TGGTGTGGAT GATTTTCCGG TCATTATCCA GGATAAACGG CTGGATAACT TTGGTACGCC AGAATACAAC GAACCGGGAA GCGGCGGCTT TGTTGGTGAT ACGCTGCTGG TTAACGGTGT ACAAAGCCCG TACGTTGAAG TCTCGCGTGG CTGGGTGCGC TTGCGACTGC TGAACGCGTC GAACTCTCGT CGCTATCAAC TACAGATGAG CGATGGTCGC CCGTTACATG TGATTTCTGG CGATCAGGGA TTCCTGCCTG CTCCTGTATC GGTGAAGCAA CTTTCGTTGG CACCGGGCGA GCGCCGCGAG ATTCTGGTGG ATATGAGCAA CGGTGATGAA GTGTCGATCA CCTGTGGCGA AGCGGCGAGC ATTGTTGATC GTATTCGTGG CTTCTTTGAG CCATCCAGCA TTCTGGTTTC TACCCTGGTG CTAACGCTGC GCCCAACCGG CCTTCTGCCG CTGGTCACAG ACAGTCTTCC GATGCGCTTG CTGCCAACTG AAATCATGGC CGGTTCGCCG ATTCGCAGTC GCGATATCAG TCTGGGTGAT GACCCGGGTA TTAATGGACA ACTGTGGGAC GTCAACCGTA TTGATGTCAC CGCGCAGCAA GGAACGTGGG AACGCTGGAC GGTACGCGCG GACGAGCCGC AAGCGTTCCA TATTGAAGGC GTGATGTTCC AGATCCGTAA CGTGAATGGT GCGATGCCGT TCCCGGAAGA CAGAGGCTGG AAAGATACCG TTTGGGTTGA CGGACAAGTG GAGCTGCTTG TTTATTTCGG TCAGCCTTCC TGGGCGCACT TCCCGTTCTA CTTCAACAGT CAGACGCTGG AAATGGCGGA CCGTGGCTCG ATTGGGCAAC TGTTAGTCAA TCCGGTACCG TAA
|
Protein sequence | MSLSRRQFIQ ASGIALCAGA VPLKASAAGQ QQPLPVPPLL ESRRGQPLFM TVQRAHWSFT PGTRASVWGI NGRYLGPTIR VWKGDDVKLI YSNRLTENVS MTVAGLQVPG PLMGGPARMM SPNADWAPVL PIRQNAATLW YHANTPNRTA QQVYNGLAGM WLVEDEVSKS LPIPNHYGVD DFPVIIQDKR LDNFGTPEYN EPGSGGFVGD TLLVNGVQSP YVEVSRGWVR LRLLNASNSR RYQLQMSDGR PLHVISGDQG FLPAPVSVKQ LSLAPGERRE ILVDMSNGDE VSITCGEAAS IVDRIRGFFE PSSILVSTLV LTLRPTGLLP LVTDSLPMRL LPTEIMAGSP IRSRDISLGD DPGINGQLWD VNRIDVTAQQ GTWERWTVRA DEPQAFHIEG VMFQIRNVNG AMPFPEDRGW KDTVWVDGQV ELLVYFGQPS WAHFPFYFNS QTLEMADRGS IGQLLVNPVP
|
| |