Gene Information |
Locus tag | ECH74115_5682 |
Symbol | |
ID | 6972411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5321853 |
End bp | 5322992 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643389315 |
Product | iron-sulfur cluster binding protein |
Protein accession | YP_002273708 |
Protein GI | 209396170 |
COG category | [C] Energy production and conversion |
COG ID | [COG1600] Uncharacterized Fe-S protein |
TIGRFAM ID | [TIGR00276] iron-sulfur cluster binding protein, putative |
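The coordinate and length fields above agree with one another, which can be checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5321853, 5322992)
print(length, protein_length(length))  # prints: 1140 379
```

Both values match the "Gene Length" and "Protein Length" fields of this record.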
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000046188 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAGC CCCTCGATCT CAATCAGTTA GCGCAAAAAA TTAAACAGTG GGGGCTGGAA CTGGGCTTTC AGCAGGTAGG TATTACCGAT ACCGATCTCA GCGAGTCCGA GCCCAAACTG CAAGCATGGC TGGACAAACA ATACCACGGC GAAATGGACT GGATGGCACG TCACGGTATG CTGCGCGCTC GCCCCCATGA GTTATTGCCC GGTACGCTGC GCGTGATCAG CGTGCGGATG AATTACCTTC CTGCTAACGC CGCATTTGCC AGCACGCTGA AAAACCCCAA ACTCGGCTAT GTTAGCCGTT ATGCGCTGGG CCGTGACTAT CACAAACGTC TGCGCAACCG ACTCAAAAAG CTGGGCGAGA TGATTCAGCA ACATTGTGTT TCGCTGAATT TTAGACCGTT TGTCGATTCT GCGCCTATTC TCGAGCGCCC GTTAGCTGAA AAAGCTGGGC TCGGCTGGAC AGGTAAGCAC TCACTTATCC TCAATCGCGA GGCCGGTTCG TTCTTCTTTT TAGGCGAATT GCTGGTCGAT ATTCCGCTGC CCGTGGATCA ACCAGTCGAG GAAGGATGCG GCAAATGCGT GGCCTGTATG ACGATTTGCC CGACCGGTGC CATCGTCGAG CCATATACCG TCGATGCTCG CCGCTGTATC TCTTATCTCA CCATCGAACT GGAAGGGGCG ATCCCGGAAG AGTTGCGACC GTTAATGGGA AACCGTATTT ACGGTTGCGA TGACTGCCAG CTTATCTGCC CGTGGAATCG CTATTCACAA CTCACTACAG AAGACGATTT CAGCCCGCGT AAGCCGCTAC ACGCACCGGA ACTCATTGAG TTATTCGCCT GGAGCGAAGA GAAGTTTTTA AAAGTCACGG AAGGTTCGGC GATTCGCCGT ATAGGTCACC TGCGCTGGCT GCGTAATATC GCCGTGGCAT TAGGCAATGC GCCCTGGGAT GAAACGATTC TGGCGGCGCT CGAGAGTCGC AAAGGTGAGC ACCCACTTCT TGATGAGCAC ATAGCGTGGG CGATGGCGCA GCAAATCGAG AGGCGAAATG CGTGCATAGT CGAAGTGCAA TTGCCGAAAA AACAGCGTCT GGTTCGGGTA ATTGAAAAAG GGTTACCGCG TGACGCCTGA
|
Protein sequence | MSEPLDLNQL AQKIKQWGLE LGFQQVGITD TDLSESEPKL QAWLDKQYHG EMDWMARHGM LRARPHELLP GTLRVISVRM NYLPANAAFA STLKNPKLGY VSRYALGRDY HKRLRNRLKK LGEMIQQHCV SLNFRPFVDS APILERPLAE KAGLGWTGKH SLILNREAGS FFFLGELLVD IPLPVDQPVE EGCGKCVACM TICPTGAIVE PYTVDARRCI SYLTIELEGA IPEELRPLMG NRIYGCDDCQ LICPWNRYSQ LTTEDDFSPR KPLHAPELIE LFAWSEEKFL KVTEGSAIRR IGHLRWLRNI AVALGNAPWD ETILAALESR KGEHPLLDEH IAWAMAQQIE RRNACIVEVQ LPKKQRLVRV IEKGLPRDA
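The relationship between the two sequences above can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First seven and last four codons of the gene sequence above.
print(translate("ATGTCAGAGC CCCTCGATCT C"))  # prints: MSEPLDL
print(translate("CGTGACGCCTGA"))             # prints: RDA
```

The two fragments reproduce the start (MSEPLDL) and end (RDA) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` applies the same check to the whole record.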