Gene Information |
Locus tag | ECH74115_1455 |
Symbol | flgE |
ID | 6967515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1435304 |
End bp | 1436509 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385428 |
Product | flagellar hook protein FlgE |
Protein accession | YP_002269922 |
Protein GI | 209395987 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] flagellar hook-basal body proteins |
|
|
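The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1435304
end_bp = 1436509
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so it adds no residue.
codons = span // 3
expected_aa = codons - 1

print(span, span % 3, expected_aa)  # prints: 1206 0 401
```

Under those assumptions the span matches the listed gene length, the length is a multiple of three, and the codon count minus the stop codon matches the listed protein length.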
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00159718 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC TTTACCGATG GCACGACCAC CAACACCGGG CGTGGTCTGG ACGTTGCTAT CAGCCAGAAC GGTTTTTTCC GTCTGGTAGA TAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT AAGCTGGATG AAAATCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACTAACAT TTCGATCCCG AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCGTCGA TGCAGATCAA CCTGAATTCC AGCGATCCGC TTCCCTCTGT TAACGCATTT GATGCCAGCA ATGCGGATAG CTATAACAAA AAAGGTTCGG TGACTGTTTT CGACAGTCAG GGTAATGCTC ATGACATGAG CGTCTACTTT GTGAAGACCG GGGATAATAA CTGGCAGGTC TACACCCAGG ATAGCAGTGA TCCAACAGGT ACAGCCGAGC CTGCAATGAA GCTGGTGTTT AATGCCAATG GCGTTCTGAC CTCAAATCCA ACAGAGAATA TTACCACCGG CGCAATTAAC GGCGCAGAAC CCGCCACGTT TAGCCTGAGC TTCCTCAACT CCATGCAGCA AAATACCGGC GCTAACAACA TTGTGGCAAC CACCCAGAAT GGCTACAAAC CGGGCGATCT GGTGAGTTAT CAAATCAATG ATGACGGTAC GGTTGTCGGC AACTATTCCA ACGAACAAAC CCAACTGCTG GGGCAGATTG TACTGGCGAA CTTTGCCAAC AACGAAGGTC TGGCATCCGA AGGCGACAAC GTCTGGTCTG CGACGCAATC TTCTGGCGTG GCGCTGTTGG GGACAGCCGG GACGGGCAAC TTTGGCACCC TGACCAACGG TGCGTTGGAA GCGTCCAACG TCGATCTCAG TAAAGAACTG GTCAATATGA TCGTTGCCCA GCGTAACTAT CAGTCTAACG CCCAGACCAT CAAAACCCAG GACCAGATCC TCAACACGCT GGTTAACTTA CGCTAA
|
Protein sequence | MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPSVNAF DASNADSYNK KGSVTVFDSQ GNAHDMSVYF VKTGDNNWQV YTQDSSDPTG TAEPAMKLVF NANGVLTSNP TENITTGAIN GAEPATFSLS FLNSMQQNTG ANNIVATTQN GYKPGDLVSY QINDDGTVVG NYSNEQTQLL GQIVLANFAN NEGLASEGDN VWSATQSSGV ALLGTAGTGN FGTLTNGALE ASNVDLSKEL VNMIVAQRNY QSNAQTIKTQ DQILNTLVNL R
|
| |
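As a rough consistency check between the gene and protein sequences, the first and last blocks of the gene sequence can be translated by hand. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons), so a single 64-entry table is built in the usual TCAG order. `translate`, `head`, and `tail` are hypothetical names used only here.

```python
from itertools import product

# Codon table in the conventional TCAG order, first base varying slowest.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as
# the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final 6-base block of the gene sequence.
head = "ATGGCCTTTT CTCAAGCGGT TAGCGGATTA"
tail = "CGCTAA"

print(translate(head))  # prints: MAFSQAVSGL
print(translate(tail))  # prints: R*
```

The translated head matches the first ten residues of the protein sequence, and the tail gives the final arginine followed by a stop codon.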