Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2052 |
Symbol | flgE |
ID | 6144330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2073514 |
End bp | 2074719 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616928 |
Product | flagellar hook protein FlgE |
Protein accession | YP_001744104 |
Protein GI | 170683341 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.302696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTTT CTCAAGCGGT TAGCGGATTA AACGCTGCCG CCACCAACCT CGATGTTATT GGCAACAATA TCGCCAACTC CGCCACCTAC GGCTTTAAAT CAGGCACGGC CTCTTTTGCC GATATGTTTG CCGGTTCGAA AGTGGGACTG GGGGTAAAAG TTGCCGGTAT CACTCAGGAC TTTACCGATG GCACGACCAC CAACACCGGG CGAGGTCTGG ACGTTGCTAT CAGCCAGAAC GGTTTTTTCC GTCTGGTAGA CAGCAACGGT TCGGTGTTCT ACAGCCGTAA CGGACAATTT AAGCTGGATG AAAACCGTAA CCTGGTGAAT ATGCAAGGTT TACAGCTGAC GGGTTACCCG GCAACCGGTA CGCCGCCGAC TATTCAGCAA GGGGCGAATC CGACCAATAT TTCGATCCCG AATACCCTGA TGGCAGCGAA AACTACCACC ACGGCGTCGA TGCAGATCAA CCTGAATTCC AGTGATCCGC TTCCCTCTGT TAAAGCATTC GATGCCAGCA ATGCGGATAG CTATAACAAA AAAGGTTCGG TGACTGTTTT CGATAGTCAG GGTAATGCTC ATGATATGAG CGTTTATTTT GTGAAGACCG GGGATAACAA CTGGGATGTT TACACCCTGG ATAGCAGTGA TCCAACAGGC ACAGCTAACC CTGCAACGAC GCTGGTGTTT AATGCCAATG GCGTTCTGAC CTCAGATCCA ACAAAGGATA TTACCACCGG CGCAATTAAC GGCGCAGATC CCGCCACGTT TAGCCTGAGC TTCCTCAACT CCATGCAGCA AAATACCGGC GCGAACAACA TTGTGGCAAC CACCCAGAAC GGCTACAAAC CGGGCGATCT GGTGAGTTAT CAAATCAATG ATGACGGTAC GGTTGTCGGC AACTATTCCA ACGAACAAAC CCAACTGCTG GGGCAGATTG TACTGGCGAA CTTTGCCAAC AACGAAGGTC TGGCATCCGA AGGCGACAAC GTCTGGTCTG CGACGCAATC TTCTGGCGTG GCGCTGTTGG GGACAGCCGG GACGGGCAAC TTTGGCACCC TGACCAACGG TGCGCTGGAA GCGTCCAACG TCGATCTCAG TAAAGAACTG GTCAATATGA TCGTTGCCCA GCGTAACTAT CAGTCTAACG CCCAGACCAT CAAAACCCAG GACCAGATCC TCAACACGCT GGTTAACTTA CGCTAA
|
Protein sequence | MAFSQAVSGL NAAATNLDVI GNNIANSATY GFKSGTASFA DMFAGSKVGL GVKVAGITQD FTDGTTTNTG RGLDVAISQN GFFRLVDSNG SVFYSRNGQF KLDENRNLVN MQGLQLTGYP ATGTPPTIQQ GANPTNISIP NTLMAAKTTT TASMQINLNS SDPLPSVKAF DASNADSYNK KGSVTVFDSQ GNAHDMSVYF VKTGDNNWDV YTLDSSDPTG TANPATTLVF NANGVLTSDP TKDITTGAIN GADPATFSLS FLNSMQQNTG ANNIVATTQN GYKPGDLVSY QINDDGTVVG NYSNEQTQLL GQIVLANFAN NEGLASEGDN VWSATQSSGV ALLGTAGTGN FGTLTNGALE ASNVDLSKEL VNMIVAQRNY QSNAQTIKTQ DQILNTLVNL R
|
| |