Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2046 |
Symbol | flgK |
ID | 6144648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2067310 |
End bp | 2068953 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616922 |
Product | flagellar hook-associated protein FlgK |
Protein accession | YP_001744098 |
Protein GI | 170683657 |
COG category | [N] Cell motility |
COG ID | [COG1256] Flagellar hook-associated protein |
TIGRFAM ID | [TIGR02492] flagellar hook-associated protein FlgK |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.147605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0499745 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCT TGATCAATAA CGCCATGAGC GGACTGAACG CGGCCCAGGC GGCGTTAAAT ACGGCAAGTA ATAATATCTC CAGCTATAAC GTTGCCGGAT ATACCCGCCA AACCACTATT ATGGCGCAGG CCAATAGCAC GTTGGGCGCT GGCGGCTGGG TTGGCAATGG CGTCTACGTT TCTGGTGTGC AGCGTGAGTA TGATGCGTTT ATTACCAACC AGTTACGTGC TGCGCAGACG CAAAGTAGCG GCCTGACTGC CCGCTATGAG CAGATGTCGA AAATCGACAA TATGCTCTCC ACCAGTACCT CTTCGTTGGC AACACAGATG CAGGATTTCT TCACCAGCCT GCAAACGCTG GTGAGTAACG CGGAAGACCC GGCAGCGCGC CAGGCGCTGA TTGGGAAATC AGAAGGGTTG GTGAATCAGT TTAAAACCAC CGATCAATAT CTGCGCGACC AGGACAAACA GGTCAATATC GCGATAGGTG CCAGCGTTGA TCAGATCAAC AATTACGCTA AACAAATCGC CAGCCTGAAC GATCAGATCT CCCGCCTGAC AGGCGTGGGG GCAGGGGCGT CACCTAACAA TCTGCTGGAT CAACGCGATC AACTGGTGAG CGAATTAAAC CAGATTGTTG GCGTAGAAGT CAGCGTTCAG GATGGTGGCA CTTATAACAT CACGATGGCC AATGGCTACT CACTGGTTCA GGGAAGTACG GCGCGGCAGC TGGCGGCAGT TCCTTCCAGC GCCGACCCTT CTCGTACGAC TGTCGCTTAT ATTGATGGGA CGGCTGGCAA TATTGAGATC CCGGAGAAAT TGCTGAATAC CGGGTCGCTG GGCGGCATTC TGACATTCCG TTCTCAGGAT CTGGACCAGA CGCGTAATAC GCTCGGACAA CTGGCGCTGG CATTTGCCGA GGCTTTCAAC ACCCAACACA AAGCCGGATT TGATGCTAAC GGCGATGCCG GTGAAGATTT CTTTGCTATC GGTAAGCCCG CGGTTCTGCA AAACACGAAA AACAAAGGTG ACGTTGCGAT CGGTGCAACG GTAACTGATG CCTCCGCGGT ACTGGCGACA GATTACAAAA TCTCGTTCGA TAATAATCAG TGGCAGGCCA CCCGCCTTGC CAGCAATACC ACTTTTACGG TGACGCCGGA TGCCAACGGT AAAGTGGCAT TTGATGGTCT GGAGTTGACG TTTACAGGAA CGCCTGCCGT TAACGACAGC TTCACGTTGA AACCGGTAAG TGACGCCATC GTCAACATGG ATGTATTAAT CACTGACGAA GCGAAAATCG CCATGGCGAG CGAAGAAGAT GTGGGTGATA GCGACAACCG CAACGGTCAG GCCCTGCTGG ATCTGCAAAG CAACAGTAAA ACGGTGGGCG GTGCGAAATC CTTTAACGAC GCTTATGCCT CGTTAGTGAG TGATATCGGT AATAAAACCG CGACGTTGAA AACCAGTAGC GCCACGCAAG GTAATGTGGT GACGCAGCTT TCCAATCAGC AGCAGTCGAT TTCCGGTGTC AATCTCGATG AGGAGTACGG AAATCTGCAA CGTTTTCAGC AGTATTACCT GGCGAATGCG CAGGTTCTGC AGACGGCAAA CGCGATTTTT GATGCGCTGA TTAACATTCG CTAA
|
Protein sequence | MSSLINNAMS GLNAAQAALN TASNNISSYN VAGYTRQTTI MAQANSTLGA GGWVGNGVYV SGVQREYDAF ITNQLRAAQT QSSGLTARYE QMSKIDNMLS TSTSSLATQM QDFFTSLQTL VSNAEDPAAR QALIGKSEGL VNQFKTTDQY LRDQDKQVNI AIGASVDQIN NYAKQIASLN DQISRLTGVG AGASPNNLLD QRDQLVSELN QIVGVEVSVQ DGGTYNITMA NGYSLVQGST ARQLAAVPSS ADPSRTTVAY IDGTAGNIEI PEKLLNTGSL GGILTFRSQD LDQTRNTLGQ LALAFAEAFN TQHKAGFDAN GDAGEDFFAI GKPAVLQNTK NKGDVAIGAT VTDASAVLAT DYKISFDNNQ WQATRLASNT TFTVTPDANG KVAFDGLELT FTGTPAVNDS FTLKPVSDAI VNMDVLITDE AKIAMASEED VGDSDNRNGQ ALLDLQSNSK TVGGAKSFND AYASLVSDIG NKTATLKTSS ATQGNVVTQL SNQQQSISGV NLDEEYGNLQ RFQQYYLANA QVLQTANAIF DALINIR
|
| |