Gene Information |
Locus tag | EcSMS35_1258 |
Symbol | fliC |
ID | 6144795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1255771 |
End bp | 1257408 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616136 |
Product | flagellin |
Protein accession | YP_001743319 |
Protein GI | 170682489 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
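The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
# Values copied from the Gene Information record above.
START_BP = 1255771
END_BP = 1257408
GENE_LENGTH_BP = 1638
PROTEIN_LENGTH_AA = 545


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codon count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1257408 - 1255771 + 1 gives 1638 bp, and 1638 / 3 - 1 gives 545 aa, which agrees with the Gene Length and Protein Length fields.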
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0516151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC GCGAAGGATG ACGCAGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC CTGACTCAGG CGGCCCGTAA CGCCAACGAC GGTATCTCCG TTGCGCAGAC CACTGAAGGC GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATCCGTG AACTGACGGT TCAGGCTTCT ACCGGGACTA ACTCCGATTC GGATCTGGAC TCCATTCAGG ACGAAATCAA ATCCCGTCTG GACGAAATTG ACCGCGTATC TGGCCAGACC CAGTTCAACG GCGTGAACGT ACTGGCGAAA GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCC AGACTATCAC TATTGATCTG AAGAAAATTG ACTCTGATAC GCTGGGGCTG AGTGGGTTTA ATGTGAATGG TAGCGCAGAT AAGGCAAGTG TCGCGGCGAC AGCTGACGGA ATGGTTAAAG ACGGATATAT CAAAGGGTTA ACTTCATCTG ACGGCAGCAC TGCATATACT AAAACTACAG CAAATACTGC AGCAAAAGGA TCTGATATTC TTGCGGCGCT TAAGACTGGC GATAAAATTA CCGCAACAGG TGCAAATAGC CTTGCTGATA ATGCGACATC GACAACTTAT ACTTATAATG CAACCAGCAA TACCTTCTCC TATACGGCTG ACGGTGTAAA CCAAACGAAT GCTGCAGCAA ATCTCATACC TGCAGCAGGG AAAACGACAG CTGCATCAGT TACTATTGGT GGGACAGCAC AGAATGTAAA TATTGATGAT TCGGGCAATA TTACTTCAAG TGATGGCGAT CAACTTTATC TGGATTCAAC AGGTAACCTG ACTAAAAACC AGGCCGGCAA CCCGAACAAA GCAACCGTTT CTGGGCTTCT CGGAAATACG GATGCGAAAG GTACTGCTGT TAAAACAACC ATCAAGACAG AGGCTGGTGT AACAGTTACA GCTGAAGGTA ATACAGGTAC TGTAAAAATT GAAGGTGCTA CTGTTTCAGC ATCTGCATTT ACGGGCATTG CATATTCCGC CAACACCGGT GGGAATACTT ATGCTGTTGC CGCAAATAAT ACTACAAATG GTTTCCTGGC GGGGGATGCC TTAACCCAGG ATGCTCAAAC TGTTTCAACC TACTACTCGC AAGCCGATGG CACGGTCACG AATAGCGCAG GCAAAGAAAT CTATAAAGAC GCTGATGGTG TCTACAGCAC AGAGAATAAA ACATCGAAGA CGTCCGATCC ATTGGCTGCG CTTGACGACG CAATCAGCTC CATCGACAAA TTCCGTTCAT CCCTGGGTGC TATCCAGAAC CGTCTGGATT CCGCGGTCAC CAACCTGAAC AACACCACTA CCAACCTGTC CGAAGCGCAG TCCCGTATTC AGGACGCCGA CTATGCGACC GAAGTGTCCA ACATGTCGAA AGCGCAGATC ATCCAGCAGG CCGGTAACTC CGTGCTGGCA AAAGCTAACC AGGTACCACA GCAGGTTCTG TCTCTGCTGC AGGGTTAA
|
Protein sequence | MAQVINTNSL SLITQNNINK NQSALSSSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG LTQAARNAND GISVAQTTEG ALSEINNNLQ RIRELTVQAS TGTNSDSDLD SIQDEIKSRL DEIDRVSGQT QFNGVNVLAK DGSMKIQVGA NDGQTITIDL KKIDSDTLGL SGFNVNGSAD KASVAATADG MVKDGYIKGL TSSDGSTAYT KTTANTAAKG SDILAALKTG DKITATGANS LADNATSTTY TYNATSNTFS YTADGVNQTN AAANLIPAAG KTTAASVTIG GTAQNVNIDD SGNITSSDGD QLYLDSTGNL TKNQAGNPNK ATVSGLLGNT DAKGTAVKTT IKTEAGVTVT AEGNTGTVKI EGATVSASAF TGIAYSANTG GNTYAVAANN TTNGFLAGDA LTQDAQTVST YYSQADGTVT NSAGKEIYKD ADGVYSTENK TSKTSDPLAA LDDAISSIDK FRSSLGAIQN RLDSAVTNLN NTTTNLSEAQ SRIQDADYAT EVSNMSKAQI IQQAGNSVLA KANQVPQQVL SLLQG
|
| |
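The relationship between the Gene sequence and Protein sequence fields can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in permitted start codons), and that the 10-character blocks in the record are only display spacing. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing between blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stops appear as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks and last two blocks of the Gene sequence field.
    print(translate("ATGGCACAAG TCATTAATAC CAACAGCCTC"))  # MAQVINTNSL
    print(translate("TCTCTGCTGC AGGGTTAA"))               # SLLQG*
```

The two printed fragments match the start (MAQVINTNSL) and end (SLLQG) of the Protein sequence field, with the final `*` corresponding to the TAA stop codon. Passing the full Gene sequence string to `gc_fraction` would give a value to compare against the GC content field.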