Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2022 |
Symbol | |
ID | 5593748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2021579 |
End bp | 2022628 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640921167 |
Product | putative flagellin |
Protein accession | YP_001458712 |
Protein GI | 157161394 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 62 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAGAACAA CATCAACAAA AACCAGTCTG CGCTGTCGAC TTCTATCGAG CGCCTCTCTT CTGGTCTGCG TATTAACAGC GCTAAAGATG ACGCCGCGGG CCAGGCGATT GCTAACCGCT TTACTTCTAA CATCAAAGGT CTGACTCAGG CCGCACGTAA CGCCAACGAC GGTATTTCTC TGGCGCAGAC GGCTGAAGGC GCGCTGTCAG AGATTAACAA CAACTTGCAG CGTATTCGTG AACTGACCGT TCAGGCCTCT ACCGGCACGA ACTCTGATTC CGACCTGTCT TCTATTCAGG ACGAAATCAA ATCCCGTCTT GATGAAATTG ACCGTGTATC TGGTCAGACC CAGTTCAACG GTGTGAACGT GCTGTCGAAA AACGATTCGA TGAAGATTCA GATTGGTGCC AATGATAACC AGACGATCAG CATTGGCTTG CAACAAATCG ACAGTACCAC TTTGAATCTG AAAGGATTTA CCGTGTCCGG CATGGCGGAT TTCAGCGCGG CGAAACTGAC GGCTGCTGAT GGTACAGCAA TTGCTGCTGC GGATGTCAAG GATGCTGGGG GTAAACAAGT CAATTTACTG TCTTACACTG ACACCGCGTC TAACAGTACT AAATATGCGG TCGTTGATTC TGCAACCGGT AAATACATGG AAGCCACTGT AGCCATTACC GGTACGGCGG CGGCGGTAAC TGTTGGTGCA GCGGAAGTGG CGGGAGCCGC TACAGCCGAT CCGTTAAAAG CACTGGATGC CGCAATCGCT AAAGTCGACA AATTCCGCTC CTCCCTCGGT GCCGTTCAAA ACCGTCTGGA TTCTGCGGTC ACCAACCTGA ACAACACCAC CACCAACCTG TCTGAAGCGC AGTCCCGTAT TCAGGACGCC GACTATGCGA CCGAAGTGTC CAACATGTCG AAAGCGCAGA TTATCCAGCA GGCGGGCAAC TCCGTGCTGT CTAAAGCCAA CCAGGTACCG CAGCAAGTTC TGTCTCTGTT ACAAGGCTAA
|
Protein sequence | MAQVINTNSL SLITQNNINK NQSALSTSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG LTQAARNAND GISLAQTAEG ALSEINNNLQ RIRELTVQAS TGTNSDSDLS SIQDEIKSRL DEIDRVSGQT QFNGVNVLSK NDSMKIQIGA NDNQTISIGL QQIDSTTLNL KGFTVSGMAD FSAAKLTAAD GTAIAAADVK DAGGKQVNLL SYTDTASNST KYAVVDSATG KYMEATVAIT GTAAAVTVGA AEVAGAATAD PLKALDAAIA KVDKFRSSLG AVQNRLDSAV TNLNNTTTNL SEAQSRIQDA DYATEVSNMS KAQIIQQAGN SVLSKANQVP QQVLSLLQG
|
| |