Gene Information |
Locus tag | ECH74115_2698 |
Symbol | |
ID | 6968196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2531990 |
End bp | 2533747 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643386559 |
Product | flagellin |
Protein accession | YP_002271038 |
Protein GI | 209399146 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
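The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Neither assumption is stated in the record, but both match its values. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2531990, 2533747)
print(length)                     # 1758
print(protein_length_aa(length))  # 585
```

Because the gene lies on the minus strand, Start bp and End bp give the position on the replicon. The coding sequence itself reads from End bp toward Start bp on the reverse complement.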
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.132656 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC GCGAAGGATG ACGCCGCAGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC CTGACTCAGG CGGCCCGTAA CGCCAACGAC GGTATTTCTG TTGCGCAGAC CACCGAAGGC GCGCTGTCCG AAATCAACAA CAACTTACAG CGTATTCGTG AACTGACGGT TCAGGCCACT ACAGGGACTA ACTCCGATTC TGACCTGGAC TCCATCCAGG ACGAAATCAA ATCTCGTCTT GATGAAATTG ACCGCGTATC CGGCCAGACC CAGTTCAACG GCGTGAACGT GCTGGCGAAA GACGGTTCAA TGAAAATTCA GGTTGGTGCG AATGACGGCG AAACCATCAC GATCGACCTG AAAAAAATCG ATTCTGATAC TCTGGGTCTG AATGGCTTTA ACGTAAATGG TAAAGGTACT ATTACCAACA AAGCTGCAAC GGTAAGTGAT TTAACTTCTG CTGGCGCGAA GTTAAACACC ACGACAGGTC TTTATGATCT GAAAACCGAA AATACCTTGT TAACTACCGA TGCTGCATTC GATAAATTAG GGAATGGCGA TAAAGTCACA GTTGGCGGCG TAGATTATAC TTACAACGCT AAATCTGGTG ATTTTACTAC CACTAAATCT ACTGCTGGTA CGGGTGTAGA CGCCGCGGCG CAGGCTGCTG ATTCAGCTTC AAAACGTGAT GCGTTAGCTG CCACCCTTCA TGCTGATGTG GGTAAATCTG TTAATGGTTC TTACACCACA AAAGATGGTA CTGTTTCTTT CGAAACGGAT TCAGCAGGTA ATATCACCAT CGGTGGAAGC CAGGCATACG TAGACGATGC AGGCAACTTG ACGACTAACA ACGCTGGTAG CGCAGCTAAA GCTGATATGA AAGCGCTGCT CAAAGCAGCG AGCGAAGGTA GTGACGGTGC CTCTCTGACA TTCAATGGCA CAGAATATAC CATCGCAAAA GCAACTCCTG CGACAACCAC TCCAGTAGCT CCGTTAATCC CTGGTGGGAT TACTTATCAG GCTACAGTGA GTAAAGATGT AGTATTGAGC GAAACCAAAG CGGCTGCCGC GACATCTTCA ATTACCTTTA ATTCCGGTGT ACTGAGCAAA ACTATTGGGT TTACCGCGGG TGAATCCAGT GATGCTGCGA AGTCTTATGT GGATGATAAA GGTGGTATCA CTAACGTTGC CGACTATACA GTCTCTTACA GCGTTAACAA GGATAACGGC TCTGTGACTG TTGCCGGGTA TGCTTCAGCG ACTGATACCA ATAAAGATTA TGCTCCAGCA ATTGGTACTG CTGTAAATGT GAACTCCGCG GGTAAAATCA CTACTGAGAC TACCAGTGCT GGTTCTGCAA CGACCAACCC GCTTGCTGCC CTGGACGACG CAATCAGCTC CATCGACAAA TTCCGTTCTT CCCTGGGTGC TATCCAGAAC CGTCTGGATT CCGCAGTCAC CAACCTGAAC AACACCACTA CCAACCTGTC CGAAGCGCAG TCCCGTATTC AGGACGCCGA CTATGCGACC GAAGTGTCCA ACATGTCGAA AGCGCAGATC ATTCAGCAGG CCGGTAACTC CGTGCTGGCA AAAGCTAACC AGGTACCGCA GCAGGTTCTG TCTCTGCTGC AGGGTTAA
|
Protein sequence | MAQVINTNSL SLITQNNINK NQSALSSSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG LTQAARNAND GISVAQTTEG ALSEINNNLQ RIRELTVQAT TGTNSDSDLD SIQDEIKSRL DEIDRVSGQT QFNGVNVLAK DGSMKIQVGA NDGETITIDL KKIDSDTLGL NGFNVNGKGT ITNKAATVSD LTSAGAKLNT TTGLYDLKTE NTLLTTDAAF DKLGNGDKVT VGGVDYTYNA KSGDFTTTKS TAGTGVDAAA QAADSASKRD ALAATLHADV GKSVNGSYTT KDGTVSFETD SAGNITIGGS QAYVDDAGNL TTNNAGSAAK ADMKALLKAA SEGSDGASLT FNGTEYTIAK ATPATTTPVA PLIPGGITYQ ATVSKDVVLS ETKAAAATSS ITFNSGVLSK TIGFTAGESS DAAKSYVDDK GGITNVADYT VSYSVNKDNG SVTVAGYASA TDTNKDYAPA IGTAVNVNSA GKITTETTSA GSATTNPLAA LDDAISSIDK FRSSLGAIQN RLDSAVTNLN NTTTNLSEAQ SRIQDADYAT EVSNMSKAQI IQQAGNSVLA KANQVPQQVL SLLQG
|
| |
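The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that the gene sequence is shown 5' to 3' on the coding strand (it starts with ATG and ends with TAA), and that the spaces are only 10-character display grouping. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard codon table is used here. The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring display whitespace.

    Translation stops at the first stop codon, which is not emitted.
    """
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the gene sequence in this record.
print(translate("ATGGCACAAG TCATTAATAC CAACAGCCTC"))  # MAQVINTNSL
print(translate("TCTCTGCTGC AGGGTTAA"))               # SLLQG
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to the same function should yield the full 585-residue protein, though that has not been checked here.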