Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1716 |
Symbol | |
ID | 6065051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1912864 |
End bp | 1914582 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641601128 |
Product | flagellin |
Protein accession | YP_001724693 |
Protein GI | 170019739 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.363061 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC GCGAAGGATG ACGCAGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC CTGACTCAGG CTGCACGTAA CGCCAATGAC GGTATTTCTG TTGCACAGAC CACTGAAGGC GCGCTGTCCG AAATCAACAA CAACTTACAG CGTGTGCGTG AACTGACCGT TCAGGCGACC ACCGGTACCA ACTCCCAGTC TGATCTGGAC TCTATCCAGG ACGAAATCAA ATCCCGTTTG GACGAAATTG ACCGCGTATC TGGTCAGACT CAGTTCAACG GCGTGAACGT ACTGGCAAAA GACGGTTCGA TGAAAATTCA GGTTGGCGCG AATGATGGCC AGACCATCAC TATCGACCTG AAGAAGATTG ACTCTTCTAC GTTGAAACTG ACTGGTTTTA ACGTGAATGG TTCTGGTTCT GTGGCGAATA CTGCGGCGAC TAAAGCTGAT TTGGCTGCTG CTGCAATTGG TACCCCTGGG GCAGCAGATT CTACAGGTGC CATTGCTTAC ACAGTAAGTG CTGGGCTGAC TAAAACTACA GCCGCAGATG TACTGTCTAG CCTCGCTGAT GGTACGACTA TTACAGCCAC AGGCGTGAAA AATGGCTTTG CTGCAGGAGC CACTTCCAAT GCCTATAAAC TTAACAAAGA TAATAATACA TTTACTTATG ACACGACTGC TACGACAGCT GAGCTGCAGT CTTACCTGAC TCCGAAAGCG GGCGACACTG CAACATTCAG TGTTGAAATT GGTGGTACTA CACAAGACGT CGTGCTGTCC AGTGATGGCA AACTCACTGC TAAGGATGGC TCTAAGCTTT ACATTGATAC AACTGGTAAC TTAACTCAGA ATGGTGGTAA TAACGGTGTT GGAACACTCG CGGAAGCGAC TCTGAGTGGT TTAGCTCTGA ACAACAATAA TGGTGCAGCG GCTGTTAAAT CCACAATTAC TACAGCTGAT AACACTTCGA TTGTACTGAA TGGTTCAAGC GATGGTACTG AAGGTACGAT TGCTGTTACA GGCGCTGTAA TTAGTTCAGC TGCTCTGCAA TCTGCAAGCA AAACGACTGG TTTCACTGTT GGTACAGCAG ACACAGCTGG TTATATCTCT GTAGGTACTG ATGGGAGTGT TCAGGCATAT GATGTTGCGA CTTCTGGCAA CAAAGATTCT TACACCAACA CTGACGGTAC ACTGACTACT GATAACACCA CTAAACTGTA TCTGCAGAAA GATGGCTCTG TAACCAACGG TTCAGGTAAA GCGGTCTATG TAGAAGCGGA TGGTGATTTC ACTACCGACG CTGCAACCAA AGCCGCAACC ACCACCGATC CGCTGGCCGC TCTGGATGAC GCAATCAGCC AGATCGACAA GTTCCGTTCA TCCTTGGGTG CTATCCAGAA CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT ACCAACCTGT CCGAAGCGCA GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC AACATGTCGA AAGCGCAGAT TATTCAGCAG GCAGGTAACT CCGTGCTGGC AAAAGCTAAC CAGGTACCGC AGCAGGTTCT GTCTCTGCTG CAGGGTTAA
|
Protein sequence | MAQVINTNSL SLITQNNINK NQSALSSSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG LTQAARNAND GISVAQTTEG ALSEINNNLQ RVRELTVQAT TGTNSQSDLD SIQDEIKSRL DEIDRVSGQT QFNGVNVLAK DGSMKIQVGA NDGQTITIDL KKIDSSTLKL TGFNVNGSGS VANTAATKAD LAAAAIGTPG AADSTGAIAY TVSAGLTKTT AADVLSSLAD GTTITATGVK NGFAAGATSN AYKLNKDNNT FTYDTTATTA ELQSYLTPKA GDTATFSVEI GGTTQDVVLS SDGKLTAKDG SKLYIDTTGN LTQNGGNNGV GTLAEATLSG LALNNNNGAA AVKSTITTAD NTSIVLNGSS DGTEGTIAVT GAVISSAALQ SASKTTGFTV GTADTAGYIS VGTDGSVQAY DVATSGNKDS YTNTDGTLTT DNTTKLYLQK DGSVTNGSGK AVYVEADGDF TTDAATKAAT TTDPLAALDD AISQIDKFRS SLGAIQNRLD SAVTNLNNTT TNLSEAQSRI QDADYATEVS NMSKAQIIQQ AGNSVLAKAN QVPQQVLSLL QG
|
| |