Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2024 |
Symbol | fliD |
ID | 5593204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2022876 |
End bp | 2024291 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640921168 |
Product | flagellar capping protein |
Protein accession | YP_001458713 |
Protein GI | 157161395 |
COG category | [N] Cell motility |
COG ID | [COG1345] Flagellar capping protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAGTA TTTCAACGCT TGGGGTTGGG TCGGGGCTGG ACTTAAGCAG CATCCTTGAC AGTCTGGAGG CGGCAGAAAA ATCTACCCTG ACGCCGATAT CAAAACAGCA AAGCTCTTAC ACCGCGAAAC TGAGCGCCTA CGGGACGCTG AAAAGTGCAC TAGAGAGTTT CCAGACAGCC AACACAGCAC TGAATAAAGC CGATTTATTT ACCGCCACCT CAACCACCAG CAGTTCATCG GCTTTCAGCG CCACCACCAC GGGCAGCGCG ATTGCTGGGA AATACACCAT TAGCGTTAGC CAACTGGCGC AGGCACAGAC GCTGACAACA AAAAATAGTC AAAAAGACAG TAAAGCCGCC ATCGCCACCA GCGATAGCGT ACTGACCATC CAACAAGGCG GCGGTAAAGA TCCGGTCACG ATTGATATCA GCGCGGGTAA CTCGTCACTT TCCGGCATTC GCGATGCCAT TAACAACGCC AAAGCTGGTG TCAGCGCCAG TATCATCAAT GTGGGAAACG GCGAATATCG TCTGTCCATC ACCGCTAACG ATACGGGCAG CAATAACGCT ATGAAACTTA GCGTCAGCGG CGACAGCGCC CTGGAAAGTT TTATGGGTTA CAACGGAACG CCGGGTGATA GCAGCAATGG CATGATTGAA AGCGTCACCG CGCAAAACGC CAAACTGACA GTCAATAACG TTGAGATTGA AAACAGCAGT AACACTATCA GCGACTCACT GGAAGATATC ACCCTCAACC TGAATGACGT CACCACCGGT AATCAGACAC TGACCATCAG TAAAGACACA TCCAAAGCTG AAAATGCGGT TAAAGCGTGG GTGGATGCCT ATAACACATT GCAGGATACA TTCAGCAGCT TAACCAAATA CACCGCTGTC GATGCCGGAG CTGAAAGTCA GGACTCCAGT AATGGCGCAT TACTCGGTGA CTCCACACTG CGTACCATCC AGACTCAGCT TAAAACCTTG CTCGCCAATA CTCACAGCAG TTCAAACTAT AAAACGCTGG CACAGATTGG CATTACCAGC GATGCCAGTA CCGGCAAGCT GGAAATCGCG ACCGACAAAC TGCAAACGGC GCTAAAAAAT GATGCCGCTG GTATCGGTGA AATGTTTATA GGTGACGGCA AAAGTACTGG GGTAACCACC GGCATTAGTA ACAACCTGAC CAGTTGGCTC TCTTCCACCG GGATAATCCA GGCAGCAAAA GATGGCGTCA GCAAAACGCT GAATAATTTG ACCGACCAGT ACAACGCAGC CAGTGAACGT ATCGATACCT TAATGACGCG CTATAAAGCC CAGTTCACCC AATTAGATGT GTTAATGAAC TCGCTGAACT CCACTAGTAG CTATCTGACA CAGCAGTTCG ACACCTCAAA TAGCAACTCG AAATAA
|
Protein sequence | MASISTLGVG SGLDLSSILD SLEAAEKSTL TPISKQQSSY TAKLSAYGTL KSALESFQTA NTALNKADLF TATSTTSSSS AFSATTTGSA IAGKYTISVS QLAQAQTLTT KNSQKDSKAA IATSDSVLTI QQGGGKDPVT IDISAGNSSL SGIRDAINNA KAGVSASIIN VGNGEYRLSI TANDTGSNNA MKLSVSGDSA LESFMGYNGT PGDSSNGMIE SVTAQNAKLT VNNVEIENSS NTISDSLEDI TLNLNDVTTG NQTLTISKDT SKAENAVKAW VDAYNTLQDT FSSLTKYTAV DAGAESQDSS NGALLGDSTL RTIQTQLKTL LANTHSSSNY KTLAQIGITS DASTGKLEIA TDKLQTALKN DAAGIGEMFI GDGKSTGVTT GISNNLTSWL SSTGIIQAAK DGVSKTLNNL TDQYNAASER IDTLMTRYKA QFTQLDVLMN SLNSTSSYLT QQFDTSNSNS K
|
| |