Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3357 |
Symbol | |
ID | 6067442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3678027 |
End bp | 3679343 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641602771 |
Product | flagellar hook-associated 2 domain-containing protein |
Protein accession | YP_001726303 |
Protein GI | 170021349 |
COG category | [N] Cell motility |
COG ID | [COG1345] Flagellar capping protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.507353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAACC CAAGAACCAT TGCGCAAGAA ATCGCTTATG CGGATGTTGC CACTCAGGCA GCCAATTTGC AGGAGAAGCA GAGCGAGCTG GATGCTGAAA GCAGCGGCCT GGACTCGCTC AGCTCAGCGT TGAGCGATTT CCAGAGCGCC GTTGACGCGC TGAACAGCGA TACCGACGGC CCGGTGACTT TTGCCGCCAC CAGCAATAAT GACTCGGCGA CCGTTTCCGC CAATTCCCAG GCGCAGGCGG GAAGCTACTC ATTTTTCGTT GAGCAACTGG CGCAGGGGCA GCAGACCACA TTCAGTATGG GCGACGACGC CTTTTCCGCC ACCGGCACCT TCGAACTGAC GATGGGCGAC AGCACCATGG ATATCGATCT GTCGGCGGCA GATCAGAACG GTGACGGCGA TGGTTTTATC GACGCCAGCG AGCTGGTGAA TGCGATTAAC GACTCCGATG ACAATCCAGG CGTGTCGGCG GCACTGGTAA AAACCGACGG CACCACCACG ATTATGCTCA CCTCCGATAG CACCGGGGCG CAGAGCGCGT TTTCGGTGAG CGTAACGGGG CATGATGCCA GCAACGATAG CACCAGCGCG CCCGTTGCCA CGGATGTTTC CTCCGCTCAG GACGCGATTA TTCATCTCGG TAGCGCCACA GGGCCTGCGA TCACCAACAG CAGCAATACA TTTGATGATG TGATCCCCGG CGTCACCATG ACTTTTACCG AAGTCAGCGA TTCTGACAGC GATCTCACCA CCTTTAACAT CAGCGAAGAT TCCAGCGCCA GCCAGGAGAA AGTACAGACC TTTGTCGATG CCTATAACAC CTTGATTGAT ACGGTTGATT CGCTGACCAC TCACGGTGAT GACAGTACCA GCGCCGGGGT ATTTGCAGGC GACGCCGGGC TCAGTTCACT GGCGAACCAG CTCGATGACA TCGCCCATGC CAGCTACAAC GGCGTGTCGA TTGTTGACTA TGGCATCACC CTCGATTCTC ACGGCCACTT ACAGATTGAC TCCGACCAGT TCAACGACGA AATGGCGAAA AATCCTGACG GTCTGACCTC TATTTTCGTT GGCGACAACA GCATGGTGGC GCAGATGGAC GACCTGATTA ACACCTACAC GGACTCCAGC AACGGCATCA TCACCCTGCG CCAGCAGAAC ATTGACGATC AGATGAGCAA AATTCAGGAC GAAGGCGATC AGCTTACCGA TACCTATAAC GCCAACTACG ACCGTTATCT GGAGGAGTAC ACCAACACGC TGGTTGAGGT GTACACCATG AAAGCCAGCA TGGCGGCATT CGCGTAA
|
Protein sequence | MINPRTIAQE IAYADVATQA ANLQEKQSEL DAESSGLDSL SSALSDFQSA VDALNSDTDG PVTFAATSNN DSATVSANSQ AQAGSYSFFV EQLAQGQQTT FSMGDDAFSA TGTFELTMGD STMDIDLSAA DQNGDGDGFI DASELVNAIN DSDDNPGVSA ALVKTDGTTT IMLTSDSTGA QSAFSVSVTG HDASNDSTSA PVATDVSSAQ DAIIHLGSAT GPAITNSSNT FDDVIPGVTM TFTEVSDSDS DLTTFNISED SSASQEKVQT FVDAYNTLID TVDSLTTHGD DSTSAGVFAG DAGLSSLANQ LDDIAHASYN GVSIVDYGIT LDSHGHLQID SDQFNDEMAK NPDGLTSIFV GDNSMVAQMD DLINTYTDSS NGIITLRQQN IDDQMSKIQD EGDQLTDTYN ANYDRYLEEY TNTLVEVYTM KASMAAFA
|
| |