Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0112 |
Symbol | pilC |
ID | 6968931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 118992 |
End bp | 120194 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384189 |
Product | type IV pilin biogenesis protein |
Protein accession | YP_002268712 |
Protein GI | 209397106 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0263524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGTA AGCAACTCTG GCGCTGGCAT GGCATAACCG GCGACGGCAA TGCGCAAGAT GGGATGCTAT GGGCAGAGAG CCGTACTTTA CTGCTTATGG CACTACAGCA ACAGATGGTT ACCCCACTAA GCCTGAAGCG AATTGCTATC AATTCTGCGC AGTGGCGAGG AGATAAAAGC GCGGAAGTCA TTCATCAACT GGCGACGCTA CTTAAAGCCG GGTTAACGCT TTCTGAAGGG CTGGCACTGC TGGCGGAACA GCATCCCAGT AAGCAATGGC AAGCGTTGCT GCAATCTCTG GCACACGATC TCGAACAGGG CATTGCCTTT TCCAATGCCT TATTACCCTG GTCAGAGGTA TTTCCGCCGC TCTATCAGGC GATGATCCGC ACGGGTGAAC TGACCGGTAA GCTGGATGAA TGCTGCTTTG AACTGGCGCG TCAGCAAAGA GCCCAGCGTC AGTTGACCGA CAAAGTAAAA TCAGCGTTAC GTTATCCCAT CATCATTTTA GCGATGGCAA TCATGGTGGT TGTGGCAATG CTGCATTTTG TTCTGCCGGA GTTTGCCGCT ATCTATAAGA CCTTCAACAC CCCACTACCG GCACTAACGC AGGGGATCAT GACGCTGGCA GACTTTAGTG GCGAATGGGG CTGGCTGCTG GTGTTGTTCG GCATTATGCT GACGATAGCC AATAAGTTGC TGATGCGCCG CCCGACCTGG CTTATCGCGC GGCAGAAATT GCTGTTACGC ATCCCGATTA TGGGGTCACT GATGCGGGGA CAAAAACTCA CGCAGATCTT TACGATTCTG GCGCTGACTC AAAGTGCAGG CATTACTTTT TTACAGGGCG TAGAGAGCGT CAGAGAAACA ATGCGCTGCC CGTACTGGGT GCAACTTCTG ACACAAATCC AGCACGATAT CAGTAACGGT CATCCCATCT GGCTGGCGTT AAAAAATGCC GGTGAGTTTA GCCCGCTCTG TTTGCAATTA GTGAGAACGG GAGAGGCCTC CGGCTCGCTG GATCTCATGT TAGACAACAT CGCCCATCAT CATCGGGATA ACACAATGGC GCTGGCGGAT AACCTAGCAG CCTTACTGGA ACCGGCGTTG CTGATCATAA CGGGAGGAAT TATCGGTACG CTGGTGGTGG CGATGTATCT GCCAATTTTC CATTTAGGCG ATGCGATGAG TGGGATGGGA TAA
|
Protein sequence | MASKQLWRWH GITGDGNAQD GMLWAESRTL LLMALQQQMV TPLSLKRIAI NSAQWRGDKS AEVIHQLATL LKAGLTLSEG LALLAEQHPS KQWQALLQSL AHDLEQGIAF SNALLPWSEV FPPLYQAMIR TGELTGKLDE CCFELARQQR AQRQLTDKVK SALRYPIIIL AMAIMVVVAM LHFVLPEFAA IYKTFNTPLP ALTQGIMTLA DFSGEWGWLL VLFGIMLTIA NKLLMRRPTW LIARQKLLLR IPIMGSLMRG QKLTQIFTIL ALTQSAGITF LQGVESVRET MRCPYWVQLL TQIQHDISNG HPIWLALKNA GEFSPLCLQL VRTGEASGSL DLMLDNIAHH HRDNTMALAD NLAALLEPAL LIITGGIIGT LVVAMYLPIF HLGDAMSGMG
|
| |