Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0109 |
Symbol | pilC |
ID | 6142966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 121038 |
End bp | 122240 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615010 |
Product | type IV pilin biogenesis protein |
Protein accession | YP_001742226 |
Protein GI | 170681709 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0111048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.534441 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGTA AGCAACTCTG GCGCTGGCAT GGCATAACCG GCGACGGCAA TGCGCAAGAT GGGATGCTAT GGGCTGAGAG CCGTGCTTTG CTGCTCATAG CACTACAGCA ACAGATGGTT ACCCCACTTA GCCTGAAGCG AATCGCCATC AATTCTGCGC AGTGGCGAGG AGATAAAAGC GCGGAAGTCA TTCATCAACT GGCGACGCTA CTCAAAGCCG GGTTAACGCT TTCTGAAGGG CTGGCACTGC TCGCGGAACA GCATCCCAGT AAGCAATGGC AAGCGTTGCT GCAATCGCTG GCGTACGATC TCGAACAGGG CATTGCTTTT TCCAATGCCT TATTACCCTG GTCAGAGGTA TTTCCGCCGC TCTATCAGGC GATGATCCGC ACGGGTGAAC TGACCGGTAA GCTGGATGAA TGCTGCTTTG AACTGGCGCG TCAGCAAAAA GCCCAGCGTC AGTTGACCGA CAAAGTGAAA TCAGCGTTAC GTTATCCCAT TATCATTTTA GCGATGGCAA TCATGGTGGT TGTGGCAATG CTGCATTTTG TTCTACCGGA GTTTGCCGCT ATCTATAAGA TCTTTAACAC TCCGCTACCG GCGCTAACGC AGGGGATCAT GACGCTGGCA AGTTTTAGTG GCGAATGGGG TTGGTTGCTG GTGCTATTCG GCTTTCTGCT GGCGATAGCC AATAAGCTGC TGATGCGCCA CCCAACCAGG CTTATCGCAC GGCAGAAATT GCTGTTACGC ATCCCGATTA TGGGTTCACT GATGCGGGGA CAAAAACTCA CGCAGATCTT TACGATTCTG ACGCTGACAC AAAGTGCAGG CATTTCTTTT TTACAGGGCG TTGAGAGCGT CAGAGAAACA ATGCGCTGCC CGTACTGGGT GCAACTTCTG ACACAAATCC AGCACGATAT CAGTAACGGT CACCCCATCT GGCTGGCGCT AAAAAATGCC GGTGAGTTTA GCCCGCTCTG TTTGCAATTA GTGAGAACAG GAGAGGCATC CGGCTCGCTG GATCTCATGT TAGACAACCT CGCCCATCAT CATCGGGATA ACACAATGGC GCTGGCGGAT AACCTCGCAG CCTTACTGGA ACCGGCGTTG CTGATCATAA CGGGAGGAAT TATCGGTACG CTGGTGGTGG CAATGTATCT GCCAATTTTC CATTTAGGCG ATGCGATGAG TGGAATGGGA TAA
|
Protein sequence | MASKQLWRWH GITGDGNAQD GMLWAESRAL LLIALQQQMV TPLSLKRIAI NSAQWRGDKS AEVIHQLATL LKAGLTLSEG LALLAEQHPS KQWQALLQSL AYDLEQGIAF SNALLPWSEV FPPLYQAMIR TGELTGKLDE CCFELARQQK AQRQLTDKVK SALRYPIIIL AMAIMVVVAM LHFVLPEFAA IYKIFNTPLP ALTQGIMTLA SFSGEWGWLL VLFGFLLAIA NKLLMRHPTR LIARQKLLLR IPIMGSLMRG QKLTQIFTIL TLTQSAGISF LQGVESVRET MRCPYWVQLL TQIQHDISNG HPIWLALKNA GEFSPLCLQL VRTGEASGSL DLMLDNLAHH HRDNTMALAD NLAALLEPAL LIITGGIIGT LVVAMYLPIF HLGDAMSGMG
|
| |