Gene Information |
Locus tag | SbBS512_E0098 |
Symbol | pilC |
ID | 6270048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 105382 |
End bp | 106584 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641724356 |
Product | type IV pilin biogenesis protein |
Protein accession | YP_001878915 |
Protein GI | 187732598 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
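The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 105382
END_BP = 106584


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1203, matching "Gene Length"
print(protein_length(length_bp))  # 400, matching "Protein Length"
```

Under these assumptions, both derived values agree with the figures listed in the table.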
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00354832 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGTA AGCAACTCTG GCGCTGGCAT GGCATAACCG GCGACGGCAA TGCGCAAGAT GGGATGCTAT GGGCAGAGAG CCGTGCTTTG CTGCTCATGG CACTACAGCA ACAGATGGTT ACCCCACTTA GCCTGAAGCG AATCGCCATC AATTCTGCGC AGTGGCGAGG AGATAAAAGC GCGGAAGTCA TTCATCAACT GGCGACGCTA CTCAAAGCCG GGTTAACGCT TTCTGAAGGG CTGGCACTGC TGGCGGAACA GCATCCCAGT AAGCAATGGC AAGCGTTGCT GCAATCGCTG GCGCACGATC TCGAACAGGG CATTGCTTTT TCCAATGCCT TATTACCCTG GTCAGAGGTA TTTCCGCCAC TCTATCAGGC GATGATCCGC ACGGGTGAAC TGACCGGTAA GCTGGATGAA TGCTGCTTTG AACTGGCGCG TCAGCAAAAA GCCCAGCGTC AGTTGACCGA CAAAGTGAAA TCAGCGTTAC GTTATCCCAT CATCATTTTA GCGATGGCAA TCATGGTGGT TGTGGCAATG CTGCATTTTG TTCTGCCGGA GTTTGCCGCT ATCTATAAGA CCTTCAACAC CCCACTACCG GCACTAACGC AGGGGATCAT GACGCTGGCA GACTTTAGTG GCGAATGGAG CTGGCTGCTG GTGTTGTTCG GCTTTCTGCT GGCGATAGCC AATAAGTTGC TGATGCGCCG ACCGACTTGG CTTATAGTGC GGCAGAAATT GCTGTTACGC ATCCCGATTA TGGGTTCACT GATGCGGGGA CAAAAACTCA CGCAGATCTT TACGATTCTG GCGCTGACAC AAAGTGCAGG CATTACTTTT TTGCAGGGCG TAGAGAGCGT CAGAGAAACA ATGCGCTGCC CGTACTGGGT GCAACTTCTG ACACAAATCC AGCACGATAT CAGTAACGGT CATCCCATCT GGCTGGCGCT AAAAAATACC GGGGAGTTTA GCCCGCTCTG TTTGCAATTA GTGAGAACAG GAGAGGCATC CGGCTCGCTG GACCTCATGT TAGACAACCT CGCCCATCAT CATCGGGATA ACACAATGGC GCTGGCGGAT AACCTCGCAG CCTTACTGGA ACCGGCGTTG CTGATCATAA CGGGAGGAAT TATCGGTACG CTGGTGGTGG CGATGTATCT GCCAATTTTC CATTTAGGCG ATGCGATGAG TGGGATGGGA TAA
|
Protein sequence | MASKQLWRWH GITGDGNAQD GMLWAESRAL LLMALQQQMV TPLSLKRIAI NSAQWRGDKS AEVIHQLATL LKAGLTLSEG LALLAEQHPS KQWQALLQSL AHDLEQGIAF SNALLPWSEV FPPLYQAMIR TGELTGKLDE CCFELARQQK AQRQLTDKVK SALRYPIIIL AMAIMVVVAM LHFVLPEFAA IYKTFNTPLP ALTQGIMTLA DFSGEWSWLL VLFGFLLAIA NKLLMRRPTW LIVRQKLLLR IPIMGSLMRG QKLTQIFTIL ALTQSAGITF LQGVESVRET MRCPYWVQLL TQIQHDISNG HPIWLALKNT GEFSPLCLQL VRTGEASGSL DLMLDNLAHH HRDNTMALAD NLAALLEPAL LIITGGIIGT LVVAMYLPIF HLGDAMSGMG
|
| |
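The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that grouping, estimate GC content, and translate the coding sequence follows. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the record lists the minus strand), and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two display blocks of the gene sequence from the record.
prefix = "ATGGCGAGTA AGCAACTCTG"
print(translate(prefix))  # MASKQL, the first six residues of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow a comparison with the listed GC content and protein sequence.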