Gene Information |
Locus tag | EcolC_3553 |
Symbol | |
ID | 6066593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3883857 |
End bp | 3885059 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641602970 |
Product | type IV pilin biogenesis protein |
Protein accession | YP_001726494 |
Protein GI | 170021540 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
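The coordinate, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3883857
END_BP = 3885059

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range (assumed convention)."""
    return end_bp - start_bp + 1

def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1203
print(protein_length(length_bp))  # 400
```

Under these assumptions the result agrees with the listed Gene Length (1203 bp) and Protein Length (400 aa).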
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000520074 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAGTA AGCAACTCTG GCGCTGGCAT GGCATAACCG GCGACGGCAA TGCGCAAGAT GGGATGCTAT GGGCAGAGAG CCGTACTTTA CTGCTTATGG CACTACAGCA ACAGATGGTT ACCCCACTAA GCCTGAAGCG AATCGCCATC AATTCTGCGC AGTGGCGAGG AGATAAAAGC GCGGAAGTCA TTCATCAACT GGCGACGCTA CTCAAAGCAG GGTTAACGCT TTCTGAAGGG CTGGCTCTGC TGGCGGAACA GCATCCCAGT AAGCAATGGC AAGCGTTGCT GCAATCGCTG GCGCACGATC TCGAACAGGG CATTGCTTTT TCCAATGCCT TATTACCCTG GTCAGAGGTA TTTCCGCCGC TCTATCAGGC GATGATCCGC ACGGGTGAAC TGACCGGTAA GCTGGATGAA TGCTGCTTTG AACTGGCGCG TCAGCAAAAA GCCCAGCGTC AGTTGACCGA CAAAGTGAAA TCAGCGTTAC GTTATCCCAT CATCATTTTA GCGATGGCAA TCATGGTGGT TGTGGCAATG CTGCATTTTG TTCTGCCGGA GTTTGCCGCT ATCTATAAGA CCTTCAACAC CCCACTACCG GCACTAACGC AGGGGATCAT GACGCTGGCA GACTTTAGTG GCGAATGGAG CTGGCTGCTG GTGTTGTTCG GCTTTCTGCT GGCGATAGCC AATAAGTTGC TGATGCGCCG ACCGACCTGG CTTATAGTGC GGCAGAAATT GCTGTTACGC ATCCCGATTA TGGGTTCACT GATGCGGGGA CAAAAACTCA CGCAGATCTT TACGATTCTG GCGCTGACAC AAAGTGCAGG CATTACTTTT TTACAGGGCG TAGAGAGCGT CAGAGAAACA ATGCGCTGCC CGTACTGGGT GCAACTTCTG ACACAAATCC AGCACGATAT CAGTAACGGT CAACCCATCT GGCTGGCGCT AAAAAATACC GGTGAGTTTA GCCCGCTCTG TTTGCAATTA GTGAGAACAG GAGAGGCATC CGGCTCTCTG GATCTCATGT TAGACAACCT CGCCCATCAT CATCGGGAAA ACACAATGGC GCTGGCGGAT AACCTCGCAG CCTTACTGGA ACCGGCGTTG CTGATCATAA CGGGAGGAAT TATTGGTACG CTGGTGGTGG CAATGTATCT GCCAATTTTC CATTTAGGCG ATGCGATGAG TGGGATGGGA TAA
|
Protein sequence | MASKQLWRWH GITGDGNAQD GMLWAESRTL LLMALQQQMV TPLSLKRIAI NSAQWRGDKS AEVIHQLATL LKAGLTLSEG LALLAEQHPS KQWQALLQSL AHDLEQGIAF SNALLPWSEV FPPLYQAMIR TGELTGKLDE CCFELARQQK AQRQLTDKVK SALRYPIIIL AMAIMVVVAM LHFVLPEFAA IYKTFNTPLP ALTQGIMTLA DFSGEWSWLL VLFGFLLAIA NKLLMRRPTW LIVRQKLLLR IPIMGSLMRG QKLTQIFTIL ALTQSAGITF LQGVESVRET MRCPYWVQLL TQIQHDISNG QPIWLALKNT GEFSPLCLQL VRTGEASGSL DLMLDNLAHH HRENTMALAD NLAALLEPAL LIITGGIIGT LVVAMYLPIF HLGDAMSGMG
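The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions: it uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons), it reads from the first base in frame, and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)

# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCGAGTA AGCAACTCTG GCGCTGGCAT"))  # MASKQLWRWH
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence, spaces included, should work the same way, since whitespace is stripped before translation.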