Gene EcSMS35_0109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0109 
SymbolpilC 
ID6142966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp121038 
End bp122240 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content52% 
IMG OID641615010 
Producttype IV pilin biogenesis protein 
Protein accessionYP_001742226 
Protein GI170681709 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0111048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.534441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGTA AGCAACTCTG GCGCTGGCAT GGCATAACCG GCGACGGCAA TGCGCAAGAT 
GGGATGCTAT GGGCTGAGAG CCGTGCTTTG CTGCTCATAG CACTACAGCA ACAGATGGTT
ACCCCACTTA GCCTGAAGCG AATCGCCATC AATTCTGCGC AGTGGCGAGG AGATAAAAGC
GCGGAAGTCA TTCATCAACT GGCGACGCTA CTCAAAGCCG GGTTAACGCT TTCTGAAGGG
CTGGCACTGC TCGCGGAACA GCATCCCAGT AAGCAATGGC AAGCGTTGCT GCAATCGCTG
GCGTACGATC TCGAACAGGG CATTGCTTTT TCCAATGCCT TATTACCCTG GTCAGAGGTA
TTTCCGCCGC TCTATCAGGC GATGATCCGC ACGGGTGAAC TGACCGGTAA GCTGGATGAA
TGCTGCTTTG AACTGGCGCG TCAGCAAAAA GCCCAGCGTC AGTTGACCGA CAAAGTGAAA
TCAGCGTTAC GTTATCCCAT TATCATTTTA GCGATGGCAA TCATGGTGGT TGTGGCAATG
CTGCATTTTG TTCTACCGGA GTTTGCCGCT ATCTATAAGA TCTTTAACAC TCCGCTACCG
GCGCTAACGC AGGGGATCAT GACGCTGGCA AGTTTTAGTG GCGAATGGGG TTGGTTGCTG
GTGCTATTCG GCTTTCTGCT GGCGATAGCC AATAAGCTGC TGATGCGCCA CCCAACCAGG
CTTATCGCAC GGCAGAAATT GCTGTTACGC ATCCCGATTA TGGGTTCACT GATGCGGGGA
CAAAAACTCA CGCAGATCTT TACGATTCTG ACGCTGACAC AAAGTGCAGG CATTTCTTTT
TTACAGGGCG TTGAGAGCGT CAGAGAAACA ATGCGCTGCC CGTACTGGGT GCAACTTCTG
ACACAAATCC AGCACGATAT CAGTAACGGT CACCCCATCT GGCTGGCGCT AAAAAATGCC
GGTGAGTTTA GCCCGCTCTG TTTGCAATTA GTGAGAACAG GAGAGGCATC CGGCTCGCTG
GATCTCATGT TAGACAACCT CGCCCATCAT CATCGGGATA ACACAATGGC GCTGGCGGAT
AACCTCGCAG CCTTACTGGA ACCGGCGTTG CTGATCATAA CGGGAGGAAT TATCGGTACG
CTGGTGGTGG CAATGTATCT GCCAATTTTC CATTTAGGCG ATGCGATGAG TGGAATGGGA
TAA
 
Protein sequence
MASKQLWRWH GITGDGNAQD GMLWAESRAL LLIALQQQMV TPLSLKRIAI NSAQWRGDKS 
AEVIHQLATL LKAGLTLSEG LALLAEQHPS KQWQALLQSL AYDLEQGIAF SNALLPWSEV
FPPLYQAMIR TGELTGKLDE CCFELARQQK AQRQLTDKVK SALRYPIIIL AMAIMVVVAM
LHFVLPEFAA IYKIFNTPLP ALTQGIMTLA SFSGEWGWLL VLFGFLLAIA NKLLMRHPTR
LIARQKLLLR IPIMGSLMRG QKLTQIFTIL TLTQSAGISF LQGVESVRET MRCPYWVQLL
TQIQHDISNG HPIWLALKNA GEFSPLCLQL VRTGEASGSL DLMLDNLAHH HRDNTMALAD
NLAALLEPAL LIITGGIIGT LVVAMYLPIF HLGDAMSGMG