Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2178 |
Symbol | |
ID | 6147382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2184571 |
End bp | 2185641 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641617054 |
Product | putative fimbrial protein |
Protein accession | YP_001744228 |
Protein GI | 170683719 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0306197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.828503 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATAA TTTCAGGGGA TAGTAGTGTG TTAAGACTCC GGCTTCTTTT GACTGCTGTT TTAATGCTGT GGGGATTTCA TGCCGCCGCT TATACCGGGC AGTGTCATAC CACTCAGGGG AATCCGTATA TTGGCGTCAA TTTTGGCGTT AAAACCCTGG AGGAGGAAGA GAATACGGCT GGAAAAGTCA AAGATGAATT TTATCAGTGG AGTGAATCGA ATGATTATTA TGTTTCCTGC GATTGCGATA AAGACAATGT CAGAAGTGGT CTATGGGCAT TCGCTGCGGA TTCTCCGTTA GTTTATTTGG GCGACAACTG GTACAAAATT AACGATTATC TTGCCGCAAA GGTGCTATTA GGAGTGAAAG GTAACACACC TTCGGCCGTA CCTTTTGAAA ATCTCGGTAC AGGAAGTGAT ACCCGATGGC ATATTTGCGA TCCCGGTGGT CAACGTTTGG GCGGCAAGGG GGCAAGCGGC AATAGCGGAA GTTTTTCACT GAAAATATTG CAGCCGTTTG TCGGTTCGGT CGTCATTCCT CCTATGGCGC TGGCGCGACT ATATGAGTGT TACAACATAC CCGCAAGTGA TTCCTGCACG ACTACAGGTA CACCGGTTTT AGTGTATTAC CTGTCTGGTA CGATTAATTC ACTTGGCTCA TGTTCTGTCA ATTCTGGAGA GACAATAGAG GTTGATTTGG GGGATGTATT TGCTGCCAAT TTCCGTGTTG TAGGGCATAA ACCGCTTGGA TCCAGAACGG CAGAACTCTC AATTCCAGTC AGGTGTAATA CCGGAAACGC AGAGCTGGTT AATGTTAATT TGAGTCTGAC GGCAACCAGT GATCCCGATT ATCCTCAGGC AATCAAGACA TCACGACCAG GTGTCGGTGT TGTGGTCACC GACAGCCAAA ACAACATTAT TTCGCCTGCC GGTGGCACAC TGCCGCTATC CATCCCTGAC GACGCGGACA GCTTCGCTCG AATGAATGTT TATCCGGTCA GCACGACAGG TGTACCACCG GAAACCGGGA GCTTTGAAGC CACAGCAACG GTGAGAATAA ACTTTGACTG A
|
Protein sequence | MRIISGDSSV LRLRLLLTAV LMLWGFHAAA YTGQCHTTQG NPYIGVNFGV KTLEEEENTA GKVKDEFYQW SESNDYYVSC DCDKDNVRSG LWAFAADSPL VYLGDNWYKI NDYLAAKVLL GVKGNTPSAV PFENLGTGSD TRWHICDPGG QRLGGKGASG NSGSFSLKIL QPFVGSVVIP PMALARLYEC YNIPASDSCT TTGTPVLVYY LSGTINSLGS CSVNSGETIE VDLGDVFAAN FRVVGHKPLG SRTAELSIPV RCNTGNAELV NVNLSLTATS DPDYPQAIKT SRPGVGVVVT DSQNNIISPA GGTLPLSIPD DADSFARMNV YPVSTTGVPP ETGSFEATAT VRINFD
|
| |