Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2179 |
Symbol | |
ID | 6143011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2185632 |
End bp | 2188232 |
Gene Length | 2601 bp |
Protein Length | 866 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641617055 |
Product | outer membrane usher protein fimD-like protein |
Protein accession | YP_001744229 |
Protein GI | 170682116 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.932643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.834799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATAGAA CTCAACGACA ACACAGCCTG TTAAGCTCTG GTGGAGTGCC ATCGTTTATT GGTGGGCTGG TGGTGTTTGT GTCGGCAGCG TTCAATGCAC AAGCTGAAAC CTGGTTCGAT CCAGCCTTTT TCAAAGATGA TCCCTCAATG GTGGCTGATT TGTCTCGTTT CGAAAAAGGG CAAAAAATAA CGCCAGGGGT TTATCGAGTC GATATTGTTC TGAATCAGAC AATTGTAGAT ACGCGCAACG TCAATTTTGT TGAGATAACG CCAGAGAAGG GGATTGCCGC CTGTTTGACG ACTGAAAGCC TGGATGCAAT GGGCGTGAAT ACTGATGCGT TTCCGGCTTT TAAACAACTG GATAAACAAG CGTGTGCGCC ATTGGCGGAA ATTATTCCGG ATGCCAGCGT AACTTTTAAT GTGAATAAAC TCCGTCTGGA AATTTCAGTA CCGCAAATCG CCATCAAAAG TAACGCTCGT GGTTATGTCC CCCCTGAACG TTGGGATGAA GGGATCAACG CGCTATTACT GGGATATTCA TTTAGCGGGG CTAACAGTAT TCATAGCAGC GCAGGTAGTG ATTCTGGCGA CAGCTATTTT CTGAATTTAA ACAGTGGCGT TAATTTAGGC CCATGGAGAT TGCGCAACAA TTCAACATGG AGTCGCAGTA GTGGCCAAAC CGCAGAATGG AAGAATCTCA GCAGCTATTT GCAGCGGGCG GTGATTCCAT TAAAAGGCGA ACTGACCGTC GGTGATGATT ATACGGCAGG CGATTTTTTT GATAGCGTCA GCTTTCGTGG CGTGCAGCTG GCGTCAGATG ACAACATGCT GCCAGACAGC TTGAAAGGGT TTGCGCCAGT GGTGCGTGGT ATCGCCAAAA GCAATGCACA GGTAACGATT AAGCAAAATG GTTACACCAT CTATCAAACT TATGTTTCGC CTGGCGCTTT TGAGATTAGT GATCTCTACT CTACGTCGTC GAGTGGTGAT TTGTTGGTTG AAATCAAAGA AGCTGACGGT AGTGTCAATA GTTACAGTGT GCCCTTTTCC AGCGTGCCAT TACTCCAGCG TCAGGGACGC ATCAAATATG CGGTGACGCT GGCGAAATAC AGAACCAATA GTAATGACCA GCAAGAAAGT AAATTTGCTC AGGCCACGCT GCAATGGGGT GGGCCGAGGG GAACGACCTG GTATGGCGGA GGGCAATATG CTGAATATTA CCGTGCCGCT ATGTTTGGTC TGGGTTTTAA CCTCGGCGAT TTCGGAGCAA TTTCGTTCGA CGCGACCCAG GCTAAAAGTA CGCTGGCAGA CCAAAGTGAA CATAAAGGGC AGTCATATCG TTTTCTGTAT GCCAAAACGC TCAACCAATT GGGCACCAAC TTTCAATTGA TGGGCTACCG CTATTCGACG TCGGGTTTCT ACACCCTTTC CGACACTATG TATAAACACA TGGATGGCTA CGAATTTAAT GACGGTGATG ATGAAGATAC GCCAATGTGG TCGCGTTATT ACAATTTGTT TTACACAAAA CGTGGCAAAC TGCAGGTCAA CATCTCCCAG CAATTAGGCG AGTACGGTTC GTTTTATTTA AGCGGTAGCC AGCAAACTTA CTGGCATACC GATCAACAGG ATCGGCTATT ACAGTTTGGC TACAACACGC AAATTAAAGA TCTCTCGCTG GGGGTTTCCT GGAACTACAG TAAGTCCCGT GGTCAACCTG ACGCTGACCA GGTGTTTGCA CTTAATTTTT CCCTGCCGCT CAATCTGTTG CTCCCCAAAA GTAATGATAG CTATACCAGG AAAAAAAATT ACGCCTGGAT GACCTCTAAT ACCAGTATCG ATAACGAAGG GCACACTACA CAAAACCTGG GTTTAACGGA GACACTACTC GATGACGGTA ATCTGAGCTA CAGCGTGCAA CAGGGATATA ACAGCGAGGG GAAAACGGCT AATGGTAGCG CCAGCATGGA CTACAAAGGG GCGTTTGCAG ATGCTCGAGT GGGCTACAAC TACAGCGATA ACGGCAGTCA ACAACAACTG AACTACGCTC TTTCAGGCAG TTTAGTTGCC CATTCGCAGG GTATTACGTT GGGTCAATCA TTGGGTGAAA CTAATGTCCT GATTGCCGCG CCAGGCGCCG AAAATACTCG TGTGGCGAAC AGCACCGGGC TGAAAACTGA CTGGCGTGGA TATACCGTTG TGCCTTATGC CACTTCTTAT CGGGAAAATC GAATTGCACT TGATGCGGCG TCGTTAAAAC GTAACGTGGA TCTTGAAAAT GCCGTAGTAA ACGTGGTTCC CACCAAAGGG GCATTGGTTC TGGCGGAGTT CAATGCCCAT GCGGGGGCTA GGGTATTAAT GAAAACATCA AAGCAGGGTA TGCCGCTGAG ATTTGGTGCA ATGGCAACAC TGGATGGCGC ACAAACAATT AGCGGTATCA TTGATGATGA TGGTTCGCTC TATATGTCTG GTTTGCCGGC GAAGGGAACG ATAACTGTAC GCTGGGGCGA CGCTCCCGAT CAAATTTGTC ATATCAGTTA CGAGCTTACC GAACAACAAA TTAACGCTGC GATTACGCGG ATGGATTCAG TATGCGAATA A
|
Protein sequence | MYRTQRQHSL LSSGGVPSFI GGLVVFVSAA FNAQAETWFD PAFFKDDPSM VADLSRFEKG QKITPGVYRV DIVLNQTIVD TRNVNFVEIT PEKGIAACLT TESLDAMGVN TDAFPAFKQL DKQACAPLAE IIPDASVTFN VNKLRLEISV PQIAIKSNAR GYVPPERWDE GINALLLGYS FSGANSIHSS AGSDSGDSYF LNLNSGVNLG PWRLRNNSTW SRSSGQTAEW KNLSSYLQRA VIPLKGELTV GDDYTAGDFF DSVSFRGVQL ASDDNMLPDS LKGFAPVVRG IAKSNAQVTI KQNGYTIYQT YVSPGAFEIS DLYSTSSSGD LLVEIKEADG SVNSYSVPFS SVPLLQRQGR IKYAVTLAKY RTNSNDQQES KFAQATLQWG GPRGTTWYGG GQYAEYYRAA MFGLGFNLGD FGAISFDATQ AKSTLADQSE HKGQSYRFLY AKTLNQLGTN FQLMGYRYST SGFYTLSDTM YKHMDGYEFN DGDDEDTPMW SRYYNLFYTK RGKLQVNISQ QLGEYGSFYL SGSQQTYWHT DQQDRLLQFG YNTQIKDLSL GVSWNYSKSR GQPDADQVFA LNFSLPLNLL LPKSNDSYTR KKNYAWMTSN TSIDNEGHTT QNLGLTETLL DDGNLSYSVQ QGYNSEGKTA NGSASMDYKG AFADARVGYN YSDNGSQQQL NYALSGSLVA HSQGITLGQS LGETNVLIAA PGAENTRVAN STGLKTDWRG YTVVPYATSY RENRIALDAA SLKRNVDLEN AVVNVVPTKG ALVLAEFNAH AGARVLMKTS KQGMPLRFGA MATLDGAQTI SGIIDDDGSL YMSGLPAKGT ITVRWGDAPD QICHISYELT EQQINAAITR MDSVCE
|
| |