Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0934 |
Symbol | |
ID | 6142802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 950111 |
End bp | 952591 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641615821 |
Product | fimbrial usher protein |
Protein accession | YP_001743013 |
Protein GI | 170684280 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAGAA TGACCCCACT TGCATCAGCA ATCGTAGCGT TATTGCTCGG CATTGAAGCT TATGCAGCTG AAGAAACCTT TGATACCCAT TTTATGATAG GTGGAATGAA AGACCAGCAG GTTGCAAATA TTCGTCTTGA TGATAATCAA CCCTTGCCGG GGCAGTATGA CATCGATATT TATGTCAATA AGCAATGGCG CGGGAAATAT GAGATTATTG TTAAAGACAA CCCGCAAGAA ACATGTTTAT CAAGGGAAAT TATCAAGCGG TTAGGCATTA ATACCGATAA CTTTGCCAGC GGTAAGCAAT GTTTAACATT TGAGCAACTT GTTCAGGGTG GGAGCTATAC CTGGGATATC GGAGTTTTTC GTCTCGATTT CAGTGTCCCG CAGGCGTGGG TGGAAGAACT GGAAAGTGGC TATGTTCCAC CGGAAAACTG GGAGCGGGGT ATTAATGCGT TTTATACCTC TTATTATGTG AGTCAGTATT ACAGCGACTA TAAAGCGTCG GGTAATAGCA AGAGTACATA TGTACGTTTT AACAGCGGGT TAAACTTACT GGGGTGGCAA CTGCATTCTG ATGCCAGTTT CAGTAAAACA AATAGCAATC CAGGGGTGTG GAAAAGCAAT ACCCTGTATC TGGAACGTGG ATTTGCCCAA CTTCTCGGCA CGCTTCGCGT GGGTGATATG TACACATCAA GCGATATTTT TGATTCTGTT CGCTTCAGCG GTGTGCGGTT GTTTCGTGAT ATGCAGATGT TGCCTAACTC GAAACAGAAT TTTACGCCAC GGGTGCAGGG GATTGCTCAG AGTAACGCGC TGGTAACTAT TGAACAGAAC GGTTTTGTGG TTTATCAGAA AGAGGTTCCT CCTGGCCCGT TTGCGATTAC AGATTTGCAG TTGGCCGGTG GTGGAGCAGA TCTTGATGTC AGCGTGAAAG AGGCGGACGG TTCGGTAACC ACCTATCTGG TGCCTTATGC AGCGGTGCCG AATATGCTGC AACCCGGCGT GTCGAAATAT GATTTTGCGG CAGGTCGTAG CCATATTGAA GGGGCGAGCA AACAAAGTGA TTTTGTTCAG GCGGGTTATC AGTATGGTTT TAATAACTTA TTGACGCTGT ATGGCGGATC GATGGTCGCG AATAATTATT ACGCGTTCAC TTTGGGGACT GGCTGGAATA CACGCATTGG TGCCATTTCC GTCGATGCCA CGAAGTCGCA TAGTAAACAA GACAACGGCG ATGTGTTTGA CGGGCAAAGT TATCAAATTG CCTACAACAA ATTTGTGAGC CAAACGTCGA CGCGTTTTGG TCTGGCGGCC TGGCGTTATT CGTCGCGTGA TTATCGGACA TTTAATGATC ACGTATGGGC AAACAATAAA GATAATTATC GCCGTGATGA AAACGATATC TATGACATTG CCGATTATTA CCAGAACGAT TTTGGCCGTA AAAATAGCTT CTCTGCCAAT ATGAGTCAGT CATTGCCAGA AGGCTGGGGT TCTGTGTCCT TAAGTACGTT ATGGCGAGAT TACTGGGGGC GTAGCGGCAG CAGTAAGGAT TATCAGTTGA GTTATTCCAA CAACCTGCGA AGGATAAGCT ATACCCTCGC GGCAAGCCAT GCTTATGACG AGAATCATCA TGAAGAGAAA CGTTTTAATA TTTTTATATC GATTCCCTTT GATTGGGGTG ATGACGTTAC GACGCCTCGT CGGCAAATAT ATATGTCTAA CTCAACGACG TTTGATGATC AGGGGGTTGC CTCAAATAAT ACGGGATTAT CAGGAACCGT TGGAAGCCGG GATCAGTTTA ACTATGGGGT CAACCTGAGT TATCAGTATC AGGGAAATGA AACGACAGCT GGGGCGAATT TAACCTGGAA CGCGCCGGTT GCGACAGTGA ATGGCAGTTA TAGTCAGTCG AGTGCTTATC GACAGGCTGG AGCCAGTGTT TCAGGGGGCA TTGTCGCCTG GTCGGGTGGC GTTAATCTGG CAAACCGTCT TTCTGAAACG TTTGCTGTGA TGAATGCGCC AGGAATTAAA GATGCTTATG TCAATGGGCA AAAATATCGC ACAACAAACC GTAATGGAGT GGTGGTATAC GACGGAATGA CACCTTATCG GGAAAATTAC CTGATGTTGG ATGTGTCACA AAGCGATAGC GAAGCAGAAT TACGTGGCAA CCGGAAAATT GCCGCCCCTT ATCGCGGCGC GGTTGTACTG GTTAATTTTG ATACCGATCA GCGCAAGCCA TGGTTTATAA AAGCGTTAAG AGCGGATGGG CAACCATTAA CGTTTGGTTA TGAAGTCAAT GATATCCATG GTCATAATAT TGGTGTTGTC GGCCAGGGAA GCCAGTTATT TATTCGCACC AATGAAGTAC CGCCATCGGT TAATGTAGCA ATTGATAAGC AACAAGGACT TTCATGCACA ATCACCTTCG GTAAAGAGAT TGATGAAAGT AGAAATTATA TTTGCCAGTA A
|
Protein sequence | MLRMTPLASA IVALLLGIEA YAAEETFDTH FMIGGMKDQQ VANIRLDDNQ PLPGQYDIDI YVNKQWRGKY EIIVKDNPQE TCLSREIIKR LGINTDNFAS GKQCLTFEQL VQGGSYTWDI GVFRLDFSVP QAWVEELESG YVPPENWERG INAFYTSYYV SQYYSDYKAS GNSKSTYVRF NSGLNLLGWQ LHSDASFSKT NSNPGVWKSN TLYLERGFAQ LLGTLRVGDM YTSSDIFDSV RFSGVRLFRD MQMLPNSKQN FTPRVQGIAQ SNALVTIEQN GFVVYQKEVP PGPFAITDLQ LAGGGADLDV SVKEADGSVT TYLVPYAAVP NMLQPGVSKY DFAAGRSHIE GASKQSDFVQ AGYQYGFNNL LTLYGGSMVA NNYYAFTLGT GWNTRIGAIS VDATKSHSKQ DNGDVFDGQS YQIAYNKFVS QTSTRFGLAA WRYSSRDYRT FNDHVWANNK DNYRRDENDI YDIADYYQND FGRKNSFSAN MSQSLPEGWG SVSLSTLWRD YWGRSGSSKD YQLSYSNNLR RISYTLAASH AYDENHHEEK RFNIFISIPF DWGDDVTTPR RQIYMSNSTT FDDQGVASNN TGLSGTVGSR DQFNYGVNLS YQYQGNETTA GANLTWNAPV ATVNGSYSQS SAYRQAGASV SGGIVAWSGG VNLANRLSET FAVMNAPGIK DAYVNGQKYR TTNRNGVVVY DGMTPYRENY LMLDVSQSDS EAELRGNRKI AAPYRGAVVL VNFDTDQRKP WFIKALRADG QPLTFGYEVN DIHGHNIGVV GQGSQLFIRT NEVPPSVNVA IDKQQGLSCT ITFGKEIDES RNYICQ
|
| |