Gene EcSMS35_0934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0934 
Symbol 
ID6142802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp950111 
End bp952591 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content45% 
IMG OID641615821 
Productfimbrial usher protein 
Protein accessionYP_001743013 
Protein GI170684280 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAGAA TGACCCCACT TGCATCAGCA ATCGTAGCGT TATTGCTCGG CATTGAAGCT 
TATGCAGCTG AAGAAACCTT TGATACCCAT TTTATGATAG GTGGAATGAA AGACCAGCAG
GTTGCAAATA TTCGTCTTGA TGATAATCAA CCCTTGCCGG GGCAGTATGA CATCGATATT
TATGTCAATA AGCAATGGCG CGGGAAATAT GAGATTATTG TTAAAGACAA CCCGCAAGAA
ACATGTTTAT CAAGGGAAAT TATCAAGCGG TTAGGCATTA ATACCGATAA CTTTGCCAGC
GGTAAGCAAT GTTTAACATT TGAGCAACTT GTTCAGGGTG GGAGCTATAC CTGGGATATC
GGAGTTTTTC GTCTCGATTT CAGTGTCCCG CAGGCGTGGG TGGAAGAACT GGAAAGTGGC
TATGTTCCAC CGGAAAACTG GGAGCGGGGT ATTAATGCGT TTTATACCTC TTATTATGTG
AGTCAGTATT ACAGCGACTA TAAAGCGTCG GGTAATAGCA AGAGTACATA TGTACGTTTT
AACAGCGGGT TAAACTTACT GGGGTGGCAA CTGCATTCTG ATGCCAGTTT CAGTAAAACA
AATAGCAATC CAGGGGTGTG GAAAAGCAAT ACCCTGTATC TGGAACGTGG ATTTGCCCAA
CTTCTCGGCA CGCTTCGCGT GGGTGATATG TACACATCAA GCGATATTTT TGATTCTGTT
CGCTTCAGCG GTGTGCGGTT GTTTCGTGAT ATGCAGATGT TGCCTAACTC GAAACAGAAT
TTTACGCCAC GGGTGCAGGG GATTGCTCAG AGTAACGCGC TGGTAACTAT TGAACAGAAC
GGTTTTGTGG TTTATCAGAA AGAGGTTCCT CCTGGCCCGT TTGCGATTAC AGATTTGCAG
TTGGCCGGTG GTGGAGCAGA TCTTGATGTC AGCGTGAAAG AGGCGGACGG TTCGGTAACC
ACCTATCTGG TGCCTTATGC AGCGGTGCCG AATATGCTGC AACCCGGCGT GTCGAAATAT
GATTTTGCGG CAGGTCGTAG CCATATTGAA GGGGCGAGCA AACAAAGTGA TTTTGTTCAG
GCGGGTTATC AGTATGGTTT TAATAACTTA TTGACGCTGT ATGGCGGATC GATGGTCGCG
AATAATTATT ACGCGTTCAC TTTGGGGACT GGCTGGAATA CACGCATTGG TGCCATTTCC
GTCGATGCCA CGAAGTCGCA TAGTAAACAA GACAACGGCG ATGTGTTTGA CGGGCAAAGT
TATCAAATTG CCTACAACAA ATTTGTGAGC CAAACGTCGA CGCGTTTTGG TCTGGCGGCC
TGGCGTTATT CGTCGCGTGA TTATCGGACA TTTAATGATC ACGTATGGGC AAACAATAAA
GATAATTATC GCCGTGATGA AAACGATATC TATGACATTG CCGATTATTA CCAGAACGAT
TTTGGCCGTA AAAATAGCTT CTCTGCCAAT ATGAGTCAGT CATTGCCAGA AGGCTGGGGT
TCTGTGTCCT TAAGTACGTT ATGGCGAGAT TACTGGGGGC GTAGCGGCAG CAGTAAGGAT
TATCAGTTGA GTTATTCCAA CAACCTGCGA AGGATAAGCT ATACCCTCGC GGCAAGCCAT
GCTTATGACG AGAATCATCA TGAAGAGAAA CGTTTTAATA TTTTTATATC GATTCCCTTT
GATTGGGGTG ATGACGTTAC GACGCCTCGT CGGCAAATAT ATATGTCTAA CTCAACGACG
TTTGATGATC AGGGGGTTGC CTCAAATAAT ACGGGATTAT CAGGAACCGT TGGAAGCCGG
GATCAGTTTA ACTATGGGGT CAACCTGAGT TATCAGTATC AGGGAAATGA AACGACAGCT
GGGGCGAATT TAACCTGGAA CGCGCCGGTT GCGACAGTGA ATGGCAGTTA TAGTCAGTCG
AGTGCTTATC GACAGGCTGG AGCCAGTGTT TCAGGGGGCA TTGTCGCCTG GTCGGGTGGC
GTTAATCTGG CAAACCGTCT TTCTGAAACG TTTGCTGTGA TGAATGCGCC AGGAATTAAA
GATGCTTATG TCAATGGGCA AAAATATCGC ACAACAAACC GTAATGGAGT GGTGGTATAC
GACGGAATGA CACCTTATCG GGAAAATTAC CTGATGTTGG ATGTGTCACA AAGCGATAGC
GAAGCAGAAT TACGTGGCAA CCGGAAAATT GCCGCCCCTT ATCGCGGCGC GGTTGTACTG
GTTAATTTTG ATACCGATCA GCGCAAGCCA TGGTTTATAA AAGCGTTAAG AGCGGATGGG
CAACCATTAA CGTTTGGTTA TGAAGTCAAT GATATCCATG GTCATAATAT TGGTGTTGTC
GGCCAGGGAA GCCAGTTATT TATTCGCACC AATGAAGTAC CGCCATCGGT TAATGTAGCA
ATTGATAAGC AACAAGGACT TTCATGCACA ATCACCTTCG GTAAAGAGAT TGATGAAAGT
AGAAATTATA TTTGCCAGTA A
 
Protein sequence
MLRMTPLASA IVALLLGIEA YAAEETFDTH FMIGGMKDQQ VANIRLDDNQ PLPGQYDIDI 
YVNKQWRGKY EIIVKDNPQE TCLSREIIKR LGINTDNFAS GKQCLTFEQL VQGGSYTWDI
GVFRLDFSVP QAWVEELESG YVPPENWERG INAFYTSYYV SQYYSDYKAS GNSKSTYVRF
NSGLNLLGWQ LHSDASFSKT NSNPGVWKSN TLYLERGFAQ LLGTLRVGDM YTSSDIFDSV
RFSGVRLFRD MQMLPNSKQN FTPRVQGIAQ SNALVTIEQN GFVVYQKEVP PGPFAITDLQ
LAGGGADLDV SVKEADGSVT TYLVPYAAVP NMLQPGVSKY DFAAGRSHIE GASKQSDFVQ
AGYQYGFNNL LTLYGGSMVA NNYYAFTLGT GWNTRIGAIS VDATKSHSKQ DNGDVFDGQS
YQIAYNKFVS QTSTRFGLAA WRYSSRDYRT FNDHVWANNK DNYRRDENDI YDIADYYQND
FGRKNSFSAN MSQSLPEGWG SVSLSTLWRD YWGRSGSSKD YQLSYSNNLR RISYTLAASH
AYDENHHEEK RFNIFISIPF DWGDDVTTPR RQIYMSNSTT FDDQGVASNN TGLSGTVGSR
DQFNYGVNLS YQYQGNETTA GANLTWNAPV ATVNGSYSQS SAYRQAGASV SGGIVAWSGG
VNLANRLSET FAVMNAPGIK DAYVNGQKYR TTNRNGVVVY DGMTPYRENY LMLDVSQSDS
EAELRGNRKI AAPYRGAVVL VNFDTDQRKP WFIKALRADG QPLTFGYEVN DIHGHNIGVV
GQGSQLFIRT NEVPPSVNVA IDKQQGLSCT ITFGKEIDES RNYICQ