Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4094 |
Symbol | lpfC |
ID | 6142994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4187436 |
End bp | 4189958 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641618918 |
Product | long polar fimbrial operon protein LpfC |
Protein accession | YP_001746056 |
Protein GI | 170684256 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.118889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACGA CCAGAATAAA GGTTGGCCTC ACGGCAGGGA CGTGTCTGAT TTTCTCGCAA AGCCTGATGG CCGAGGTCAG TGTATTCAAT CCGGCGCTTC TGGAAATCGA CCATCAATCC GGAGTCGATA TTCGCCAGTT TAATCGGGCA AACCTGATGC CCCCAGGTGT TTATAGCGTT GATATTTTTA TCAACGGTAA AATGTTTGAA CGTCAGGATG TGACCTTTGT TCAGGATAAT CCCGATGCTG ATTTACACGC CTGCTTTGTC GCTATTAAAA AAACGCTAAC CACCTTTGGC GTAAAAGTTG ATGCGCTGAA ATCGCTCAAT GATGTAGATG AAACAGTCTG TATTGATCCT GGGCCACGCA TTGAAGGCTC ATCATGGCAG TTCGACAGCG ATAAACTGCA ACTGAATATC TCCATTCCCC AAATCTATAT GGATGCGATG GCTTATGATT ACATCAGCCC CTCGCGTTGG GATGAGGGGA TTAACGCGCT CACCATCAAC TACGATTTTT CTGGTTCACA TACTCTACGT TCGGATTATG GTTCACAAGA GACAGATACC AGTTATCTCA ATCTGCGCAA TGGACTGAAT ATTGGGCCGT GGCGATTACG CAATTACAGC ACCTTAAATA CGACCGATGG CAGTGCAGAA TACAACTCTA TAAGCACCTG GATACAACGT GATATAGCTG CGTTAAGAAG CCAGATTATG ATAGGTGATA CGTGGACGGC GAGCGATATC TTCGACAGTA CGCAAATTCG TGGCGCACGC TTGTATACCG ATAACGATAT GTTGCCTGCC AGCCAAAACG GATTTGCCCC CGTAGTGCGT GGGATTGCAA AGTCTAACGC CACTGTCATT ATTCGCCAGA ATGGCTACGT GATTTATCAG TCAGCCGTTC CACAAGGTGC TTTTGAAATT ACCGACCTCA ACACCGCCAG TACAGGTGGT GATCTGGACG TAACCATCAA AGAAGAAGAC GGTAGCGAAC AACGATTTAC TCAACCTTAT GCGTCGTTAG CCATCCTCAA ACGTGAAGGT CAAACTGATG TTGATGTCAG TGTGGGAGAA TTACGCGATG AAGATGGCTT TACCCCTGAC GTTATTCAGG CTCAGATCCT TCATGGTTTT CCCTACGGTT TCACCTTGTA TGGAGGTATG CAGGCTGCTG AAAAGTATGG TTCTGCCGCT TTGGGTGTCG GTAAAGATCT TGGCGCATTG GGCGCAATTT CTTTCGATGT GACACATGCT CGCGCGAAAT TTAGCCATGA TGATACGGAA ACCGGTCAGT CTTATCGCTT CCTGTACTCG AAACGCTTTG ACGATACGGA TACCAGCTTG CGCCTGGTTG GCTATCGTTA CTCCACCGAA GGTTATTATA CCCTCAATGA ATGGGCGTCC CGGCGTAATA ATCCTGAAGA TTTCTGGGAA ACCGGTAACC GCCGCAGCCG TGTGGAAGGA ACGTTAACGC AATCACTGGG CAGAGATTAT GGCAACTTAT ATCTGACATT AAGTCGCCAG CAGTACTGGC ATACCGATGA TGTTGAACGA TTAATGCAAT TTGGCTACAG CAGTAGCTGG AAGCGTCTCT CGTGGAACGT CTCCTGGAGT TATTCCAATA CTGCCCGGCA GGGGACGGGG AACAACCATG CCAGTGATAA CACCAGTGAG CAGATCTACA TGCTCTCTTT ATCTGTTCCT TTATCGGGCT GGTGGGGGAA TAGTTACGCC ACCTATTCTG TTTCACAAAA CGATAATTCC GGTAGCTCAC ATCAACTTGG ACTCAGCGGT ACGGCGCTGG AAAGAAATAA CCTTTCCTGG AATTTAATGC AGTCCTATAA CAGCCATGAT GATGAGGTTG GCGGTAATAT GTCCCTGACC TATGATGGCA CTTATGGCAC GGTGAACGGC AGCTATAACT ACAGCCAAAA TTCTCAGAGG CTGAATTATG GTATCAGAGG GGGAATTCTG GCTCACAGCG AAGGCGTAAC GTTAAGCCAG GAGTTGGGTG AAACCATAGC TCTTGTTAAA GCACCAGGGG CTGCCGGGTT AGAAATAGAC AATATGCGTG GTGCTGCGAC GGACTGGCGT GGTTATACGG TCAAGACACA GCTAAACCCT TATGATGAAA ACCGGGTAGC AATCAGCGAT AACTATTTCT CGAAGTCGAA TATAGAACTT GATAATACCG TCGTTACGAT GGTTCCCACC CGTGGCGCGG TGGTTAAAGC CGAATTTGTT ACTCGTGTAG GTTATCGTGT GCTTTTCAGG GTGGCGGGTA CAAAAGGTAA ACCCGCACCT TTTGGCGCTA TTGCTACAGT ACAAAATACA AGCTCCGCTG ATTCAGGGAT TGTCGGTGAC CTGGGGGAGC TTTATCTCTC TGGCCTTCCT GAAAAGGGGC AAGTAATGCT CTCCTGGGGG GAAAATGCCG CCACTACATG CACCTTCGAT TATTCAATTT CAATACCAGA AAGTGAAAGC GGCTTAATTG AACAAGGTGT GACATGTCAT TAA
|
Protein sequence | MMTTRIKVGL TAGTCLIFSQ SLMAEVSVFN PALLEIDHQS GVDIRQFNRA NLMPPGVYSV DIFINGKMFE RQDVTFVQDN PDADLHACFV AIKKTLTTFG VKVDALKSLN DVDETVCIDP GPRIEGSSWQ FDSDKLQLNI SIPQIYMDAM AYDYISPSRW DEGINALTIN YDFSGSHTLR SDYGSQETDT SYLNLRNGLN IGPWRLRNYS TLNTTDGSAE YNSISTWIQR DIAALRSQIM IGDTWTASDI FDSTQIRGAR LYTDNDMLPA SQNGFAPVVR GIAKSNATVI IRQNGYVIYQ SAVPQGAFEI TDLNTASTGG DLDVTIKEED GSEQRFTQPY ASLAILKREG QTDVDVSVGE LRDEDGFTPD VIQAQILHGF PYGFTLYGGM QAAEKYGSAA LGVGKDLGAL GAISFDVTHA RAKFSHDDTE TGQSYRFLYS KRFDDTDTSL RLVGYRYSTE GYYTLNEWAS RRNNPEDFWE TGNRRSRVEG TLTQSLGRDY GNLYLTLSRQ QYWHTDDVER LMQFGYSSSW KRLSWNVSWS YSNTARQGTG NNHASDNTSE QIYMLSLSVP LSGWWGNSYA TYSVSQNDNS GSSHQLGLSG TALERNNLSW NLMQSYNSHD DEVGGNMSLT YDGTYGTVNG SYNYSQNSQR LNYGIRGGIL AHSEGVTLSQ ELGETIALVK APGAAGLEID NMRGAATDWR GYTVKTQLNP YDENRVAISD NYFSKSNIEL DNTVVTMVPT RGAVVKAEFV TRVGYRVLFR VAGTKGKPAP FGAIATVQNT SSADSGIVGD LGELYLSGLP EKGQVMLSWG ENAATTCTFD YSISIPESES GLIEQGVTCH
|
| |