Gene EcSMS35_4094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4094 
SymbollpfC 
ID6142994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4187436 
End bp4189958 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content48% 
IMG OID641618918 
Productlong polar fimbrial operon protein LpfC 
Protein accessionYP_001746056 
Protein GI170684256 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.118889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACGA CCAGAATAAA GGTTGGCCTC ACGGCAGGGA CGTGTCTGAT TTTCTCGCAA 
AGCCTGATGG CCGAGGTCAG TGTATTCAAT CCGGCGCTTC TGGAAATCGA CCATCAATCC
GGAGTCGATA TTCGCCAGTT TAATCGGGCA AACCTGATGC CCCCAGGTGT TTATAGCGTT
GATATTTTTA TCAACGGTAA AATGTTTGAA CGTCAGGATG TGACCTTTGT TCAGGATAAT
CCCGATGCTG ATTTACACGC CTGCTTTGTC GCTATTAAAA AAACGCTAAC CACCTTTGGC
GTAAAAGTTG ATGCGCTGAA ATCGCTCAAT GATGTAGATG AAACAGTCTG TATTGATCCT
GGGCCACGCA TTGAAGGCTC ATCATGGCAG TTCGACAGCG ATAAACTGCA ACTGAATATC
TCCATTCCCC AAATCTATAT GGATGCGATG GCTTATGATT ACATCAGCCC CTCGCGTTGG
GATGAGGGGA TTAACGCGCT CACCATCAAC TACGATTTTT CTGGTTCACA TACTCTACGT
TCGGATTATG GTTCACAAGA GACAGATACC AGTTATCTCA ATCTGCGCAA TGGACTGAAT
ATTGGGCCGT GGCGATTACG CAATTACAGC ACCTTAAATA CGACCGATGG CAGTGCAGAA
TACAACTCTA TAAGCACCTG GATACAACGT GATATAGCTG CGTTAAGAAG CCAGATTATG
ATAGGTGATA CGTGGACGGC GAGCGATATC TTCGACAGTA CGCAAATTCG TGGCGCACGC
TTGTATACCG ATAACGATAT GTTGCCTGCC AGCCAAAACG GATTTGCCCC CGTAGTGCGT
GGGATTGCAA AGTCTAACGC CACTGTCATT ATTCGCCAGA ATGGCTACGT GATTTATCAG
TCAGCCGTTC CACAAGGTGC TTTTGAAATT ACCGACCTCA ACACCGCCAG TACAGGTGGT
GATCTGGACG TAACCATCAA AGAAGAAGAC GGTAGCGAAC AACGATTTAC TCAACCTTAT
GCGTCGTTAG CCATCCTCAA ACGTGAAGGT CAAACTGATG TTGATGTCAG TGTGGGAGAA
TTACGCGATG AAGATGGCTT TACCCCTGAC GTTATTCAGG CTCAGATCCT TCATGGTTTT
CCCTACGGTT TCACCTTGTA TGGAGGTATG CAGGCTGCTG AAAAGTATGG TTCTGCCGCT
TTGGGTGTCG GTAAAGATCT TGGCGCATTG GGCGCAATTT CTTTCGATGT GACACATGCT
CGCGCGAAAT TTAGCCATGA TGATACGGAA ACCGGTCAGT CTTATCGCTT CCTGTACTCG
AAACGCTTTG ACGATACGGA TACCAGCTTG CGCCTGGTTG GCTATCGTTA CTCCACCGAA
GGTTATTATA CCCTCAATGA ATGGGCGTCC CGGCGTAATA ATCCTGAAGA TTTCTGGGAA
ACCGGTAACC GCCGCAGCCG TGTGGAAGGA ACGTTAACGC AATCACTGGG CAGAGATTAT
GGCAACTTAT ATCTGACATT AAGTCGCCAG CAGTACTGGC ATACCGATGA TGTTGAACGA
TTAATGCAAT TTGGCTACAG CAGTAGCTGG AAGCGTCTCT CGTGGAACGT CTCCTGGAGT
TATTCCAATA CTGCCCGGCA GGGGACGGGG AACAACCATG CCAGTGATAA CACCAGTGAG
CAGATCTACA TGCTCTCTTT ATCTGTTCCT TTATCGGGCT GGTGGGGGAA TAGTTACGCC
ACCTATTCTG TTTCACAAAA CGATAATTCC GGTAGCTCAC ATCAACTTGG ACTCAGCGGT
ACGGCGCTGG AAAGAAATAA CCTTTCCTGG AATTTAATGC AGTCCTATAA CAGCCATGAT
GATGAGGTTG GCGGTAATAT GTCCCTGACC TATGATGGCA CTTATGGCAC GGTGAACGGC
AGCTATAACT ACAGCCAAAA TTCTCAGAGG CTGAATTATG GTATCAGAGG GGGAATTCTG
GCTCACAGCG AAGGCGTAAC GTTAAGCCAG GAGTTGGGTG AAACCATAGC TCTTGTTAAA
GCACCAGGGG CTGCCGGGTT AGAAATAGAC AATATGCGTG GTGCTGCGAC GGACTGGCGT
GGTTATACGG TCAAGACACA GCTAAACCCT TATGATGAAA ACCGGGTAGC AATCAGCGAT
AACTATTTCT CGAAGTCGAA TATAGAACTT GATAATACCG TCGTTACGAT GGTTCCCACC
CGTGGCGCGG TGGTTAAAGC CGAATTTGTT ACTCGTGTAG GTTATCGTGT GCTTTTCAGG
GTGGCGGGTA CAAAAGGTAA ACCCGCACCT TTTGGCGCTA TTGCTACAGT ACAAAATACA
AGCTCCGCTG ATTCAGGGAT TGTCGGTGAC CTGGGGGAGC TTTATCTCTC TGGCCTTCCT
GAAAAGGGGC AAGTAATGCT CTCCTGGGGG GAAAATGCCG CCACTACATG CACCTTCGAT
TATTCAATTT CAATACCAGA AAGTGAAAGC GGCTTAATTG AACAAGGTGT GACATGTCAT
TAA
 
Protein sequence
MMTTRIKVGL TAGTCLIFSQ SLMAEVSVFN PALLEIDHQS GVDIRQFNRA NLMPPGVYSV 
DIFINGKMFE RQDVTFVQDN PDADLHACFV AIKKTLTTFG VKVDALKSLN DVDETVCIDP
GPRIEGSSWQ FDSDKLQLNI SIPQIYMDAM AYDYISPSRW DEGINALTIN YDFSGSHTLR
SDYGSQETDT SYLNLRNGLN IGPWRLRNYS TLNTTDGSAE YNSISTWIQR DIAALRSQIM
IGDTWTASDI FDSTQIRGAR LYTDNDMLPA SQNGFAPVVR GIAKSNATVI IRQNGYVIYQ
SAVPQGAFEI TDLNTASTGG DLDVTIKEED GSEQRFTQPY ASLAILKREG QTDVDVSVGE
LRDEDGFTPD VIQAQILHGF PYGFTLYGGM QAAEKYGSAA LGVGKDLGAL GAISFDVTHA
RAKFSHDDTE TGQSYRFLYS KRFDDTDTSL RLVGYRYSTE GYYTLNEWAS RRNNPEDFWE
TGNRRSRVEG TLTQSLGRDY GNLYLTLSRQ QYWHTDDVER LMQFGYSSSW KRLSWNVSWS
YSNTARQGTG NNHASDNTSE QIYMLSLSVP LSGWWGNSYA TYSVSQNDNS GSSHQLGLSG
TALERNNLSW NLMQSYNSHD DEVGGNMSLT YDGTYGTVNG SYNYSQNSQR LNYGIRGGIL
AHSEGVTLSQ ELGETIALVK APGAAGLEID NMRGAATDWR GYTVKTQLNP YDENRVAISD
NYFSKSNIEL DNTVVTMVPT RGAVVKAEFV TRVGYRVLFR VAGTKGKPAP FGAIATVQNT
SSADSGIVGD LGELYLSGLP EKGQVMLSWG ENAATTCTFD YSISIPESES GLIEQGVTCH