Gene EcolC_0554 details


Gene Information

Locus tag: EcolC_0554
Symbol:
ID: 6064985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 597029
End bp: 599545
Gene Length: 2517 bp
Protein Length: 838 aa
Translation table: 11
GC content: 48%
IMG OID: 641599961
Product: fimbrial biogenesis outer membrane usher protein
Protein accession: YP_001723558
Protein GI: 170018604
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
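
The coordinates and lengths in the Gene Information table are mutually consistent. A minimal Python sketch of the check follows. It assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 597029
end_bp = 599545

# Assumption: Start and End are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not
# counted as a residue in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 2517 838
```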


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0981109
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACAAC GACACCACCA GGGACATAAA CGCACACCGA AACAGTTGGC GCTCATTATC 
AAACGCTGTT TGCCGATGGT GCTCACTGGC AGCGGCATGC TTTGCACTAC CGCTAACGCC
GAAGAGTATT ATTTCGACCC CATTATGCTG GAAACCACAA AAAGTGGTAT GCAAACAACC
GATCTGTCAC GATTTTCAAA GAAATACGCA CAACTACCAG GAACTTATCA GGTTGATATC
TGGCTGAATA AAAAGAAGGT TTCACAGAAA AAAATTACAT TTACCGCCAA TGCAGAGCAA
CTTCTGCAGC CACAGTTTAC GGTAGAACAA CTACGTGAGC TGGGTATTAA GGTGGATGAA
ATCCCGGCGC TGGCTGAAAA AGATGACGAT AGCGTGATCA ACTCGCTTGA ACAAATCATT
CCCGGTACAG CTGCTGAATT TGATTTCAAT CATCAGCGAC TTAATTTGAG CATTCCCCAA
ATTGCACTGT ACCGTGATGC AAGAGGTTAC GTCTCCCCTT CTCGTTGGGA CGATGGTATA
CCAACGCTGT TTACCAACTA CTCGTTTACA GGTTCTGATA ACCGTTACCG CCAGGGCAAT
CGTAGCCAAC GACAGTACCT GAATATGCAA AATGGTGCTA ATTTTGGCCC CTGGCGATTA
CGTAACTATT CTACGTGGAC ACGCAACGAT CAGGCGTCAA GCTGGAACAC TATCAGTAGT
TATTTACAAC GTGATATCAA GGCGTTGAAG TCTCAGTTGC TTCTGGGAGA AAGCGCCACC
AGCGGCAGTA TTTTTTCCAG CTACACCTTT ACTGGCGTGC AACTCGCTTC CGACGATAAT
ATGTTGCCAA ACAGCCAGCG CGGATTTGCC CCAACGGTAC GCGGTATCGC AAACAGTAGT
GCAATCGTGA CTATCAGGCA AAATGGTTAT GTGATCTATC AAAGCAACGT GCCAGCGGGT
GCCTTTGAAA TTAACGATCT CTACCCCTCT TCCAACAGCG GCGATTTAGA AGTCACGATT
GAAGAAAGTG ACGGTACACA ACGTCGCTTT ATCCAGCCTT ATTCTTCATT ACCCATGATG
CAGCGACCTG GGCATCTAAA GTATAGCGCG ACCGCTGGAC GCTATCGCGC TGATGCAAAC
AGTGATAGCA AGGAACCCGA ATTTGCTGAA GCCACGGCAA TATATGGTTT GAATAATACT
TTTACGCTGT ATGGCGGCCT GCTCGGTTCT GAAGATTATT ATGCGCTGGG GATCGGTATC
GGCGGCACAC TTGGCGCACT GGGCGCGTTG TCGATGGATA TCAACAGAGC TGACACCCAA
TTCGATAACC AGCACTCTTT TCATGGCTAT CAATGGCGTA CGCAGTACAT CAAAGATATC
CCGGAAACCA ACACCAATAT CGCTGTCAGC TACTATCGCT ATACCAACGA TGGCTATTTT
AGTTTTGATG AAGCCAATAC CCGTAATTGG AACTATAACA GTCGCCAAAA AAGTGAAATT
CAATTCAACA TCAGCCAGAC AATATTTGAT GGGGTAAGTC TGTATGCCTC CGGTTCGCAG
CAAGACTATT GGGGCAATAA CGATAAAAAC AGGAATATCT CTGTTGGGGT TTCCGGCCAG
CAATGGGGAG TTGGTTACAG CCTGAATTAT CAATACAGCC GCTACACTGA TCAAAATAAT
GACCGCGCAC TCTCTTTGAA TCTCAGTATT CCGTTAGAAC GCTGGTTACC GCGTAGCCGG
GTTTCCTATC AGATGACCAG CCAGAAAGAT CGCCCAACCC AACATGAAAT GCGTCTTGAT
GGTTCACTGC TGGATGATGG TCGCCTGAGC TATAGCCTGG AACAAAGTCT GGATGACGAT
AACAACCATA ACAGTAGCGT GAACGCCAGT TACCGTTCAC CTTATGGAAC CTTCAGTGCC
GGATACAGTT ACGGTAATGA CAGTAGCCAA TACAATTACG GCGTTACCGG CGGCGTGGTT
ATCCATCCTC ATGGTGTGAC GCTCTCGCAA TATCTGGGCA ACGCTTTTGC GCTTATTGAT
GCTAACGGGG CATCTGGCGT GAGGATACAA AACTATCCGG GGATTGCTAC TGATCCCTTT
GGCTATGCAG TGGTTCCTTA TCTCACGACT TATCAGGAAA ACCGTCTCTC GGTAGATACT
ACGCAGCTGC CCGATAACGT CGATCTTGAG CAAACAACAC AGTTTGTGGT GCCCAACAGA
GGTGCAATGG TAGCGGCGCG TTTCAACGCC AATATCGGTT ATCGCGTACT TGTTACAGTC
AGCGATCGCA ACGGTAAACC GTTGCCCTTT GGCGCTCTTG CCAGCAACGA TGAGACGGGG
CAACAAAGTA TCGTCGATGA GGGCGGCATA CTATATCTCT CTGGGATATC GAGTAAATCA
CAAAGCTGGA CTGTACGCTG GGGAAATCAG GCAGATCAAC AATGTCAGTT TGCTTTTAGT
ACACCGGATT CAGAACCAAC AACCTCTGTA TTACAAGGCA CGGCGCAGTG CCATTAA
 
Protein sequence
MPQRHHQGHK RTPKQLALII KRCLPMVLTG SGMLCTTANA EEYYFDPIML ETTKSGMQTT 
DLSRFSKKYA QLPGTYQVDI WLNKKKVSQK KITFTANAEQ LLQPQFTVEQ LRELGIKVDE
IPALAEKDDD SVINSLEQII PGTAAEFDFN HQRLNLSIPQ IALYRDARGY VSPSRWDDGI
PTLFTNYSFT GSDNRYRQGN RSQRQYLNMQ NGANFGPWRL RNYSTWTRND QASSWNTISS
YLQRDIKALK SQLLLGESAT SGSIFSSYTF TGVQLASDDN MLPNSQRGFA PTVRGIANSS
AIVTIRQNGY VIYQSNVPAG AFEINDLYPS SNSGDLEVTI EESDGTQRRF IQPYSSLPMM
QRPGHLKYSA TAGRYRADAN SDSKEPEFAE ATAIYGLNNT FTLYGGLLGS EDYYALGIGI
GGTLGALGAL SMDINRADTQ FDNQHSFHGY QWRTQYIKDI PETNTNIAVS YYRYTNDGYF
SFDEANTRNW NYNSRQKSEI QFNISQTIFD GVSLYASGSQ QDYWGNNDKN RNISVGVSGQ
QWGVGYSLNY QYSRYTDQNN DRALSLNLSI PLERWLPRSR VSYQMTSQKD RPTQHEMRLD
GSLLDDGRLS YSLEQSLDDD NNHNSSVNAS YRSPYGTFSA GYSYGNDSSQ YNYGVTGGVV
IHPHGVTLSQ YLGNAFALID ANGASGVRIQ NYPGIATDPF GYAVVPYLTT YQENRLSVDT
TQLPDNVDLE QTTQFVVPNR GAMVAARFNA NIGYRVLVTV SDRNGKPLPF GALASNDETG
QQSIVDEGGI LYLSGISSKS QSWTVRWGNQ ADQQCQFAFS TPDSEPTTSV LQGTAQCH
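
Both sequences are printed in space-separated blocks of ten residues, so the grouping has to be removed before any analysis. The Python sketch below shows one way to do that, then computes GC content and translates the start of the gene sequence. The function names are hypothetical. The codon table is built from the standard amino acid assignments, on the assumption that translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Codon table in TCAG order; amino acid assignments of the standard code
# (assumed here to match translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGCCACAAC GACACCACCA GGGACATAAA")
print(translate(fragment))  # MPQRHHQGHK
```

The translated fragment matches the first ten residues of the protein sequence in the record. Pasting the full gene sequence into `clean_sequence` would allow the same comparison for the whole protein, as well as a check of the reported GC content.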