Gene EcolC_0605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0605 
Symbol 
ID6067677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp645539 
End bp648223 
Gene Length2685 bp 
Protein Length894 aa 
Translation table11 
GC content46% 
IMG OID641600011 
Productouter membrane fimbrial user protein 
Protein accessionYP_001723608 
Protein GI170018654 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAA AATTACTGGC TCTTTTGATC CTGGCGAGTC TCAGCCCGGC AGAGGCGGCA 
TTAACCAAAA TCCCCGCAGG GTTTGAGGTT ATTGCTCAGG GACAGCAGGA GTATATCGAG
GTTTATTTTT CAGGGAAAAA TCTCGGTAAA TATTATGCAA TGGTTAATCT TGATACCGTA
ACATTTCTTG ATCCAGCAAG TTTATATAAC AAGCTGGAAC TGGATGTCGA CGATCAGAAA
ATCGCGCATA TAGTGAAAGA AAAATTATCG CAGCCGCTAG CTCGCCACGG TGAATTGGCT
TGCGGTTATG TACGTACTGA CTCAGGGTGT GGTTTTCTGA ATACCGATAC GCTGGAAATA
ATCTATAATG ATGAAGAAAG TTCGGCAACG TTGTTTATTA ATCCGCAATG GAATTCAGCT
TTCGATGCGA AGTCATTATA TTTAAATCCA GACAAAAATA CGGTTAACGC TTTTATACAT
CAGCAAGACA TCAATGTTCT GGCACAGGAT GATTACCAAT CGTTGTCTAT TCAGGGAAAC
GGTGCGCTGG GAATAACAGA AAATAGCTAT ATTGGTGCAC ACTGGAATTT CAACGGTTAT
GATGCAGATG ATGTCAGTGA CAGTAATGCT GATGTCAGCG ATCTCTATTA TCGTTATGAT
TTTTTACGTC GTTATTATGT GCAGGCGGGG CGCATGGACA ACCGCACGCT ATTTAATGCA
CAAGGCGGGA ACTTTACCTT TAACTTTTTG CCACTCGGTG CAATCGACGG GATGCGTATC
GGGTCGACTC TTAGCTATTT AAACCAGGCG CAAAGCCAGC AGGGAACCCC GGTAATGGTC
CTGCTTTCGC GCAATTCTCG TGTTGACGCT TATCGTAATG AGCAACTTCT GGGATCGTTT
TATCTCAATA GTGGTTCGCA ATTTATTGAT ACCAGTTCCT TTCCGCCGGG TAGCTATAGC
GTAGCGTTAA AAGTCTATGA AAATAACCAA CTCACCCGCA CCGAGCTAGT ACCGTTTACC
AAAACAGGCG GTCTGACGGA CGGAAATGCG CAATGGTTCT TACAGGCAGG TAAAACTACA
TCACAGGCTT CTGATGATGA AAGTTCAGCT TATCAACTGG GGGTACGCCT GCCATTACAT
CCGCAATATG AGCTCTACGC AGGGCTGGCG AATGCCGATA ATGTGAGTGC TTTCGAGTTA
GGTAATGACT GGACGGCAAA TTTAGGCGGG GCAGGGAATC TTGCAATCAG CGCCAGCGTG
TTCCGTAACG ATGACGGCGG CAAAGGTGAT ATGCAACAGG CCAACTGGAG TAATTCGGGA
TGGCCGACGT TGGGCTTTTA TCGGACCAAC TCTGACGGCG ATGCTTGTGC AACCGACAGC
AGAGAGAGCT ATAACGCCTT AAGCTGTTAT GAAAGTATTT CCGCGACGGT TTCACAGAAT
TTTGTCGGCT GGAATATGAT GCTGGGTTAT TCCCGCACAC AAAATAACAC TGATGATAGT
TTGCGTTGGG ATAAACAGCA GAGCTTTGAA AATAACTATC TTCGCCAGAC AACTGCGCAA
AGTATCTCCG AAACTGTACA ACTTAGCGCT TCCCGCGCTT TTGTGATGCG TGACTGGATC
TTGAGTACTT CGGTTGGTGT TTTCCATCGT AATGACAACG GTGGCGATAA CGACGACAAC
GGCTTGTACT TATCGTTTTC GTTATCTGAC ACGCCAACGA TGGACAGCAA TAACAACAGC
CATTCAACCA ATGTTTCTAC GGATTATCGT TATAGCGATC AGGATGGCGA TCAAACGTCA
TGGCAGTTAT CCCATACTTT TTATAACGAT TCATTCAGCC ATAAAGAACT TGGCGTAACC
GTTGGGGGCC TGAACACCGA TACCATAAAC AGCGCGGTTA ACGGGCGTTG GGATGGTCAA
TACGGAAATG TCTACGCTAC CGTATCTGAC AGTTATGACC GTAAGAATCA TGATCATCTC
TCGGCCTTTA CGGGGACTTA CAGCTCTACA CTGGCTGTCA GTCGCTATGG CGTTAATTTG
GGTGCCAGTG GTACAGACGA TTTGCTGGGT GCGGTATTGG TGGATGTGAA AGGCTTCTCT
GAACAGGATG AAGAGAGTCA GGATCTGCAA CTCGAAGCGC GGGTGGCAGG CAGCCGAACG
TTGCAGCTTG GTCAAAGCGA CAGTGTGTTG TTCCCTTATC CTGGATTTCA GTCTGGTTTT
GTTGAGGTTA ACGACAGTAG CCAGGGCAAT CAACAAGGGA CAACAAACAT CATTAACGGT
GCGGGGAATC GTGAATTAAT GTTGTTGCCT GGCAAACTGC GCTATCGCGA AGTGTCTGCC
AGCTTTAATT ACAACTATAT CGGTCGCTTG TTATTACCAG CATCGGTAGA GAAATTCCCG
CTGGTTGGTC TGAATAGCGC CATGTTACTG GTAGCTGAAG ATGGCGGATT TACACTTGAA
ATTAACGGTA GCGAAAAAGA GCTATATCTG CTTTCCGGGC AGCAATTCCT TAAGTGTCCG
TTGAGTGTTG TAAAGAAACG CGCCAGCATT CGTTACAGCG GAGATGTCAC TTGTAGTGTG
GTGACTTATT CACAATTACC GGAATCCATT CAGGTTCAGG CACAGTTAAA ACAGCCTAAA
TTACGTGGAA ACGTTCAGAC GGCGCAAAGG GAGGTTGCAC CATGA
 
Protein sequence
MDKKLLALLI LASLSPAEAA LTKIPAGFEV IAQGQQEYIE VYFSGKNLGK YYAMVNLDTV 
TFLDPASLYN KLELDVDDQK IAHIVKEKLS QPLARHGELA CGYVRTDSGC GFLNTDTLEI
IYNDEESSAT LFINPQWNSA FDAKSLYLNP DKNTVNAFIH QQDINVLAQD DYQSLSIQGN
GALGITENSY IGAHWNFNGY DADDVSDSNA DVSDLYYRYD FLRRYYVQAG RMDNRTLFNA
QGGNFTFNFL PLGAIDGMRI GSTLSYLNQA QSQQGTPVMV LLSRNSRVDA YRNEQLLGSF
YLNSGSQFID TSSFPPGSYS VALKVYENNQ LTRTELVPFT KTGGLTDGNA QWFLQAGKTT
SQASDDESSA YQLGVRLPLH PQYELYAGLA NADNVSAFEL GNDWTANLGG AGNLAISASV
FRNDDGGKGD MQQANWSNSG WPTLGFYRTN SDGDACATDS RESYNALSCY ESISATVSQN
FVGWNMMLGY SRTQNNTDDS LRWDKQQSFE NNYLRQTTAQ SISETVQLSA SRAFVMRDWI
LSTSVGVFHR NDNGGDNDDN GLYLSFSLSD TPTMDSNNNS HSTNVSTDYR YSDQDGDQTS
WQLSHTFYND SFSHKELGVT VGGLNTDTIN SAVNGRWDGQ YGNVYATVSD SYDRKNHDHL
SAFTGTYSST LAVSRYGVNL GASGTDDLLG AVLVDVKGFS EQDEESQDLQ LEARVAGSRT
LQLGQSDSVL FPYPGFQSGF VEVNDSSQGN QQGTTNIING AGNRELMLLP GKLRYREVSA
SFNYNYIGRL LLPASVEKFP LVGLNSAMLL VAEDGGFTLE INGSEKELYL LSGQQFLKCP
LSVVKKRASI RYSGDVTCSV VTYSQLPESI QVQAQLKQPK LRGNVQTAQR EVAP