Gene EcE24377A_3626 details

Gene Information

Locus tag: EcE24377A_3626
Symbol:
ID: 5587988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3625995
End bp: 3628511
Gene Length: 2517 bp
Protein Length: 838 aa
Translation table: 11
GC content: 48%
IMG OID: 640927250
Product: fimbrial usher family protein
Protein accession: YP_001464619
Protein GI: 157156256
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
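
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes a single terminal stop codon. The helper names span_length and coding_codons are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3625995
end_bp = 3628511
gene_length_bp = 2517
protein_length_aa = 838


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Report whether the computed values agree with the listed fields.
print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the record.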


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACAAC GACACCACCA GGGACATAAA CGCACACCGA AACAGTTGGC GCTCATCATC 
AAACGCTGTT TGCCGATGGT GCTCACTGGC AGCGGCATGC TTTGCACTAC CGCTAACGCC
GAAGAGTATT ATTTCGACCC CATTATGCTG GAAACTACAA AAAGTGGTAT GCAAACAACT
GATCTGTCAC GTTTTTCAAA AAAATACGCA CAACTACCAG GAACTTATCA GGTTGATATC
TGGCTGAATA AAAAGAAGGT TTCACAGAAA AAAATTACAT TTACCGCCAA TGCAGAGCAA
CTTCTGCAGC CACAGTTTAC GGTAGAACAA CTACGTGAGC TGGGTATTAA GGTGGATGAA
ATCCCGGCGC TGGCTGAAAA AGATGACGAT AGCGTGATCA ACTCGCTTGA ACAAATCATT
CCCGGTACAG CTGCTGAATT TGATTTCAAT CATCAGCGAC TTAATTTGAG CATTCCCCAA
ATTGCACTGT ACCGTGATGC AAGAGGTTAC GTCTCCCCTT CTCGTTGGGA CGATGGTATA
CCAACGCTGT TTACCAACTA CTCGTTTACA GGTTCTGATA ACCGTTACCG CCAGGGCAAT
CGTAGCCAAC GACAGTACCT GAATATGCAA AATGGTGCTA ATTTTGGCCC CTGGCGATTA
CGTAACTATT CTACGTGGAC ACGCAACGAT CAGGCGTCAA GCTGGAACAC TATCAGTAGT
TATTTACAAC GTGATATCAA GGCGTTGAAG TCTCAGTTGC TTCTGGGAGA AAGCGCCACC
AGCGGCAGTA TTTTTTCCAG CTACACCTTT ACTGGCGTGC AACTCGCTTC CGACGATAAT
ATGTTGCCAA ACAGCCAGCG CGGATTTGCC CCAACGGTAC GCGGTATCGC AAACAGTAGT
GCAATCGTGA CTATCAGGCA AAATGGTTAT GTGATCTATC AAAGCAACGT GCCAGCGGGT
GCCTTTGAAA TTAACGATCT CTACCCCTCT TCCAACAGCG GCGATTTAGA AGTCACGATT
GAAGAAAGTG ACGGTACGCA ACGTCGCTTT ATCCAGCCTT ATTCTTCATT ACCCATGATG
CAGCGACCTG GGCATCTAAA ATATAGCGCG ACCGCTGGAC GCTATCGCGC TGATGCAAAC
AGTGATAGCA AGGAACCCGA ATTTGCTGAA GCCACGGCAA TATATGGTTT GAATAATACT
TTTACGCTGT ATGGCGGCCT GCTCGGTTCT GAAGATTATT ATGCGCTGGG GATCGGTATC
GGCGGCACAC TTGGCGCACT GGGCGCGTTG TCGATGGATA TCAACAGAGC TGACACCCAA
TTCGATAACC GGCACTCTTT TCATGGCTAT CAATGGCGTA CGCAGTACAT CAAAGATATC
CCGGAAACCA ACACCAATAT CGCTGTTAGC TACTATCGCT ATACCAACGA TGGCTATTTT
AGTTTTGATG AAGCCAATAC CCGTAATTGG GACTATAACA GTCGCCAAAA AAGTGAAATT
CAATTCAACA TCAGCCAGAC AATATTTGAT GGGGTAAGTC TGTATGCCTC CGGTTCGCAG
CAAGACTATT GGGGCAATAA CGATAAAAAC AGGAATATCT CAGTTGGGGT TTCCGGTCAG
CAATGGGGAA TTGGTTACAG CCTGAATTAT CAATACAGTC GCTACACGGA TCAAAATAAT
GACCGCGCAC TCTCTTTGAA TCTCAGTATT CCGTTAGAAC GCTGGTTACC GCGTAGCCGG
GTTTCCTATC AGATGACCAG CCAGAAAGAT CGCCCAACCC AACATGAAAT GCGTCTTGAT
GGCTCACTGC TGGATGATGG TCGCCTGAGC TATAGCCTGG AACAAAGTCT GGATGACGAT
AACAACCATA ACAGTAGCCT GAACGCCAGT TACCGTTCAC CTTATGGCAC CTTCAGTGCC
GGATACAGCT ACGGTAATGA CAGCAGCCAA TACAATTACG GCGTTACCGG CGGCGTGGTT
ATCCATCCTC ATGGCGTGAC GCTCTCGCAA TATCTGGGCA ACGCTTTTGC GCTTATCGAT
GCTAATGGGG CTTCTGGCGT GAGGATACAA AACTATCCGG GGATTGCTAC TGATCCCTTT
GGCTATGCAG TGGTTCCTTA TCTCACGACT TACCAGGAAA ACCGTCTCTC GGTAGATACT
ACGCAGCTGC CCGATAACGT CGATCTTGAG CAAACAACAC AGTTTGTGGT GCCCAACAGA
GGTGCAATGG TAGCGGCGCG TTTCAACGCC AATATCGGTT ATCGCGTACT TGTTACAGTC
AGCGATCGCA ACGGTAAACC GTTGCCCTTT GGCGCTCTTG CCAGCAACGA TGAGACGGGG
CAACAAAGTA TCGTCGATGA GGGCGGCATA CTATATCTCT CTGGGATATC GAGTAAATCA
CAAAGCTGGA CTGTACGCTG GGGAAATCAG GCAGATCAAC AATGTCAGTT TGCTTTTAGT
ACACCGGATT CAGAACCAAC AACCTCTGTA TTACAAGGCA CGGCGCAGTG CCATTAA
 
Protein sequence
MPQRHHQGHK RTPKQLALII KRCLPMVLTG SGMLCTTANA EEYYFDPIML ETTKSGMQTT 
DLSRFSKKYA QLPGTYQVDI WLNKKKVSQK KITFTANAEQ LLQPQFTVEQ LRELGIKVDE
IPALAEKDDD SVINSLEQII PGTAAEFDFN HQRLNLSIPQ IALYRDARGY VSPSRWDDGI
PTLFTNYSFT GSDNRYRQGN RSQRQYLNMQ NGANFGPWRL RNYSTWTRND QASSWNTISS
YLQRDIKALK SQLLLGESAT SGSIFSSYTF TGVQLASDDN MLPNSQRGFA PTVRGIANSS
AIVTIRQNGY VIYQSNVPAG AFEINDLYPS SNSGDLEVTI EESDGTQRRF IQPYSSLPMM
QRPGHLKYSA TAGRYRADAN SDSKEPEFAE ATAIYGLNNT FTLYGGLLGS EDYYALGIGI
GGTLGALGAL SMDINRADTQ FDNRHSFHGY QWRTQYIKDI PETNTNIAVS YYRYTNDGYF
SFDEANTRNW DYNSRQKSEI QFNISQTIFD GVSLYASGSQ QDYWGNNDKN RNISVGVSGQ
QWGIGYSLNY QYSRYTDQNN DRALSLNLSI PLERWLPRSR VSYQMTSQKD RPTQHEMRLD
GSLLDDGRLS YSLEQSLDDD NNHNSSLNAS YRSPYGTFSA GYSYGNDSSQ YNYGVTGGVV
IHPHGVTLSQ YLGNAFALID ANGASGVRIQ NYPGIATDPF GYAVVPYLTT YQENRLSVDT
TQLPDNVDLE QTTQFVVPNR GAMVAARFNA NIGYRVLVTV SDRNGKPLPF GALASNDETG
QQSIVDEGGI LYLSGISSKS QSWTVRWGNQ ADQQCQFAFS TPDSEPTTSV LQGTAQCH