Gene EcHS_A3336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3336 
Symbol 
ID5594849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3339740 
End bp3342256 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content48% 
IMG OID640922454 
Productfimbrial usher family protein 
Protein accessionYP_001459947 
Protein GI157162629 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.559371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACAAC GACACCACCA GGGACATAAA CGCACACCGA AACAGTTGGC GCTCATTATC 
AAACGCTGTT TGCCGATGGT GCTCACTGGC AGCGGCATGC TTTGCACTAC CGCTAACGCC
GAAGAGTATT ATTTCGACCC CATTATGCTG GAAACCACAA AAAGTGGTAT GCAAACAACC
GATCTGTCAC GATTTTCAAA GAAATACGCA CAACTACCAG GAACTTATCA GGTTGATATC
TGGCTGAATA AAAAGAAGGT TTCACAGAAA AAAATTACAT TTACCGCCAA TGCAGAGCAA
CTTCTGCAGC CACAGTTTAC GGTAGAACAA CTACGTGAGC TGGGTATTAA GGTGGATGAA
ATCCCGGCGC TGGCTGAAAA AGATGACGAT AGCGTGATCA ACTCGCTTGA ACAAATCATT
CCCGGTACAG CTGCTGAATT TGATTTCAAT CATCAGCGAC TTAATTTGAG CATTCCCCAA
ATTGCACTGT ACCGTGATGC AAGAGGTTAC GTCTCCCCTT CTCGTTGGGA CGATGGTATA
CCAACGCTGT TTACCAACTA CTCGTTTACA GGTTCTGATA ACTGTTACCG CCAGGGCAAT
CGTAGCCAAC GACAGTACCT GAATATGCAA AATGGTGCTA ATTTTGGCCC CTGGCGATTA
CGCAACTATT CCACATGGAC ACGCAACGAT CAGACATCAA GCTGGAATAC CATCAGTAGT
TATTTACAAC GTGATATCAA GGCATTGAAG TCTCAGTTGC TTCTGGGAGA AAGCGCCACC
AGCGGCAGTA TTTTTTCCAG CTACACCTTT ACTGGCGTTC AACTCGCTTC CGACGATAAT
ATGCTGCCAA ACAGCCAGCG CGGATTTGCC CCAACGGTAC GCGGTATCGC AAACAGTAGT
GCAATCGTGA CTATCAGGCA AAATGGTTAT GTGATCTATC AAAGCAACGT GCCAGCGGGT
GCCTTTGAAA TTAACGATCT CTACCCCTCT TCCAACAGCG GCGATTTAGA AGTCACGATT
GAAGAAAGTG ACGGTACGCA ACGTCGCTTT ATCCAGCCTT ATTCTTCATT ACCCATGATG
CAGCGACCTG GGCATCTAAA GTATAGCGCG ACCGCTGGAC GCTATCGCGC TGATGCAAAC
AGTGATAGCA AGGAACCCGA ATTTGCTGAA GCCACGGCAA TATATGGTTT GAATAATACT
TTTACGCTGT ATGGCGGCCT GCTCGGTTCT GAAGATTATT ATGCGCTGGG GATCGGTATC
GGCGGCACAC TTGGCGCACT GGGCGCGTTG TCGATGGATA TCAACAGAGC TGACACCCAA
TTCGATAACC AGCACTCTTT TCATGGCTAT CAATGGCGTA CGCAGTACAT CAAAGATATC
CCGGAAACCA ACACCAATAT CGCTGTCAGC TACTATCGCT ATACCAACGA TGGCTATTTT
AGTTTTGATG AAGCCAATAC CCGTAATTGG AACTATAACA GTCGCCAAAA AAGTGAAATT
CAATTCAACA TCAGCCAGAC AATATTTGAT GGGGTAAGTC TGTATGCCTC CGGTTCGCAG
CAAGACTATT GGGGCAATAA CGATAAAAAC AGGAATATCT CTGTTGGGGT TTCCGGCCAG
CAATGGGGAG TTGGTTACAG CCTGAATTAT CAATACAGCC GCTACACTGA TCAAAATAAT
GACCGCGCAC TCTCTTTGAA TCTCAGTATT CCGTTAGAAC GCTGGTTACC GCGTAGCCGG
GTTTCCTATC AGATGACCAG CCAGAAAGAT CGCCCAACCC AACATGAAAT GCGTCTTGAT
GGCTCACTGC TGGATGATGG TCGCCTGAGC TATAGCCTGG AACAAAGTCT GGATGACGAT
AACAACCATA ACAGTAGCCT GAACGCCAGT TACCGTTCAC CTTATGGAAC CTTCAGTGCC
GGATACAGCT ACGGTAATGA CAGCAGCCAA TACAATTACG GCGTTACCGG CGGCGTGGTT
ATCCATCCTC ATGGCGTGAC GCTCTCGCAA TATCTGGGCA ACGCTTTTGC GCTTATCGAT
GCTAATGGGG CTTCTGGCGT GAGGATACAA AACTATCCGG GGATTGCTAC CGATCCCTTT
GGCTATGCAG TGGTTCCTTA TCTCACGACT TACCAGGAAA ACCGTCTCTC GGTAGATACT
ACGCAGCTGC CCGATAACGT CGATCTTGAG CAAACAACAC AGTTTGTGGT GCCCAACAGA
GGTGCAATGG TAGCGGCGCG TTTCAACGCC AATATCGGTT ATCGCGTACT TGTTACAGTC
AGCGATCGCA ACGGTAAACC GTTGCCCTTT GGCGCTCTTG CCAGCAACGA TGAGACGGGG
CAACAAAGTA TCGTCGATGA GGGCGGCATA CTATATCTCT CTGGGATATC GAGTAAATCA
CAAAGCTGGA CTGTACGCTG GGGAAATCAG GCAGATCAAC AATGTCAGTT TGCTTTTAGT
ACACCGGATT CAGAACCCAC AACCTCTGTA TTACAAGGCA CGGCGCAGTG CCATTAA
 
Protein sequence
MPQRHHQGHK RTPKQLALII KRCLPMVLTG SGMLCTTANA EEYYFDPIML ETTKSGMQTT 
DLSRFSKKYA QLPGTYQVDI WLNKKKVSQK KITFTANAEQ LLQPQFTVEQ LRELGIKVDE
IPALAEKDDD SVINSLEQII PGTAAEFDFN HQRLNLSIPQ IALYRDARGY VSPSRWDDGI
PTLFTNYSFT GSDNCYRQGN RSQRQYLNMQ NGANFGPWRL RNYSTWTRND QTSSWNTISS
YLQRDIKALK SQLLLGESAT SGSIFSSYTF TGVQLASDDN MLPNSQRGFA PTVRGIANSS
AIVTIRQNGY VIYQSNVPAG AFEINDLYPS SNSGDLEVTI EESDGTQRRF IQPYSSLPMM
QRPGHLKYSA TAGRYRADAN SDSKEPEFAE ATAIYGLNNT FTLYGGLLGS EDYYALGIGI
GGTLGALGAL SMDINRADTQ FDNQHSFHGY QWRTQYIKDI PETNTNIAVS YYRYTNDGYF
SFDEANTRNW NYNSRQKSEI QFNISQTIFD GVSLYASGSQ QDYWGNNDKN RNISVGVSGQ
QWGVGYSLNY QYSRYTDQNN DRALSLNLSI PLERWLPRSR VSYQMTSQKD RPTQHEMRLD
GSLLDDGRLS YSLEQSLDDD NNHNSSLNAS YRSPYGTFSA GYSYGNDSSQ YNYGVTGGVV
IHPHGVTLSQ YLGNAFALID ANGASGVRIQ NYPGIATDPF GYAVVPYLTT YQENRLSVDT
TQLPDNVDLE QTTQFVVPNR GAMVAARFNA NIGYRVLVTV SDRNGKPLPF GALASNDETG
QQSIVDEGGI LYLSGISSKS QSWTVRWGNQ ADQQCQFAFS TPDSEPTTSV LQGTAQCH