Gene EcHS_A3222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3222 
Symbol 
ID5592065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3230856 
End bp3233357 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content42% 
IMG OID640922340 
Productfimbrial usher protein 
Protein accessionYP_001459838 
Protein GI157162520 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.000448746 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAAA AATTAAAATT AACCACAATA AGCGAATTGA TTAAAAATAT TTATTGTTCA 
TTATCCGTTA TCATCATTGG TTGTGCGTCA GCTTATGCCG TTGAATTCAA CAAAGATTTA
ATCGAAGCCG AAGATCGTGA AAACGTTAAC CTTTCCCAAT TTGAAACTGA TGGCCAATTA
CCCGTCGGCA AATATTCACT AAGCACTCTG ATTAATAATA AGAGGACGCC AATCCATCTG
GACCTGCAAT GGGTATTAAT TGATAACCAA ACTGCAGTTT GTCTGACTCC AGAGCAATTA
ACATTATTAG GCTTTACTGA TGAGATTATT GAAGAGGCTC AGCAAAACCT GATCGATGGT
TGTTACCCTA TCGAAAAAGA AAAACAAATT ACAACTTATC TCGATAAAGG GAAAATGCAA
TTATCCATAT CTGCACCTCA GGCATGGTTA AAATACAAAG ATGCAAACTG GACGCCTCCT
GAACTTTGGG ATCATGGTAT TGCTGGGGCA TTTCTTGACT ACAATTTATA TGCCTCTCAT
TATGCACCAC ATCAGGGCGA TAATTCACAA AATATAAGTT CCTATGGGCA GGCTGGGGTT
AATCTTGGGG CCTGGCGCCT GCGTACTGAT TACCAGTACG ATCAGTCATT TAACAATGGC
AAAAGCCAGG CGAACAACCT GGATTTTCCG CGTATTTATT TGTTTCGCCC AATCCCAGAA
ATTAATGCAA AACTAACTAT AGGTCAATAC GATACCGAAT CGTCTATTTT CGACTCTTTC
CATTTTTCTG GCATTTCGTT GAAAAGTGAT GAAAACATGT TACCGCCAGA TCTACGTGGT
TACGCACCGC AAATCACGGG GGTCGCACAA ACGAATGCAA AGGTCACTGT CTCACAGAAC
AACCGTATTA TTTATCAAGA AAACGTTCCT CCAGGCCCAT TTTCTATTAC CAATTTATTC
AATACATTAC AGGGGCAACT GGACGTCAAG GTTGAAGAAG AGGACGGGCG CGTCACGCAA
TGGCAAGTTG CATCTAATAG CATTCCTTAT CTGACGCGTA AAGGGCAGAT TCGCTACACT
ACTGCAATGG GGAAACCGAC CAGCGTTGGC GGCGATTCCT TACAACAGCC CTTCTTCTGG
ACCGGTGAAT TCTCATGGGG TTGGCTGAAC AACGTATCCC TGTATGGTGG TTCAGTATTA
ACAAACCGTG ATTATCAATC CCTGGCTACC GGAGTTGGTT TTAACCTTAA CTCGTTGGGC
TCATTATCCT TTGATGTCAC GCGATCTGAT GCTCAGTTAC ATAATCAGAA TAAAGAAACG
GGTTATAGCT ACCGTGCTAA CTATTCAAAA CGTTTTGAAT CTACCGGTAG CCAGCTCACT
TTCGCTGGCT ACCGTTTCTC TGATAAAAAC TTTGTGTCGA TGAATGAATA TATCAATGAC
ACTAACCATT ACACGAATTA TCAGAATGAA AAAGAGAGTT ATATTGTCAC GTTTAACCAG
TATCTTGAAT CATTGAGATT AAATACATAC GTAAGTCTGG CTCGTAATAC TTACTGGGAC
GCCAGCAGTA ATGTGAATTA TTCATTATCA CTTAGCCGCG ATTTTGATAT CGGCCCATTA
AAAAACGTAT CCACCTCACT AACATTTAGC CGAATAAACT GGGAAGATGA CAACCAGGAT
CAACTGTACC TAAATATTTC AATTCCCTGG GGAACCAGTA GAACATTGAG CTATGGTATG
CAACGAAATC AGGATAACAA CATTTCGCAT ACTGCTTCGT GGTATGACTC TTCCGATCGA
AATAATTCCT GGAGCGTTTC TGCTTCAGGC GACAATGACG AATTTAAAGA TATGGAGGCG
TCACTGCGCG CCAGTTATCA GCATAATACC GAGAACGGTC GTCTCTATCT CTCCGGTACA
TCACAGCGGG ACAGCTACTA TTCTCTGAAT GCCAGTTGGA ACGGTTCATT CACTGCAACT
CGCCACGGCG CCGCTTTCCA TGACTATAGT GGTAGTGCTG ACTCGCGTTT TATGATCGAC
GCAGACGGCG CTGAAGATAT TCAGTTGAAT AATAAACGCG CGGTAACTAA TCGTTATGGC
ATCGGAGTTA TTCCATCAGT CAGCAGTTAC ATAACAACAT CATTAAGCGT TGACACCCGA
AATCTGCCAG AAAATGTGGA TATCGAAAAC TCGGTTATCA CCACCACCTT AACCGAGGGT
GCTATTGGCT ACGCCAAACT TGATACCCGC AAAGGCTACC AAATCATGGG GATTATTCGC
CTGGCAGATG GTAGTCACCC ACCACTGGGG ATTAGCGTAA AAGATGAAAC CAGCCACAAA
GAATTAGGAC TTGTTGCTGA TGGCGGCTTT GTATACCTCA ACGGCATTCA AGATGACAGC
AAAATTACTT TACACTGGGG TGACAAATCT TGTTTTATTC AACCCCCCCC CAATAGCAGC
AACTTAACCA CCGGAACGGT TATTTTACCG TGTATTAGCT AA
 
Protein sequence
MYKKLKLTTI SELIKNIYCS LSVIIIGCAS AYAVEFNKDL IEAEDRENVN LSQFETDGQL 
PVGKYSLSTL INNKRTPIHL DLQWVLIDNQ TAVCLTPEQL TLLGFTDEII EEAQQNLIDG
CYPIEKEKQI TTYLDKGKMQ LSISAPQAWL KYKDANWTPP ELWDHGIAGA FLDYNLYASH
YAPHQGDNSQ NISSYGQAGV NLGAWRLRTD YQYDQSFNNG KSQANNLDFP RIYLFRPIPE
INAKLTIGQY DTESSIFDSF HFSGISLKSD ENMLPPDLRG YAPQITGVAQ TNAKVTVSQN
NRIIYQENVP PGPFSITNLF NTLQGQLDVK VEEEDGRVTQ WQVASNSIPY LTRKGQIRYT
TAMGKPTSVG GDSLQQPFFW TGEFSWGWLN NVSLYGGSVL TNRDYQSLAT GVGFNLNSLG
SLSFDVTRSD AQLHNQNKET GYSYRANYSK RFESTGSQLT FAGYRFSDKN FVSMNEYIND
TNHYTNYQNE KESYIVTFNQ YLESLRLNTY VSLARNTYWD ASSNVNYSLS LSRDFDIGPL
KNVSTSLTFS RINWEDDNQD QLYLNISIPW GTSRTLSYGM QRNQDNNISH TASWYDSSDR
NNSWSVSASG DNDEFKDMEA SLRASYQHNT ENGRLYLSGT SQRDSYYSLN ASWNGSFTAT
RHGAAFHDYS GSADSRFMID ADGAEDIQLN NKRAVTNRYG IGVIPSVSSY ITTSLSVDTR
NLPENVDIEN SVITTTLTEG AIGYAKLDTR KGYQIMGIIR LADGSHPPLG ISVKDETSHK
ELGLVADGGF VYLNGIQDDS KITLHWGDKS CFIQPPPNSS NLTTGTVILP CIS