Gene EcHS_A2489 details


Gene Information

Locus tag: EcHS_A2489
Symbol:
ID: 5593695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2493977
End bp: 2496619
Gene Length: 2643 bp
Protein Length: 880 aa
Translation table: 11
GC content: 52%
IMG OID: 640921610
Product: fimbrial usher family protein
Protein accession: YP_001459144
Protein GI: 157161826
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
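
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based, inclusive coordinates and that the CDS length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2493977, 2496619)
print(length_bp)                  # 2643
print(protein_length(length_bp))  # 880
```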


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGATC ATTCTCTTTT TCGATTACGG ATTCTTCCGT GGTGCATTGC GCTGGCAATG 
TCAGGGAGTT ATAGCAGTGT CTGGGCTGAA GACGACATTC AGTTTGATTC CCGTTTTCTG
GAATTAAAAG GCGACACAAA AATTGATCTG AAGCGTTTTT CCAGCCAGGG ATATGTTGAG
CCCGGAAAAT ACAATTTACA GGTTCAACTA AATAAACAGC CATTGGCGGA AGAGTACGAT
ATTTACTGGT ATGCTGGTGA AGATGACGCG AGCAAAAGCT ATGCTTGTCT GACACCGGAA
CTGGTAGCGC AGTTTGGTTT AAAAGAAGAC GTGGCGAAAA ATCTGCAATG GAGCCACGAT
GCTAAATGCC TGAAATCCGG TCAACTGGAA GGCGTGGAAA TTAAGGCTGA TTTAAGCCAG
TCCGCATTAG TCATTTCACT GCCACAGGCT TACCTCGAAT ATACTTATCC CGACTGGGAT
CCGCCTTCAC GTTGGGATGA CGGCATCTCC GGGATCGTCG CGGACTACAG CATCAACGCA
CAAACCCGGC ACGAAGAAAA TGGCGGTGAT GATAGTAACG AGATCAGCGG CAACGGGACG
GTCGGGGTTA ACCTGGGGCC GTGGCGTATG CGTGCTGACT GGCAGACTAA CTATCAACAT
ACTCGCAGTA ATGATGACGA TGAATTCAGC GGCGATGAAA CTCAAAAAAA ATGGGAGTGG
AGTCGCTACT ATGCCTGGCG GGCGTTACCA TCATTAAAAG CCAAACTGGC GCTGGGCGAG
GATTACCTCA GATCCGATAT TTTTGATGGT TTTAACTATG TTGGTGGCAG TGTCAGTACT
GACGATCAAA TGTTGCCTCC CAATCTGCGC GGCTACGCGC CAGACATTTC CGGCGTGGCA
CACACCACAG CAAAAGTGAC CGTCAGCCAG ATGGGGCGTG TGATTTACGA AACGCAGGTT
CCGGCTGGAC CGTTTCGTAT TCAGGATCTT GGTGATTCTG TCTCCGGTAC GTTGCATATT
CGCATTGAAG AACAGAACGG CCAGGTGCAG GAATATGACA TCAGCACCGC CTCGATGCCA
TACCTCACTC GTCCAGGTCA GGTTCGCTAT AAGATCATGA TGGGCCGTCC GCAAGAGTGG
GGACACCATG TCGAGGGTGA ATTTTTTTCT GGTGCTGAAG CTTCCTGGGG GATCGCTAAC
GGCTGGTCGT TATATGGCGG CGCACTGGGA GATGAAAACT ATCAGTCTGC GGCGCTTGGC
GTCGGTCGCG ATTTGTCTAC ATTCGGCGCG GTCGCGTTTG ATGTTACTCA CTCGCACACC
AAACTGGATA AAGACACCGC TTATGGCAAA GGTTCGCTGG ACGGTAACTC CTTCCGTGTG
AGTTATTCCA AAGACTTTGA CCAGCTCAAC AGCCGCGTTA CTTTTGCTGG ATATCGCTTC
TCGGAAGAGA ACTTTATGAC CATGAGCGAG TATCTGGATG CCAGTGACAG CGGAATGGTA
CGCACGGGCA ACGACAAAGA GATGTACACC GCCACTTATA ACCAGAACTT CCGCGATGCG
GGTGTTTCGG TTTATCTCAA CTATACCCGC CATACCTACT GGGATCGCGA GGAGCAGACA
AACTACAACA TCATGCTCTC GCACTATTTC AATATGGGCA GTATTCGTAA TGTCAGCATC
TCGATGACTG GCTACCGTTA CGAGTATGAC AACCAGGCCG ACAAAGGCAT GTACATTTCG
CTCAGTATGC CGTGGGGCGA CAACAGTACC GTTAGCTATA ACGGCAACTA TGGCAGTGGG
ACGGACAGCA GTCAGGTCGG TTATTTCAGC CGTGTCGATG ACGCGACTCA CTATCAGTTG
AACGTCGGCA CCAGTGACAA ACACACCAGC GTTGACGGCT ATTACAGCCA TGATGGTTCG
CTGGCGCAGG TTGACCTCAG CGCGAACTAC CATGAAGGGC AATACACCTC TGCGGGCTTG
TCGTTACAGG GCGGCGCAAC GCTTACTGCC CACGGTGGCG CACTTCACCG TACCCAGAAT
ATGGGCGGGA CACGCTTGTT GATTGATGCC GATGGCGTTG CCGATGTTCC GGTGGAAGGT
AACGGGGCAG CTGTTTATAC CAATATGTTT GGTAAGGCCG TCGTTTCTGA CGTCAATAAC
TATTACCGCA ATCAGGCGTA TATCGACCTC AACAAACTGC CGGAAAACGC CGAAGCAACC
CAGTCGGTGG TACAAGCCAC GCTAACTGAA GGGGCGATTG GCTACCGCAA ATTTACCGTC
ATCAGTGGTC AAAAAGCGAT GGCGGTGCTG CGCCTGAGCG ACGGCAGCCA TCCTCCGTTT
GGCGCAGAAG TAAAAAATGA TAACGAGCAG ACAGTGGGCC TTGTCGATGA TGACGGCAAT
GTTTATCTGG CAGGGGTGAA ACCTGGCGAA CACATGAGTG TGTTCTGGAG TGGTGTTGCG
CATTGCGATA TCAACCTGCC GGACCCGCTG CCTGCCGATC TGTTTAACGG CTTGTTACTG
CCATGCCAGC ATAAAGGCAA TGTAGCACCT GTCACTTCGC CGGCGGTCAA ACCGGCGATT
CAGGAACAGA CACAGCGGGT GACGCCAACG GAACCCCCGA CTTCAATTTC AGTAAACCAG
TAA
 
Protein sequence
MPDHSLFRLR ILPWCIALAM SGSYSSVWAE DDIQFDSRFL ELKGDTKIDL KRFSSQGYVE 
PGKYNLQVQL NKQPLAEEYD IYWYAGEDDA SKSYACLTPE LVAQFGLKED VAKNLQWSHD
AKCLKSGQLE GVEIKADLSQ SALVISLPQA YLEYTYPDWD PPSRWDDGIS GIVADYSINA
QTRHEENGGD DSNEISGNGT VGVNLGPWRM RADWQTNYQH TRSNDDDEFS GDETQKKWEW
SRYYAWRALP SLKAKLALGE DYLRSDIFDG FNYVGGSVST DDQMLPPNLR GYAPDISGVA
HTTAKVTVSQ MGRVIYETQV PAGPFRIQDL GDSVSGTLHI RIEEQNGQVQ EYDISTASMP
YLTRPGQVRY KIMMGRPQEW GHHVEGEFFS GAEASWGIAN GWSLYGGALG DENYQSAALG
VGRDLSTFGA VAFDVTHSHT KLDKDTAYGK GSLDGNSFRV SYSKDFDQLN SRVTFAGYRF
SEENFMTMSE YLDASDSGMV RTGNDKEMYT ATYNQNFRDA GVSVYLNYTR HTYWDREEQT
NYNIMLSHYF NMGSIRNVSI SMTGYRYEYD NQADKGMYIS LSMPWGDNST VSYNGNYGSG
TDSSQVGYFS RVDDATHYQL NVGTSDKHTS VDGYYSHDGS LAQVDLSANY HEGQYTSAGL
SLQGGATLTA HGGALHRTQN MGGTRLLIDA DGVADVPVEG NGAAVYTNMF GKAVVSDVNN
YYRNQAYIDL NKLPENAEAT QSVVQATLTE GAIGYRKFTV ISGQKAMAVL RLSDGSHPPF
GAEVKNDNEQ TVGLVDDDGN VYLAGVKPGE HMSVFWSGVA HCDINLPDPL PADLFNGLLL
PCQHKGNVAP VTSPAVKPAI QEQTQRVTPT EPPTSISVNQ
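
The sequences above are printed in blocks of 10 characters, 60 per line. A minimal sketch for turning such a block back into a contiguous string and computing its GC content follows. The helper names are hypothetical, and the input in the usage lines is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_line = "ATGCCTGATC ATTCTCTTTT TCGATTACGG ATTCTTCCGT GGTGCATTGC GCTGGCAATG"
seq = join_blocks(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC content of this one line only
```

Applied to the whole Gene sequence block, the joined length should equal the Gene Length field (2643 bp), and the GC fraction can be compared with the reported GC content.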