Gene EcHS_A1589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1589 
Symbol 
ID5591496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1605676 
End bp1606824 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content45% 
IMG OID640920742 
Productfimbrial usher family protein 
Protein accessionYP_001458298 
Protein GI157160980 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGTT ACACCGTCAA GCCTCCTACC GGAGACACCA ATGAGCAGAC ACAATTTATT 
GATTATTTTA ATCTGTTCTA CAGTAAGCGT GGTCAGGAAC AAATAAGCAT CTCTCAGCAG
CTTGGAAATT ACGGTACGAC ATTTTTCAGT GCCAGTCGCC AAAGTTACTG GAACACGTCA
CGCAGCGACC AGCAAATATC ATTTGGATTA AATGTGCCGT TTGGTGATAT TACGACTTCG
CTGAATTACA GCTATTCCAA TAATATATGG CAAAACGATC GGGATCATTT ACTCGCTTTT
ACGCTTAATG TTCCCTTCAG TCATTGGATG CGTACAGACA GTCAGTCGGC ATTTCGTAAT
TCAAACGCCA GTTACAGTAT GTCAAACGAT TTGAAAGGCG GCATGACCAA TCTATCGGGG
GTTTATGGCA CTCTGCTGCC GGATAATAAC CTGAATTATA GCGTTCAGGT CGGTAACACC
CACGGAGGTA ATACATCGTC TGGCACCAGT GGTTACAGTT CTCTTAATTA TCGTGGAGCT
TATGGTAATA CTAATGTCGG TTACAGTCGG AGTGGTGACA GCAGCCAGAT TTATTACGGA
ATGAGTGGTG GGATTATTGC TCATGCTGAT GGCATCACCT TTGGACAGCC GCTGGGCGAC
ACAATGGTTC TGGTTAAGGC TCCTGGTGCT GATAATGTCA AAATAGAGAA CCAGACCGGA
ATTCATACCG ACTGGCGTGG CTATGCCATA TTACCATTTG CGACAGAATA TAGAGAAAAC
CGTGTTGCTC TTAACGCGAA TTCCCTTGCA GATAATGTTG AACTGGATGA AACCGTGGTC
ACTGTCATCC CAACTCACGG TGCTATTGCC AGAGCAACAT TTAATGCACA AATCGGCGGG
AAAGTATTAA TGACGTTGAA GTACGGTAAT AAGAGCGTTC CATTCGGTGC AATTGTCACA
CACGGAGAGA ATAAAAATGG CAGCATTGTC GCGGAAAATG GTCAGGTTTA TCTGACTGGA
CTTCCACAGT CAGGGCAATT ACAGGTTTCA TGGGGCAAAG ATAAAAACTC AAACTGTATT
GTCGAGTACA AGCTTCCTGA AGTTTCTCCT GGTACCTTAC TGAACCAGCA GACAGCAATC
TGTCGCTAA
 
Protein sequence
MSGYTVKPPT GDTNEQTQFI DYFNLFYSKR GQEQISISQQ LGNYGTTFFS ASRQSYWNTS 
RSDQQISFGL NVPFGDITTS LNYSYSNNIW QNDRDHLLAF TLNVPFSHWM RTDSQSAFRN
SNASYSMSND LKGGMTNLSG VYGTLLPDNN LNYSVQVGNT HGGNTSSGTS GYSSLNYRGA
YGNTNVGYSR SGDSSQIYYG MSGGIIAHAD GITFGQPLGD TMVLVKAPGA DNVKIENQTG
IHTDWRGYAI LPFATEYREN RVALNANSLA DNVELDETVV TVIPTHGAIA RATFNAQIGG
KVLMTLKYGN KSVPFGAIVT HGENKNGSIV AENGQVYLTG LPQSGQLQVS WGKDKNSNCI
VEYKLPEVSP GTLLNQQTAI CR