Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1050 |
Symbol | |
ID | 5591790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1063422 |
End bp | 1064492 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640920215 |
Product | putative fimbrial protein |
Protein accession | YP_001457780 |
Protein GI | 157160462 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.0173294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATAA TCTTTGGAGA AAAATGCGTG TCATTACTAC GACTATTTTT TGCCGCCGTC TTAATGCTAT GGTGCGCTCA AACCGCTGCT TATAGCGGGC AGTGTCATAC TACTCAGGGG AATCCGTATA TTGGCGTCAA TTTTGGCGTT AAAACCCTGG AGGAAGAAGC AAATACGGCA GGGGTAGTTA AAGACAAATT TTATCAGTGG AACGAATCGA ATGATTATTA TGTTTCCTGT GATTGCGATA AAGACAATGT CAGAAGTGGC CGATGGGCAT TCGCCGCGGA TTCACCGTTA GTCTATTTAG GCGACAACTG GTACAAAATT AATGACTATC TTGCCGCCAA AGTTTTATTG CAGGTTAAAG GCAGTTCTCC TACTGCGGTT CCTTTCGAAA ACGTGGGCAC AGGGGGGGAT ACCCGATGGC ATATTTGCGA CCCTGGCGGT CAACGTTTAG GTGGGCAGGG GGCAAGCGGT AATAGCGGTA GCTTTTCCCT GAAAATATTG CAGCCGTTCG TTGGCTCGGT CGTCATTCCT CCTATGGCGC TGGCGCGATT ATATGAATGC TACAACATAC CCGCAGGTGA TTCCTGCACG ACTACAGGTA CACCGGTTTT AGTGTATTAC CTGTCTGGTA CGATCAATTC ACTTGGCTCA TGTTCCGTCA ATGCCGGAGA GACAATTGAA GTTGATTTAG GTGATGTCTT CGCTGCCAAT TTCCGTGTTG TAGGGCATAA ACCTCTTGGG GCCAGAACAG CAGAACTTGC AATTCCAGTC AGGTGTAACA CGGGAAACGC GGGATTAGTT AATGTCAACC TGAGTCTGAC GGCAACCACA GACCCCAGCT ATCCCCAGGC GATTAAGACG TCACGTCCTG GCGTGGGCGT GGTGGTGACC GATAGCCAGA ACAACATTAT TTCCCCTGCT GGTGGAACAT TACCGCTCTC TATTCCTGAT GATGCAGACA GTATCGCGCG AATGAATGTC TATCCAGTCA GCACGACAGG TGTACCACCA GAAACCGGGC GATTTGAAGC CACGGCAACG GTGAGAATAA ATTTTGATTA A
|
Protein sequence | MQIIFGEKCV SLLRLFFAAV LMLWCAQTAA YSGQCHTTQG NPYIGVNFGV KTLEEEANTA GVVKDKFYQW NESNDYYVSC DCDKDNVRSG RWAFAADSPL VYLGDNWYKI NDYLAAKVLL QVKGSSPTAV PFENVGTGGD TRWHICDPGG QRLGGQGASG NSGSFSLKIL QPFVGSVVIP PMALARLYEC YNIPAGDSCT TTGTPVLVYY LSGTINSLGS CSVNAGETIE VDLGDVFAAN FRVVGHKPLG ARTAELAIPV RCNTGNAGLV NVNLSLTATT DPSYPQAIKT SRPGVGVVVT DSQNNIISPA GGTLPLSIPD DADSIARMNV YPVSTTGVPP ETGRFEATAT VRINFD
|
| |