Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1049 |
Symbol | |
ID | 5591789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1060831 |
End bp | 1063431 |
Gene Length | 2601 bp |
Protein Length | 866 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640920214 |
Product | outer membrane usher protein fimD-like protein |
Protein accession | YP_001457779 |
Protein GI | 157160461 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 0.607471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAGAA CTCACCGACA ACACAGCCTG TTAAGCTCTG GTGGAGTGCC ATCGTTTATT GGTGGGCTGG TGGTGTTTGT GTCGGCAGCG TTCAATGCAC AAGCTGAAAC CTGGTTCGAA CCTGCCTTTT TCAAAGATGA TCCCTCAATG GTGGCCGATT TGTCTCGTTT CGAAAAAGGA CAAAAAATAA CGCCAGGGGT TTATCGTGTC GATATTGTTC TGAATCAGAC AATTGTAGAT ACGCGCAACG TCAATTTTGT TGAGTTAACG CCAGAGAAGG GGATTGCCGC CTGTTTGACG ACTGAAAGCC TGGATGCAAT GGGTGTGAAT ACTGATGCGT TTCCGGCTTT TAAACAACTG GACAAACAAG CGTGTGCGCT ATTGGCGGAG ATTATTCCGG ATGCCAGCGT AACTTTTAAT GTGAATAAAC TCCGTCTGGA AATTTCAGTA CCGCAAATTG CTATAAAAAG TAACGCTCGT GGTTATGTCC CCCCTGAACG TTGGGATGAA GGGATCAACG CGCTATTACT GGGATATTCA TTTAGCGGGG CTAACAGTAT TCATAGCAGC GCAGACAGTG ATTCTGGCGA CAGTTATTTT CTGAATTTAA ACAGTGGCGT TAATTTAGGC CCATGGAGAT TGCGCAACAA TTCAACATGG AGTCGCAGTA GTGGCCAAAC CGCAGAATGG AAGAATCTCA TCAGCTATTT GCAGCGGGCG GTTATTCCAC TAAAAGGCGA ACTGACCGTA GGTGATGATT ATACTGCAGG CGATTTTTTC GATAGTGTCA GCTTTCGTGG TGTGCAGCTG GCGTCAGATG ACAACATGCT GCCAGACAGC CTGAAAGGGT TTGCGCCTGT GGTGCGTGGT ATCGCCAAAA GCAATGCCCA GATAACGATT AAGCAAAATG GTTACACCAT TTACCAAACT TATGTATCGC CTGGTGCTTT TGAAATTAGT GATCTCTATT CCACGTCGTC GAGCGGTGAT TTGTTAGTTG AAATCAAAGA AGCGGACGGC AGCGTCAATA GCTACAGCGT ACCGTTTTCC AGCGTGCCAT TACTCCAGCG TCAGGGGCGA ATCAAATACG CGGTGACACT GGCGAAATAC AGAACCAATA GTAATGAACA GCAGGAGAGC AAATTTGCCC AGGCCACGTT GCAGTGGGGC GGACCGTGGG GAACGACATG GTATGGTGGT GGACAATATG CTGAATATTA CCGTGCCGCC ATGTTTGGTC TGGGATTTAA CCTTGGCGAT TTCGGAGCAA TTTCGTTCGA TGCGACCCAG GCGAAGAGTA CGCTGGCAGA CCAAAGCGAA CATAAAGGTC AGTCATATCG TTTTCTGTAT GCCAAAACGC TCAACCAATT GGGCACTAAT TTTCAATTGA TGGGCTATCG CTATTCGACG TCGGGTTTCT ACACCCTTTC CGACACCATG TATAAACATA TGGATGGCTA CGAATTTAAT GACGGTGATG ATGAAGATAC GCCGATGTGG TCGCGTTATT ACAATTTGTT TTACACCAAA CGTGGCAAAC TGCAGGTCAA TATCTCCCAG CAATTAGGCG AGTACGGTTC GTTTTATTTA AGTGGTAGCC AGCAAACTTA CTGGCATACC GATCAACAGG ATCGGCTATT ACAGTTTGGC TACAACACGC AAATTAAAGA TCTCTCGCTG GGGGGTTCCT GGAACTACAG TAAGTCCCGT GGTCAACCTG ATGCTGATCA GGTGTTTGCA CTAAATTTTT CCCTGCCGCT CAATCTGTTG CTCCCCAGAA GTAATGATAG CTATACCAGG AAAAAAAATT ACGCCTGGAT GACCTCTAAC ACCAGTATCG ATAACGAAGG GCACATTACA CAAAACCTGG GTTTAACGGA GACACTACTC GATGACGGTA ACCTGAGCTA CAGCGTGCAA CAGGGATATA ACAGCGAGGG GAAAACGGCT AATGGTAGCG CCAGCATGGA CTACAAAGGG GCGTTTGCAG ATGCCCGAGT GGGCTACAAC TACAGCGATA ACGGCAGTCA ACAACAACTG AACTACGCTC TTTCAGGCAG TTTAGTTGCC CATTCACAGG GCATTACCCT GGGGCAATCG CTGGGGGAAA CTAACGTTCT GATTGCAGCA CCAGGCGCAG AGAATACTCG TGTGGCGAAC AGCACCGGGC TGAAAACTGA CTGGCGCGGA TATACCGTTG TTCCTTATGC CACTTCTTAT CGGGAAAATC GAATCGCACT TGATGCGGCG TCGTTAAAAC GTAACGTGGA TCTTGAAAAT GCAGTAGTCA ACGTGGTTCC CACCAAAGGG GCGTTGGTTC TGGCGGAGTT CAATGCCCAT GCGGGTGCAA GGGTATTAAT GAAAACATCA AAGCAGGGTA TACCGCTGCG TTTTGGCGCG ATAGCGACGC TGGACGGCGT ACAGGCTAAT AGCGGCATAA TTGATGATGA TGGCTCGCTC TATATGGCGG GTTTACCGGC GAAGGGAACA ATAAGCGTGC GCTGGGGCGA AGCTCCCGAT CAAATTTGTC ATATCAATTA CGAGCTTACC GAACAACAAA TTAACTCTGC GATTACGCGA ATGGATGCCA TATGCAGATA A
|
Protein sequence | MYRTHRQHSL LSSGGVPSFI GGLVVFVSAA FNAQAETWFE PAFFKDDPSM VADLSRFEKG QKITPGVYRV DIVLNQTIVD TRNVNFVELT PEKGIAACLT TESLDAMGVN TDAFPAFKQL DKQACALLAE IIPDASVTFN VNKLRLEISV PQIAIKSNAR GYVPPERWDE GINALLLGYS FSGANSIHSS ADSDSGDSYF LNLNSGVNLG PWRLRNNSTW SRSSGQTAEW KNLISYLQRA VIPLKGELTV GDDYTAGDFF DSVSFRGVQL ASDDNMLPDS LKGFAPVVRG IAKSNAQITI KQNGYTIYQT YVSPGAFEIS DLYSTSSSGD LLVEIKEADG SVNSYSVPFS SVPLLQRQGR IKYAVTLAKY RTNSNEQQES KFAQATLQWG GPWGTTWYGG GQYAEYYRAA MFGLGFNLGD FGAISFDATQ AKSTLADQSE HKGQSYRFLY AKTLNQLGTN FQLMGYRYST SGFYTLSDTM YKHMDGYEFN DGDDEDTPMW SRYYNLFYTK RGKLQVNISQ QLGEYGSFYL SGSQQTYWHT DQQDRLLQFG YNTQIKDLSL GGSWNYSKSR GQPDADQVFA LNFSLPLNLL LPRSNDSYTR KKNYAWMTSN TSIDNEGHIT QNLGLTETLL DDGNLSYSVQ QGYNSEGKTA NGSASMDYKG AFADARVGYN YSDNGSQQQL NYALSGSLVA HSQGITLGQS LGETNVLIAA PGAENTRVAN STGLKTDWRG YTVVPYATSY RENRIALDAA SLKRNVDLEN AVVNVVPTKG ALVLAEFNAH AGARVLMKTS KQGIPLRFGA IATLDGVQAN SGIIDDDGSL YMAGLPAKGT ISVRWGEAPD QICHINYELT EQQINSAITR MDAICR
|
| |