Gene Information |
Locus tag | EcHS_A2373 |
Symbol | |
ID | 5592447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2381399 |
End bp | 2385151 |
Gene Length | 3753 bp |
Protein Length | 1250 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640921500 |
Product | adhesin |
Protein accession | YP_001459034 |
Protein GI | 157161716 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01376] Chlamydial polymorphic outer membrane protein repeat [TIGR01414] outer membrane autotransporter barrel domain [TIGR02601] autotransporter-associated beta strand repeat |
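The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2381399, 2385151)
print(length)                     # 3753
print(protein_length_aa(length))  # 1250
```

Under these assumptions the values agree with the Gene Length (3753 bp) and Protein Length (1250 aa) fields of this record.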
| |
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATTA TCTTTCTACG CAAGGAGTAT TTATCTTTAC TCCCGTCAAT GATTGCATCT CTTTTCTCTG CTAACGGTGT CGCGGCGGTC ACTGATTCAT GCCAGGGATA TGATGTCAAA GCGAGTTGTC AGGCCAGCAG GCAAAGCCTT TCAGGCATTA CGCAGGACTG GAGTATCGCT GATGGGCAAT GGCTGGTTTT TTCGGATATG ACCAATAACG CCAGCGGTGG GGCCGTATTT TTGCAACAAG GAGCGGAATT TTCACTATTA CCAGAAAATG AAACTGGAAT GACTCTGTTT GCCAATAACA CCGTTACAGG AGAATATAAT AACGGCGGGG CCATATTTGC TAAAGAAAAC TCAACGCTGA ATCTTACTGA TGTTATTTTT TCCGGTAACG TCGCAGGCGG CTATGGTGGC GCAATCTATT CTTCTGGTAC TAACGATACT GGTGCCGTCG ATTTACGTGT CACTAACGCC ATGTTTCGCA ATAACATCGC TAATGATGGC AAAGGTGGCG CAATTTATAC CATTAATAAT GACGTTTATT TAAGTGATGT TATTTTTGAT AACAACCAGG CATATACATC AACAAGTTAC AGTGATGGCG ATGGCGGGGC AATCGATGTT ACCGATAATA ATAGCGACAG CAAGCATCCT TCAGGTTATA CGATAGTAAA TAACACTGCC TTTACAAATA ACACTGCCGA AGGTTATGGC GGGGCGATAT ATACCAATAG CGTGACGGCT CCCTATCTTA TTGATATTTC TGTTGATGAC AGCTACAGCC AGAACGGAGG CGTGTTAGTC GATGAGAACA ATAGCGCAGC AGGCTATGGA GATGGTCCTT CCTCTGCGGC GGGTGGCTTT ATGTATCTCG GCTTAAGTGA AGTTACCTTT GATATTGCCG ACGGAAAAAC GCTGGTTATT GGCAATACAG AGAATGACGG AGCTGTTGAC TCTATTGCTG GTACCGGGTT AATCACCAAA ACAGGTTCCG GCGATCTGGT ACTTAATGCA GATAACAATG ACTTTACTGG TGAGATGCAG ATTGAAAACG GTGAAGTTAC CCTGGGCCGC AGCAACTCCC TGATGAATGT CGGCGATACG CATTGCCAGG ACGATCCGCA AGACTGCTAC GGTCTGACGA TAGGGAGTAT TGATCAGTAT CAGAATCAGG CTGAGCTAAA CGTTGGCTCG ACCCAACAAA CTTTTGTGCA CGCATTGACG GGCTTTCAGA ATGGCACTTT AAATATCGAT GCTGGTGGCA ACGTTACTGT TAATCAGGGC AGTTTTGCTG GCATCATCGA AGGTGCTGGT CAGCTCACCA TTGCGCAAAA CGGCAGCTAC GTGCTGGCAG GGGCGCAGTC GATGGCGCTA ACCGGCGATA TAGTCGTTGA TGATGGTGCG GTGCTTTCGC TGGAAGGCGA CGCGGCAGAT CTTACCGCTC TCCAGGACGA TCCGCAGTCG ATCGTGTTAA ACGGCGGTGT GCTCGATCTC TCTGATTTCT CCACCTGGCA GAGCGGCACA TCATACAACG ATGGCCTTGA AGTCAGTGGC AGCAGCGGAA CGGTTATCGG CAGTCAGGAT GTGGTAGATC TTGCAGGTGG CGACAATTTG CATATCGGCG GCGACGGGAA AGATGGCGTC TACGTGGTGG TCGATGCGAG CGACGGGCAG GTAAGTCTGG CAAACAATAA TAGTTATTTG GGCACAACAC AAATCGCCTC CGGTACGCTG ATGGTGAGCG ACAACTCGCA GCTTGGAGAT ACCCACTATA ACCGCCAGGT TATCTTTACC 
GATAAGCAAC AAGAAAGCGT GATGGAGATT ACCTCCGACG TTGACACGCG TTCAGATGCG GCAGGCCACG GACGTGATAT TGAAATGCGC GCCGACGGTG AAGTGGCAGT TGATGCGGGG GTAGACACGC AGTGGGGCGC ACTGATGGCT GACAGCAGCG GGCAGCATCA GGATGAGGGT AGCACATTGA CTAAAACGGG GGCTGGTACA CTGGAGCTGA CCGCCAGCGG TACAACGCAG TCGGCGGTAC GTGTCGAAGA AGGCACCCTG AAAGGTGATG TTGCTGATAT CCTTCCTTAT GCTTCGTCAC TGTGGGTTGG TGATGGGGCA ACGTTCGTTA CTGGCGCGGA TCAGGATATT CAGTCAATTG ATGCTATTTC CAGCGGCACT ATCGACATCA GCGATGGTAC GGTTTTGCGC CTGACCGGGC AGGATACTTC CGTCGCCCTT AATGCCTCAC TATTTAACGG CGATGGGACG CTGGTGAATG CCACCGATGG TGTGATGTTG ACAGGTGAGC TTAATACCAA CCTTGAAACT GACAGCCTGA CTTATCTTTC CAACGTGACG GTTAATGGCA ATCTGACCAA TACGTCCGGT GCGGTTAGCC TGCAAAATGG CGTCGCTGGC GATACGCTGA CGGTAAACGG TGATTATACC GGCGGCGGTA CGCTACTGCT CGATAGCGAA TTAAACGGCG ATGACTCGGT AAGCGATCAA TTGGTGATGA ACGGTAATAC TGCTGGCAAC ACAACTGTGG TGGTTAACTC CATTACAGGG ATTGGTGAGC CGACATCGAC AGGCATTAAA GTGGTTGATT TCGCAGCTGA TCCCACGCAG TTTCAAAACA ATGCGCAGTT CAGTCTGGCA GGCAGCGGCT ACGTCAATAT GGGAGCGTAT GACTACACGC TGGTGGAAGA TAACAACGAC TGGTATCTGC GATCGCAAGA AGTAACGCCG CCATCGCCAC CTGATCCAGA CCCGACTCCC GATCCTGATC CCACGCCGGA TCCTGATCCA ACACCCGACC CGGAACCTAC GCCTGCTTAC CAGCCGGTGT TGAATGCCAA AGTTGGCGGT TATCTCAATA ACCTGCGGGC GGCAAATCAG GCGTTTATGA TGGAGCGACG CGATCACGCA GGTGGCGATG GTCAGACGCT GAATTTACGT GTTATCGGCG GAGATTATCA TTACACAGCA GCGGGGCAAC TGGCTCAACA TGAAGACACT TCTACGGTGC AACTTAGCGG CGATCTGTTT AGCGGGCGCT GGGGCACGGA TGGCGAGTGG ATGCTTGGGA TTGTTGGTGG CTACAGCGAT AACCAGGGCG ACAGCCGCTC GAATATGACC GGAACTCGCG CCGATAACCA GAACCACGGT TATGCCGTTG GGCTGACATC AAGCTGGTTT CAGCACGGTA ATCAGAAGCA AGGGGCCTGG CTGGATAGCT GGCTGCAATA CGCGTGGTTT AGCAATGATG TTTCCGAACA AGAAGATGGC ACAGATCATT ACCACTCGTC GGGGATTATC GCCTCGCTGG AGGCGGGGTA TCAGTGGTTA CCGGGGCGTG GTGTGGTGAT TGAACCGCAG GCGCAGGTGA TTTATCAGGG CGTGCAGCAG GATGATTTTA CCGCCGCTAA CCGTGCGCGC GTGTCACAAT CGCAGGGTGA TGATATTCAG ACGCGGCTGG GTTTACACAG CGAATGGCGT ACCGCTGTTC ATGTCATACC AACATTAGAT CTGAATTATT ATCACGATCC CCATTCGACG GAAATTGAAG AGGATGGCAG CACTATCAGT GACGATGCGG 
TGAAGCAACG GGGTGAAATA AAAGTGGGAG TCACGGGCAA TATCAGTCAG CGAGTGTCCC TGCGCGGCAG CGTGGCGTGG CAGAAAGGGA GTGATGATTT TGCCCAGACG GCAGGGTTTT TGTCGATGAC GGTGAAATGG TAA
|
Protein sequence | MRIIFLRKEY LSLLPSMIAS LFSANGVAAV TDSCQGYDVK ASCQASRQSL SGITQDWSIA DGQWLVFSDM TNNASGGAVF LQQGAEFSLL PENETGMTLF ANNTVTGEYN NGGAIFAKEN STLNLTDVIF SGNVAGGYGG AIYSSGTNDT GAVDLRVTNA MFRNNIANDG KGGAIYTINN DVYLSDVIFD NNQAYTSTSY SDGDGGAIDV TDNNSDSKHP SGYTIVNNTA FTNNTAEGYG GAIYTNSVTA PYLIDISVDD SYSQNGGVLV DENNSAAGYG DGPSSAAGGF MYLGLSEVTF DIADGKTLVI GNTENDGAVD SIAGTGLITK TGSGDLVLNA DNNDFTGEMQ IENGEVTLGR SNSLMNVGDT HCQDDPQDCY GLTIGSIDQY QNQAELNVGS TQQTFVHALT GFQNGTLNID AGGNVTVNQG SFAGIIEGAG QLTIAQNGSY VLAGAQSMAL TGDIVVDDGA VLSLEGDAAD LTALQDDPQS IVLNGGVLDL SDFSTWQSGT SYNDGLEVSG SSGTVIGSQD VVDLAGGDNL HIGGDGKDGV YVVVDASDGQ VSLANNNSYL GTTQIASGTL MVSDNSQLGD THYNRQVIFT DKQQESVMEI TSDVDTRSDA AGHGRDIEMR ADGEVAVDAG VDTQWGALMA DSSGQHQDEG STLTKTGAGT LELTASGTTQ SAVRVEEGTL KGDVADILPY ASSLWVGDGA TFVTGADQDI QSIDAISSGT IDISDGTVLR LTGQDTSVAL NASLFNGDGT LVNATDGVML TGELNTNLET DSLTYLSNVT VNGNLTNTSG AVSLQNGVAG DTLTVNGDYT GGGTLLLDSE LNGDDSVSDQ LVMNGNTAGN TTVVVNSITG IGEPTSTGIK VVDFAADPTQ FQNNAQFSLA GSGYVNMGAY DYTLVEDNND WYLRSQEVTP PSPPDPDPTP DPDPTPDPDP TPDPEPTPAY QPVLNAKVGG YLNNLRAANQ AFMMERRDHA GGDGQTLNLR VIGGDYHYTA AGQLAQHEDT STVQLSGDLF SGRWGTDGEW MLGIVGGYSD NQGDSRSNMT GTRADNQNHG YAVGLTSSWF QHGNQKQGAW LDSWLQYAWF SNDVSEQEDG TDHYHSSGII ASLEAGYQWL PGRGVVIEPQ AQVIYQGVQQ DDFTAANRAR VSQSQGDDIQ TRLGLHSEWR TAVHVIPTLD LNYYHDPHST EIEEDGSTIS DDAVKQRGEI KVGVTGNISQ RVSLRGSVAW QKGSDDFAQT AGFLSMTVKW
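The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence; the helper names are hypothetical, and it assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence in this record.
print(gc_fraction("ATGCGGATTA TCTTTCTACG"))  # 0.4
```

Applying `gc_fraction` to the full Gene sequence field and rounding to a whole percentage would be expected to give a value close to the GC content listed in the record, although the database's exact rounding rule is not stated here.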