Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2384 |
Symbol | |
ID | 6144679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2427935 |
End bp | 2431699 |
Gene Length | 3765 bp |
Protein Length | 1254 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617257 |
Product | adhesin |
Protein accession | YP_001744429 |
Protein GI | 170679991 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01376] Chlamydial polymorphic outer membrane protein repeat [TIGR01414] outer membrane autotransporter barrel domain [TIGR02601] autotransporter-associated beta strand repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.985186 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATTA TTTTTCTACG CAAGGAGTAT TTATCTTTAC TCCCGTCAAT GATTGCATCT CTTTTCTCTG TTAACAGTGT CGCGGAGGTT TTGGATTCAT GCCAGGGATA TGATATCAAA GCGAGTTGTC AGGCCAGCAG GCAAAGCCTT TCAGGCATTA CGCAGGACTG GAGCATCGCC GATGGGCAAT GGGTGATTTT TTCGGGTATG GCCAATAATG CCAGCGGTGG TGCCGTATTT TTGCAGCAAA GTGCTGAATT TACGATATCA CCACAAAATG AAACAGGGAT GACCCTGTTT GCTAATAACT CGATTAGTGG CGAATATAAT AATGGCGGAG CTATCTTTGC TAAAGAAAAC TCAACGATAA ATCTTGCGAA TGTTATTTTC GACAGTAACG TTGCAGGAGG CTATGGCGGA GCAATCTATT CTGCGGGAAC AAACGATGCC GGTGATATGG ATTTAAATAT CACTAATGCA ATATTTGCGA ATAATATCGC CAATGATGGT AAAGGTGGCG CTATTTATTC TATCAATAAT GATATTTATT TAAGTGATGT CATTTTTGAT AATAATCAGG CATATACATC AACCAGCTAT AGTGATGGGG ACGGCGGTGC CATTGATGTT ACGGATAATA GTACGGATAA CACACATCTT TCAGGAAAAA CGATAATTAA TAATACCTCC TTTACCAATA ACTATGCAGA AGGTTATGGC GGAGCAATTT ATACCAGCAG CACCACATCC CCGTATCTTA TTGATATTTC TGTTGATGAT AATTATGACC AAAACAATGG TGTGATGATT GATGAAAATA ACAGCGCCTC GGGATATGAC CATTCGGCGA CGGCAGCGGC GGGGGGCTTT ATGTATATCG GTCATAGTGT GGCTGAATTT AACATTGCCG CCGATAAAAC GCTGGTAATA GGCAATACCA GCAACGATGG CGCGATTGAT TCTCTTGCCG GAACGGGAGT CATCGTGAAA GAGGGAGCGG GAGAGTTAGT CCTCAATGCC GACAATAATG CGTTTACCGG CGAGATGAGT ATCCAGAATG GCGAGGTGAC GCTGGGGCGC AGCGATGAGT TAATGAATGT CGGCGACACG CACTGTCAGA GCGATCCGCA GGATTGCTTT GGTCTGATGG TCGGCAGCAC TGTTCATTCT GAGTATCAGG CAGAACTGAA TGTTGGCAAT ACACAACAAA CGTTTGTGCA CTCATTAACC GGTTTTGCTA ATGGCATTTT GAATATCGAC GCAGGCGGCA ATGTTACCGT AAACCAGGGC GGTTTTTCCG GCTCGATTCA GGGTGAAGGA CAGTTGACGG TAGCGCAAGA TGGCAGCTAT CTCCTGACAG GGGAGCAGTC GATGGCATTA ACCGGCGATA TTGTGGTGGA AGATAACGCC GTATTGTCGC TAGCGGGGAA TCAGGCCGAT TTACGGGCAA TGCAGAGCGA TCCGCAATCG ATCGTGCTAA ATGGCGGAGT ATTGGATCTC TCTGATTTCA CCACTTGGGA TGGCGATAGC TCATACAATG ACGGTCTGCA AATCAGCGGC AGTGGGGGGA CAGTGATTGG CAGTAATGAT GTCGTTGATA TCAGCAGTGG CGATGATTTA CATATTGGCG GCAGTGATGC CAGCCAGAAT GGCGTTTATG TCGTCATTAA TGCGGGGGAT CAGCGCGTAA CGCTGGCGAA TAACAACGGT TATCTCGGTA ATACGCAAAT CGCCTCCGGT ACGCTTGAAG TCAGCGATAA TTCGCAATTG GGGGACACAA GTTACAACCG TTCAGTGATC TTCACAGATC CCCAACAACA CAGTGAAATG GATGTGACGA CTGATGTTGA CACTCGCTCG GCTACAACAG GTCAGGGCAG GAATATTGAA ATGCGCGCTG ATGGCGAAAT ACACGTTGAG GATGGCGTAG ATACGCAATG GGGCGGGTTG ATGGCGGATA GTACCGGGCA ACAACTGGAT AGCGTAAGCA CGTTGACTAA AAGTGGCGGC GGGACGCTGG AGTTGACCGC CAGCGGCACT GCGACGTCTG CGGTACGTGT GGAGGACGGA ACGCTAAAAG GTGAAGCGGA GAATATTATT CCTTATGTTT CTTCACTGTG GGTGGGGGAA GACGGTGTTT TTGAAACCGG GAAAAATCAG GATATTCGTT CAATCGATGC CACCTCTGGC GGCGATATCG ACATAACCGA TGGTACTGTG TTGCGATTGA CGCAGCAGGA TACGAACCAG GCGCTGGATG CCTCGCTGTT TAGCGGCGAC GGTACGTTGG TGAATGCCAC CGATGGTGTG ACGCTGACAG GCGAGCTTAA TACCAACCTT GAAACTGACA GCCTGACTTA TCTTTCAGAC GTGACGGTCA ATGGTGATCT AACTAATACT TCCGGTGCTG TCAGTCTGCA AAATGGCGTT GCTGGCGACA CGCTCACGGT AAACGGTGAT TACACCGGTG GCGGTACGTT GTTTCTCGAC AGCGAATTAA ACGGCGATGA CTCGGCAAGC GACCAACTGG TGTTGAACGG TAATACTGCT GGCAACACGA CCGTGGTAAT TAATCCCATT ACGGGTATTG GTGAGCCGAC ATCTACGGGC ATTAAAGTGG TTGATTTCGC AGCCGATCCA ACGCAATTTC AAAACAATGC GCAGTTCAGT CTGGCGGGCA GCGGCTACGT CAATATGGGA GCGTATGACT ACACGCTGGT GGAAGATAAC AACGACTGGT ATCTGCGATC GCAAGAAGTA ACGCCACCAT CGCCACCTGA TCCAGACCCG ACTCCAGATC CTGATCCTAC GCCGGATCCT GATCCAACAC CCGACCCGGC TCCAGCCCCT ACGCCTGCCT ACCAGCCGGT GCTGAATGCC AAAGTTGGTG GCTATCTCAA TAACCTACGT GCGGCAAATC AGGCATTTGT GATGGAGCGA CGCGATCACG CTGGTGGCGA TGGTCAGACG CTGAATTTAC GTGTTATCGG CGGAGATTAT CATTACACAG CAGCGGGGCA ACTGGCTCAG CATGAAGACA CTTCTACGGT GCAGCTTAGC GGCGACTTGT TTAGCGGGCG CTGGGGCATG GATGGCGAGT GGATGCTTGG GGCGGTTGGT GGCTACAGTG ATAACCAGGG CGACAGCCGC TCGAATATGA CCGGAACTCG CGCCGATAAC CAGAACCACG GTTATGCCGT TGGTCTGACC TCAAGCTGGT TCCAGCACGG TAATCAGAAG CAGGGGGCCT GGCTGGATAG CTGGCTGCAA TACGCGTGGT TTAACAATGA TGTTTCCGAA CATGAAGATG GCGCGGATCA TTACCACTCA TCGGGGATTA TCGCCTCGCT GGAAGCCGGG TATCAGTGGT TACCGGGGCA TGGAGTGGTG ATTGAACCAC AGGCGCAGGT GATTTATCAG GGCGTGCAGC AGGATGATTT TACCGCCGCT AACCATGCGC GGGTGTCACA ATCGCAGGGT GATGATATTC AAACGCGGCT GGGTTTACAC AGCGAATGGC GTACCGCGGT TGGTGTCATA CCAACATTAG ATCTGAATTA TTATCACGAT CCCCATGCGA CGGAAATTGA AGAAGATGGC AGCACTATCA GTGACGATGC GGTGAAGCAA CGGGGTGAAA TAAAAGTGGG AATAACGGGC AATATCAGTC AGCGAGTTTC GCTGCGTGGT AGCGTGGCGT GGCAGAAAGG GAGTGATGAT TTTGCCCAGA CGGCAGGGTT TTTGTCGATG ACAGTGAAAT GGTAA
|
Protein sequence | MRIIFLRKEY LSLLPSMIAS LFSVNSVAEV LDSCQGYDIK ASCQASRQSL SGITQDWSIA DGQWVIFSGM ANNASGGAVF LQQSAEFTIS PQNETGMTLF ANNSISGEYN NGGAIFAKEN STINLANVIF DSNVAGGYGG AIYSAGTNDA GDMDLNITNA IFANNIANDG KGGAIYSINN DIYLSDVIFD NNQAYTSTSY SDGDGGAIDV TDNSTDNTHL SGKTIINNTS FTNNYAEGYG GAIYTSSTTS PYLIDISVDD NYDQNNGVMI DENNSASGYD HSATAAAGGF MYIGHSVAEF NIAADKTLVI GNTSNDGAID SLAGTGVIVK EGAGELVLNA DNNAFTGEMS IQNGEVTLGR SDELMNVGDT HCQSDPQDCF GLMVGSTVHS EYQAELNVGN TQQTFVHSLT GFANGILNID AGGNVTVNQG GFSGSIQGEG QLTVAQDGSY LLTGEQSMAL TGDIVVEDNA VLSLAGNQAD LRAMQSDPQS IVLNGGVLDL SDFTTWDGDS SYNDGLQISG SGGTVIGSND VVDISSGDDL HIGGSDASQN GVYVVINAGD QRVTLANNNG YLGNTQIASG TLEVSDNSQL GDTSYNRSVI FTDPQQHSEM DVTTDVDTRS ATTGQGRNIE MRADGEIHVE DGVDTQWGGL MADSTGQQLD SVSTLTKSGG GTLELTASGT ATSAVRVEDG TLKGEAENII PYVSSLWVGE DGVFETGKNQ DIRSIDATSG GDIDITDGTV LRLTQQDTNQ ALDASLFSGD GTLVNATDGV TLTGELNTNL ETDSLTYLSD VTVNGDLTNT SGAVSLQNGV AGDTLTVNGD YTGGGTLFLD SELNGDDSAS DQLVLNGNTA GNTTVVINPI TGIGEPTSTG IKVVDFAADP TQFQNNAQFS LAGSGYVNMG AYDYTLVEDN NDWYLRSQEV TPPSPPDPDP TPDPDPTPDP DPTPDPAPAP TPAYQPVLNA KVGGYLNNLR AANQAFVMER RDHAGGDGQT LNLRVIGGDY HYTAAGQLAQ HEDTSTVQLS GDLFSGRWGM DGEWMLGAVG GYSDNQGDSR SNMTGTRADN QNHGYAVGLT SSWFQHGNQK QGAWLDSWLQ YAWFNNDVSE HEDGADHYHS SGIIASLEAG YQWLPGHGVV IEPQAQVIYQ GVQQDDFTAA NHARVSQSQG DDIQTRLGLH SEWRTAVGVI PTLDLNYYHD PHATEIEEDG STISDDAVKQ RGEIKVGITG NISQRVSLRG SVAWQKGSDD FAQTAGFLSM TVKW
|
| |