Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1662 |
Symbol | |
ID | 6142868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1651535 |
End bp | 1656415 |
Gene Length | 4881 bp |
Protein Length | 1626 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616538 |
Product | pertactin family protein |
Protein accession | YP_001743716 |
Protein GI | 170684085 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR02601] autotransporter-associated beta strand repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.459563 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGAA TCTATCGCGT GATATGGAAT TGCACCCTAC AGGTATTTCA GGCCTGCTCG GAATTAACTC GCAGGGTAGG TAAAACATCG ACGGTTAATT TGCGTAAATC CTCTGGATTG ACAACGAAAT TCAGTAGATT GACGCTGGGT GTTTTGCTGG CACTAAGCGG TTCAGCGTCT GGTGCGAGTC TGGAAGTTGA TAATGGTCAG ATTACCAATA TTAATACTGA TGTTGCTTAC GATGCCTACC TGGTTGGGTG GTATGGCACT GGAGTGCTTA ATATTTTGGC TGGCGGTAAT GCCTCCTTAA CCACTATTAC TACCAGCGTC ATTGGTGGTA ATGAGGACTC GGAGGGCACC GTTAATGTTT TGGGTGGCAC CTGGCGATTG TATGATAGCG GAAATAATGC AAGGCCTTTA AATGTGGGTC AATCCGGAAC GGGGACGCTG AATATTAAAC AGAAGGGTCA CGTTGACGGA GGATATTTAA GGATAGGTTC TTCGACAGGA GGCGTCGGGA CGGTTAATGT TGAGGGAGAG GACTCGGTTC TGACGACCGA ATTATTCGAA ATAGGGAGTT ACGGTACGGG CTCATTAAAT ATTACGGATA AGGGTTACGT CACGAGTTCA ATAGTCGCCA CTGTAGGATA TCAAGCGAAC AGTAATGGTA AGGTTGTCGT TGAAAAGGGC GGTGAGTGGC TAATAAAAAA TAACGATTCC TCAATTGAAT TTCAAATTGG TAATCAAGGA ACTGGGGAAG CGACTATTCG CGAGGGTGGA CTGATTACGG CTGAAAATAC GATTATCGGA GGCAATGCCA CCGGTATCGG AACCCTGAAT GTGCAGGATC AAGACTCTGT CATCACGGTA CGCAGACTCT ATAATGGATA TTTCGGTAAT GGCAAAGTCA ATATTTCCAA TAATGGACTG ATTAATAACA AAGAATATTC ATTGGTGGGC GTTCAGGACG GTTCCCACGG TGTCGTCAAC GTGACCGATA AAGGGCATTG GAATTTCCTC GGAACGGGCG AAGCTTTCCG CTATATCTAT ATCGGTGATG CTGGCGACGG TGAACTTAAT GTCTCTAGTG AAGGCAAAGT AGATTCGGGA ATTATCACTG CGGGGATGAA AGAAACAGGC ACAGGCAACA TTACTGTTAA GGATAAGAAC TCCGTTATCA CTAATCTCGG AACTAATCTT GGTTATGACG GCCACGGCGA AATGAATATC AGTAATGAGG GGCTTGTTGT CAGCAACGGA GGAAGTTCAC TCGGTTATGG AGAAACCGGC ATCGGGAAGG TTAGCATCAC CACGGGGGGG ATGTGGGAGG TCAATAAGAA TGTCTATACC ACTATTGGTG TTGCGGGCGT CGGAAACCTC AATATCAGCG ATGGCGGTAA GTTCGTATCG CAAAATATTA CTTTTTTGGG CGATAAAGCA AGCGGTATCG GCACACTGAA CCTGATGGAT GCGACATCAT CGTTCGATAC TGTAGGTATC AATGTCGGTA ATTTTGGTAG CGGCATCGTA AATGTCAGTA ATGGTGCCAC CCTTAATTCA ACGGGCTATG GATTTATCGG AGGAAATGCC TCCGGTAAGG GGATAGTTAA TATTTCAACG GATAGTCTCT GGAATTTAAA AACGTCATCT ACTAACGCCC AATTGCTACA GGTCGGTGTA TTAGGTACGG GTGAACTGAA TATTACCACC GGAGGTATAG TTAAAGCGCG TGATACACAG ATAGCTCTGA ATGACAAAAG TAAGGGCGAC GTGCGGGTGG ATGGGCAGAA CTCTCTTCTT GAAACATTCA ATATGAACGT AGGGACGACT GGCACAGGTA CGTTAACCCT GACGAATAAC GGCACGCTGA ATGTCGAAGG TGGAGAAGTT TACTTAGGTG TTTTTGAGCC TGCTGTAGGA ACGCTAAACA TTGGTGCTGC TCACGGTGAG GTGGCGGCAG ATGCCGGGTT TATTACCAAT GCGACGAAAG TGGAGTTTGG TCTTGGCGAA GGCGTTTTTG TCTTTAATCA TACCAATAAC AGTGATGCCG GCTACCAGGT CGATATGCTG ATTACGGGTG ACGATAAAGA CGGAAAAGTG ATGCATGATG CAGGCCATAC GGTGTTCAAT GCAGGGAATA CTTATAGCGG TAAAACGCTG GTCAATGACG GCCTCCTGAC TATTGCGTCA CATACGGCAG ATGGGGTAAC AGGCATGGGT TCGAGTGAAG TCACCATTGC AAGCCCCGGT ACGCTCGACA TTCTCGCATC AACGAACAGC GCAGGAGATT ACACGCTGAC CAATGCGCTC AAAGGCGATG GCTTGATGCG AGTGCAGCTG TCATCCTCCG ACAAGATGTT TGGCTTTACC CATGCAACAG GGACTGAATT CGCCGGTGTT GCCCAACTGA AAGACAGTAC CTTCACTCTG GAACGCGACA ACACCGCTGC GCTTACTCAC GCGATGTTGC AGTCTGACAT TGAAAATACC ACATCGGTAA ACGTAGGAGA GCAATCCATT GGTGGACTGG CCATGAATGG CGGTACGCTC ATTTTCGATA CGGATATTCC TGCTGCGACG CTTGCAGAGG GATATATCAG CGTCGATACG CTGGTTGTCG GCGCGAGTGA CTACACCTGG AAAGGCCGTA ACTATCAGGT AAACGGGACG GGCGACGTGC TTATCGACGT GCCTAAACCG TGGAATGATC CCATGGCGAA TAACCCTCTG ACGACGCTCA ATTTGCTGGA ACACGATGAT AACCATGTCG GCGTTCAACT GGTGAAGGCG CAAACGGTTA TTGGGTCGGG TGGCTCATTA ACGTTACGTG ATTTACAGGG CGACGAGGTG GAAGCGGACA AAACGTTACA CATTGCGCAA AACGGAACGG TGGTCGCTGA GGGTGATTAT GGATTCCGCC TCACGACCGC ACCAGGTGAT GGTTTGTACG TTAACTATGG GCTGAAAGCG CTGAACATCC ATGGTGGGCA AAAGCTGACA TTAGCCGAAC ATGGCGGAGC CTATGGCGCA ACGGCCGATA TGTCGGCAAA AATCGGTGGT GAAGGGGATC TGGCAATCAA TACGGTGCGA CAGGTTTCGC TTTCCAACGG TCAGAACGAC TATCAGGGGG CAACCTACGT TCAGATGGGG ACATTACGTA CCGATGCGGA TGGCGCGCTG GGCAACACCC GGGAACTGAA CATCAGCAAC GCGGCCATCG TCGATCTTAA TGGATCGACG CAGACGGTAG AGACATTCAC CGGGCAGATG GGTTCGACTG TTTTGTTCAA AGAAGGGTCG CTGACGGTAA ATAAAGGTGG GATCAGTCAG GGTGAACTGA CAGGTGGCGG AAACCTGAAT GTTACAGGGG GAACGCTGGC TATCGAGGGG CTTAATGCAC GCTACAATGC GTTAACCAGC GTTAGCCCAA ATGCGGAAGT CAGCCTCGAT AATACGCAGG GGTTAGGCAG AGGAAATATT GCCAATGACG GTCTGTTAAC GCTAAAAAAC GTGACCGGCG AACTGCGTAA TAGCATAAGC GGGAAGGGTA TCGTGAGCGC AACCGCCAGG ACAGACGTAG AGCTGGATGG CGATAATAGC CGCTTTGTGG GGCAATTCAA CATTGATACA GGCAGCGCGC TCAGCGTCAA CGAGCAGAAA AACCTGGGGG ATGCTTCCGT TATCAATAAT GGCCTGCTCA CCATCGCCAC TGAGCGTAGC TGGGCGATGA CGCACAGTAT CAGCGGTAGC GGTGATGTGA CAAAACTGGG TACCGGGATC CTGACTCTTA ACAAAGATTC CGCGGCGTAT CAGGGTACGA CGGATATCGT GGGTGGTGAA ATTGCTTTCG GTTCCGACTC TGCCATTAAT ATGGCAAGTC AGCACATTAA TATCCATAAC AGCGGTGTGA TGTCGGGAAA TGTCACCACT GCAGGTGATG TGAACGTTAT GCCAGGGGGG ACACTGCGTG TCGCTAAAAC CACTGTCGGC GGTAACCTGG AGAATGGTGG CACGGTTCAA ATGAACAGCG AAGGGGGGAA ACCGGGGAAT GTACTGACCG TTAACGGCAA CTATACCGGA AACAATGGCC TGATGACGTT CAACGCGACG CTGGGCGGCG ATAATTCACC CACGGATAAA ATGAACGTGA AAGGCGATAC CCAAGGGAAC ACTCGCGTTC GGGTTGATAA CATTGGCGGC GTCGGTGCAC AAACGGTCAA CGGTATTGAA CTCATTGAGG TTGGCGGTAA TTCTGCAGGT AACTTCGCGC TGACCACCGG AACTGTCGAA GCTGGGGCTT ACGTCTACAC GCTGGCTAAA GGGAAGGGGA ATGACGAGAA AAACTGGTAT CTGACCAGTA AATGGGACGG CGTAACGCCA GCGGATACAC CCGATCCCAT CAATAATCCC CCTGTTGTGG ATCCGGAAGG CCCATCAGTT TATCGCCCGG AGGCCGGAAG CTATATCAGC AACATTGCCG CAGCCAACTC GCTGTTTAGC CATCGCTTAC ACGACCGTCT GGGTGAGCCG CAGTATACAG ATTCACTGCG AGTTCAGGAT TCGGCAACCA GTATGTGGAT GCGTCATGTC GGAGGGCACG AACGTTTCAG GACCGGTGAT GGTCAGCTAA ATACTCAGGC TAACCGCTAT GTATTGCAGC TAGGCGGCGA TTTGGCGCAG TGGAGTAGCA CGCAGGATCG CTGGCATCTC GGCGTCATGG CAGGCTATGC CAATCAGCAC AGTAATACTC AGAGTAATCA TGTGGGTTAT AAATCGGATG GGCGCATCAG CGGTTACAGC GCAGGACTGT ACGCGACCTG GTATCAGAAC GATGCGAATA TAACAACCTT AGCCTGTGGG GGAATGTCGG TGTGCAACTA G
|
Protein sequence | MNRIYRVIWN CTLQVFQACS ELTRRVGKTS TVNLRKSSGL TTKFSRLTLG VLLALSGSAS GASLEVDNGQ ITNINTDVAY DAYLVGWYGT GVLNILAGGN ASLTTITTSV IGGNEDSEGT VNVLGGTWRL YDSGNNARPL NVGQSGTGTL NIKQKGHVDG GYLRIGSSTG GVGTVNVEGE DSVLTTELFE IGSYGTGSLN ITDKGYVTSS IVATVGYQAN SNGKVVVEKG GEWLIKNNDS SIEFQIGNQG TGEATIREGG LITAENTIIG GNATGIGTLN VQDQDSVITV RRLYNGYFGN GKVNISNNGL INNKEYSLVG VQDGSHGVVN VTDKGHWNFL GTGEAFRYIY IGDAGDGELN VSSEGKVDSG IITAGMKETG TGNITVKDKN SVITNLGTNL GYDGHGEMNI SNEGLVVSNG GSSLGYGETG IGKVSITTGG MWEVNKNVYT TIGVAGVGNL NISDGGKFVS QNITFLGDKA SGIGTLNLMD ATSSFDTVGI NVGNFGSGIV NVSNGATLNS TGYGFIGGNA SGKGIVNIST DSLWNLKTSS TNAQLLQVGV LGTGELNITT GGIVKARDTQ IALNDKSKGD VRVDGQNSLL ETFNMNVGTT GTGTLTLTNN GTLNVEGGEV YLGVFEPAVG TLNIGAAHGE VAADAGFITN ATKVEFGLGE GVFVFNHTNN SDAGYQVDML ITGDDKDGKV MHDAGHTVFN AGNTYSGKTL VNDGLLTIAS HTADGVTGMG SSEVTIASPG TLDILASTNS AGDYTLTNAL KGDGLMRVQL SSSDKMFGFT HATGTEFAGV AQLKDSTFTL ERDNTAALTH AMLQSDIENT TSVNVGEQSI GGLAMNGGTL IFDTDIPAAT LAEGYISVDT LVVGASDYTW KGRNYQVNGT GDVLIDVPKP WNDPMANNPL TTLNLLEHDD NHVGVQLVKA QTVIGSGGSL TLRDLQGDEV EADKTLHIAQ NGTVVAEGDY GFRLTTAPGD GLYVNYGLKA LNIHGGQKLT LAEHGGAYGA TADMSAKIGG EGDLAINTVR QVSLSNGQND YQGATYVQMG TLRTDADGAL GNTRELNISN AAIVDLNGST QTVETFTGQM GSTVLFKEGS LTVNKGGISQ GELTGGGNLN VTGGTLAIEG LNARYNALTS VSPNAEVSLD NTQGLGRGNI ANDGLLTLKN VTGELRNSIS GKGIVSATAR TDVELDGDNS RFVGQFNIDT GSALSVNEQK NLGDASVINN GLLTIATERS WAMTHSISGS GDVTKLGTGI LTLNKDSAAY QGTTDIVGGE IAFGSDSAIN MASQHINIHN SGVMSGNVTT AGDVNVMPGG TLRVAKTTVG GNLENGGTVQ MNSEGGKPGN VLTVNGNYTG NNGLMTFNAT LGGDNSPTDK MNVKGDTQGN TRVRVDNIGG VGAQTVNGIE LIEVGGNSAG NFALTTGTVE AGAYVYTLAK GKGNDEKNWY LTSKWDGVTP ADTPDPINNP PVVDPEGPSV YRPEAGSYIS NIAAANSLFS HRLHDRLGEP QYTDSLRVQD SATSMWMRHV GGHERFRTGD GQLNTQANRY VLQLGGDLAQ WSSTQDRWHL GVMAGYANQH SNTQSNHVGY KSDGRISGYS AGLYATWYQN DANITTLACG GMSVCN
|
| |