Gene EcSMS35_1662 details


Gene Information

Locus tag: EcSMS35_1662
Symbol:
ID: 6142868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1651535
End bp: 1656415
Gene Length: 4881 bp
Protein Length: 1626 aa
Translation table: 11
GC content: 50%
IMG OID: 641616538
Product: pertactin family protein
Protein accession: YP_001743716
Protein GI: 170684085
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR02601] autotransporter-associated beta strand repeat
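
The start and end coordinates, the gene length, and the protein length listed in this record agree with one another, and the arithmetic can be sketched as below. This is a minimal illustration, assuming 1-based inclusive coordinates and a gene length that includes a single stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1651535, 1656415)
print(length)                     # 4881
print(protein_length_aa(length))  # 1626
```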


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.459563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGAA TCTATCGCGT GATATGGAAT TGCACCCTAC AGGTATTTCA GGCCTGCTCG 
GAATTAACTC GCAGGGTAGG TAAAACATCG ACGGTTAATT TGCGTAAATC CTCTGGATTG
ACAACGAAAT TCAGTAGATT GACGCTGGGT GTTTTGCTGG CACTAAGCGG TTCAGCGTCT
GGTGCGAGTC TGGAAGTTGA TAATGGTCAG ATTACCAATA TTAATACTGA TGTTGCTTAC
GATGCCTACC TGGTTGGGTG GTATGGCACT GGAGTGCTTA ATATTTTGGC TGGCGGTAAT
GCCTCCTTAA CCACTATTAC TACCAGCGTC ATTGGTGGTA ATGAGGACTC GGAGGGCACC
GTTAATGTTT TGGGTGGCAC CTGGCGATTG TATGATAGCG GAAATAATGC AAGGCCTTTA
AATGTGGGTC AATCCGGAAC GGGGACGCTG AATATTAAAC AGAAGGGTCA CGTTGACGGA
GGATATTTAA GGATAGGTTC TTCGACAGGA GGCGTCGGGA CGGTTAATGT TGAGGGAGAG
GACTCGGTTC TGACGACCGA ATTATTCGAA ATAGGGAGTT ACGGTACGGG CTCATTAAAT
ATTACGGATA AGGGTTACGT CACGAGTTCA ATAGTCGCCA CTGTAGGATA TCAAGCGAAC
AGTAATGGTA AGGTTGTCGT TGAAAAGGGC GGTGAGTGGC TAATAAAAAA TAACGATTCC
TCAATTGAAT TTCAAATTGG TAATCAAGGA ACTGGGGAAG CGACTATTCG CGAGGGTGGA
CTGATTACGG CTGAAAATAC GATTATCGGA GGCAATGCCA CCGGTATCGG AACCCTGAAT
GTGCAGGATC AAGACTCTGT CATCACGGTA CGCAGACTCT ATAATGGATA TTTCGGTAAT
GGCAAAGTCA ATATTTCCAA TAATGGACTG ATTAATAACA AAGAATATTC ATTGGTGGGC
GTTCAGGACG GTTCCCACGG TGTCGTCAAC GTGACCGATA AAGGGCATTG GAATTTCCTC
GGAACGGGCG AAGCTTTCCG CTATATCTAT ATCGGTGATG CTGGCGACGG TGAACTTAAT
GTCTCTAGTG AAGGCAAAGT AGATTCGGGA ATTATCACTG CGGGGATGAA AGAAACAGGC
ACAGGCAACA TTACTGTTAA GGATAAGAAC TCCGTTATCA CTAATCTCGG AACTAATCTT
GGTTATGACG GCCACGGCGA AATGAATATC AGTAATGAGG GGCTTGTTGT CAGCAACGGA
GGAAGTTCAC TCGGTTATGG AGAAACCGGC ATCGGGAAGG TTAGCATCAC CACGGGGGGG
ATGTGGGAGG TCAATAAGAA TGTCTATACC ACTATTGGTG TTGCGGGCGT CGGAAACCTC
AATATCAGCG ATGGCGGTAA GTTCGTATCG CAAAATATTA CTTTTTTGGG CGATAAAGCA
AGCGGTATCG GCACACTGAA CCTGATGGAT GCGACATCAT CGTTCGATAC TGTAGGTATC
AATGTCGGTA ATTTTGGTAG CGGCATCGTA AATGTCAGTA ATGGTGCCAC CCTTAATTCA
ACGGGCTATG GATTTATCGG AGGAAATGCC TCCGGTAAGG GGATAGTTAA TATTTCAACG
GATAGTCTCT GGAATTTAAA AACGTCATCT ACTAACGCCC AATTGCTACA GGTCGGTGTA
TTAGGTACGG GTGAACTGAA TATTACCACC GGAGGTATAG TTAAAGCGCG TGATACACAG
ATAGCTCTGA ATGACAAAAG TAAGGGCGAC GTGCGGGTGG ATGGGCAGAA CTCTCTTCTT
GAAACATTCA ATATGAACGT AGGGACGACT GGCACAGGTA CGTTAACCCT GACGAATAAC
GGCACGCTGA ATGTCGAAGG TGGAGAAGTT TACTTAGGTG TTTTTGAGCC TGCTGTAGGA
ACGCTAAACA TTGGTGCTGC TCACGGTGAG GTGGCGGCAG ATGCCGGGTT TATTACCAAT
GCGACGAAAG TGGAGTTTGG TCTTGGCGAA GGCGTTTTTG TCTTTAATCA TACCAATAAC
AGTGATGCCG GCTACCAGGT CGATATGCTG ATTACGGGTG ACGATAAAGA CGGAAAAGTG
ATGCATGATG CAGGCCATAC GGTGTTCAAT GCAGGGAATA CTTATAGCGG TAAAACGCTG
GTCAATGACG GCCTCCTGAC TATTGCGTCA CATACGGCAG ATGGGGTAAC AGGCATGGGT
TCGAGTGAAG TCACCATTGC AAGCCCCGGT ACGCTCGACA TTCTCGCATC AACGAACAGC
GCAGGAGATT ACACGCTGAC CAATGCGCTC AAAGGCGATG GCTTGATGCG AGTGCAGCTG
TCATCCTCCG ACAAGATGTT TGGCTTTACC CATGCAACAG GGACTGAATT CGCCGGTGTT
GCCCAACTGA AAGACAGTAC CTTCACTCTG GAACGCGACA ACACCGCTGC GCTTACTCAC
GCGATGTTGC AGTCTGACAT TGAAAATACC ACATCGGTAA ACGTAGGAGA GCAATCCATT
GGTGGACTGG CCATGAATGG CGGTACGCTC ATTTTCGATA CGGATATTCC TGCTGCGACG
CTTGCAGAGG GATATATCAG CGTCGATACG CTGGTTGTCG GCGCGAGTGA CTACACCTGG
AAAGGCCGTA ACTATCAGGT AAACGGGACG GGCGACGTGC TTATCGACGT GCCTAAACCG
TGGAATGATC CCATGGCGAA TAACCCTCTG ACGACGCTCA ATTTGCTGGA ACACGATGAT
AACCATGTCG GCGTTCAACT GGTGAAGGCG CAAACGGTTA TTGGGTCGGG TGGCTCATTA
ACGTTACGTG ATTTACAGGG CGACGAGGTG GAAGCGGACA AAACGTTACA CATTGCGCAA
AACGGAACGG TGGTCGCTGA GGGTGATTAT GGATTCCGCC TCACGACCGC ACCAGGTGAT
GGTTTGTACG TTAACTATGG GCTGAAAGCG CTGAACATCC ATGGTGGGCA AAAGCTGACA
TTAGCCGAAC ATGGCGGAGC CTATGGCGCA ACGGCCGATA TGTCGGCAAA AATCGGTGGT
GAAGGGGATC TGGCAATCAA TACGGTGCGA CAGGTTTCGC TTTCCAACGG TCAGAACGAC
TATCAGGGGG CAACCTACGT TCAGATGGGG ACATTACGTA CCGATGCGGA TGGCGCGCTG
GGCAACACCC GGGAACTGAA CATCAGCAAC GCGGCCATCG TCGATCTTAA TGGATCGACG
CAGACGGTAG AGACATTCAC CGGGCAGATG GGTTCGACTG TTTTGTTCAA AGAAGGGTCG
CTGACGGTAA ATAAAGGTGG GATCAGTCAG GGTGAACTGA CAGGTGGCGG AAACCTGAAT
GTTACAGGGG GAACGCTGGC TATCGAGGGG CTTAATGCAC GCTACAATGC GTTAACCAGC
GTTAGCCCAA ATGCGGAAGT CAGCCTCGAT AATACGCAGG GGTTAGGCAG AGGAAATATT
GCCAATGACG GTCTGTTAAC GCTAAAAAAC GTGACCGGCG AACTGCGTAA TAGCATAAGC
GGGAAGGGTA TCGTGAGCGC AACCGCCAGG ACAGACGTAG AGCTGGATGG CGATAATAGC
CGCTTTGTGG GGCAATTCAA CATTGATACA GGCAGCGCGC TCAGCGTCAA CGAGCAGAAA
AACCTGGGGG ATGCTTCCGT TATCAATAAT GGCCTGCTCA CCATCGCCAC TGAGCGTAGC
TGGGCGATGA CGCACAGTAT CAGCGGTAGC GGTGATGTGA CAAAACTGGG TACCGGGATC
CTGACTCTTA ACAAAGATTC CGCGGCGTAT CAGGGTACGA CGGATATCGT GGGTGGTGAA
ATTGCTTTCG GTTCCGACTC TGCCATTAAT ATGGCAAGTC AGCACATTAA TATCCATAAC
AGCGGTGTGA TGTCGGGAAA TGTCACCACT GCAGGTGATG TGAACGTTAT GCCAGGGGGG
ACACTGCGTG TCGCTAAAAC CACTGTCGGC GGTAACCTGG AGAATGGTGG CACGGTTCAA
ATGAACAGCG AAGGGGGGAA ACCGGGGAAT GTACTGACCG TTAACGGCAA CTATACCGGA
AACAATGGCC TGATGACGTT CAACGCGACG CTGGGCGGCG ATAATTCACC CACGGATAAA
ATGAACGTGA AAGGCGATAC CCAAGGGAAC ACTCGCGTTC GGGTTGATAA CATTGGCGGC
GTCGGTGCAC AAACGGTCAA CGGTATTGAA CTCATTGAGG TTGGCGGTAA TTCTGCAGGT
AACTTCGCGC TGACCACCGG AACTGTCGAA GCTGGGGCTT ACGTCTACAC GCTGGCTAAA
GGGAAGGGGA ATGACGAGAA AAACTGGTAT CTGACCAGTA AATGGGACGG CGTAACGCCA
GCGGATACAC CCGATCCCAT CAATAATCCC CCTGTTGTGG ATCCGGAAGG CCCATCAGTT
TATCGCCCGG AGGCCGGAAG CTATATCAGC AACATTGCCG CAGCCAACTC GCTGTTTAGC
CATCGCTTAC ACGACCGTCT GGGTGAGCCG CAGTATACAG ATTCACTGCG AGTTCAGGAT
TCGGCAACCA GTATGTGGAT GCGTCATGTC GGAGGGCACG AACGTTTCAG GACCGGTGAT
GGTCAGCTAA ATACTCAGGC TAACCGCTAT GTATTGCAGC TAGGCGGCGA TTTGGCGCAG
TGGAGTAGCA CGCAGGATCG CTGGCATCTC GGCGTCATGG CAGGCTATGC CAATCAGCAC
AGTAATACTC AGAGTAATCA TGTGGGTTAT AAATCGGATG GGCGCATCAG CGGTTACAGC
GCAGGACTGT ACGCGACCTG GTATCAGAAC GATGCGAATA TAACAACCTT AGCCTGTGGG
GGAATGTCGG TGTGCAACTA G
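
The GC content reported in the gene information can in principle be recomputed from a sequence laid out in space-separated blocks like the one above. The sketch below is one way to do that, assuming the text contains only A, C, G, T and whitespace; `gc_content` is a hypothetical helper name, and the short input is a toy example, not the gene itself.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases in a whitespace-formatted nucleotide sequence."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


print(gc_content("ATGC ATGC"))  # 0.5
```

Pasting the full gene sequence into the function returns a fraction that can be compared with the rounded percentage in the record.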
 
Protein sequence
MNRIYRVIWN CTLQVFQACS ELTRRVGKTS TVNLRKSSGL TTKFSRLTLG VLLALSGSAS 
GASLEVDNGQ ITNINTDVAY DAYLVGWYGT GVLNILAGGN ASLTTITTSV IGGNEDSEGT
VNVLGGTWRL YDSGNNARPL NVGQSGTGTL NIKQKGHVDG GYLRIGSSTG GVGTVNVEGE
DSVLTTELFE IGSYGTGSLN ITDKGYVTSS IVATVGYQAN SNGKVVVEKG GEWLIKNNDS
SIEFQIGNQG TGEATIREGG LITAENTIIG GNATGIGTLN VQDQDSVITV RRLYNGYFGN
GKVNISNNGL INNKEYSLVG VQDGSHGVVN VTDKGHWNFL GTGEAFRYIY IGDAGDGELN
VSSEGKVDSG IITAGMKETG TGNITVKDKN SVITNLGTNL GYDGHGEMNI SNEGLVVSNG
GSSLGYGETG IGKVSITTGG MWEVNKNVYT TIGVAGVGNL NISDGGKFVS QNITFLGDKA
SGIGTLNLMD ATSSFDTVGI NVGNFGSGIV NVSNGATLNS TGYGFIGGNA SGKGIVNIST
DSLWNLKTSS TNAQLLQVGV LGTGELNITT GGIVKARDTQ IALNDKSKGD VRVDGQNSLL
ETFNMNVGTT GTGTLTLTNN GTLNVEGGEV YLGVFEPAVG TLNIGAAHGE VAADAGFITN
ATKVEFGLGE GVFVFNHTNN SDAGYQVDML ITGDDKDGKV MHDAGHTVFN AGNTYSGKTL
VNDGLLTIAS HTADGVTGMG SSEVTIASPG TLDILASTNS AGDYTLTNAL KGDGLMRVQL
SSSDKMFGFT HATGTEFAGV AQLKDSTFTL ERDNTAALTH AMLQSDIENT TSVNVGEQSI
GGLAMNGGTL IFDTDIPAAT LAEGYISVDT LVVGASDYTW KGRNYQVNGT GDVLIDVPKP
WNDPMANNPL TTLNLLEHDD NHVGVQLVKA QTVIGSGGSL TLRDLQGDEV EADKTLHIAQ
NGTVVAEGDY GFRLTTAPGD GLYVNYGLKA LNIHGGQKLT LAEHGGAYGA TADMSAKIGG
EGDLAINTVR QVSLSNGQND YQGATYVQMG TLRTDADGAL GNTRELNISN AAIVDLNGST
QTVETFTGQM GSTVLFKEGS LTVNKGGISQ GELTGGGNLN VTGGTLAIEG LNARYNALTS
VSPNAEVSLD NTQGLGRGNI ANDGLLTLKN VTGELRNSIS GKGIVSATAR TDVELDGDNS
RFVGQFNIDT GSALSVNEQK NLGDASVINN GLLTIATERS WAMTHSISGS GDVTKLGTGI
LTLNKDSAAY QGTTDIVGGE IAFGSDSAIN MASQHINIHN SGVMSGNVTT AGDVNVMPGG
TLRVAKTTVG GNLENGGTVQ MNSEGGKPGN VLTVNGNYTG NNGLMTFNAT LGGDNSPTDK
MNVKGDTQGN TRVRVDNIGG VGAQTVNGIE LIEVGGNSAG NFALTTGTVE AGAYVYTLAK
GKGNDEKNWY LTSKWDGVTP ADTPDPINNP PVVDPEGPSV YRPEAGSYIS NIAAANSLFS
HRLHDRLGEP QYTDSLRVQD SATSMWMRHV GGHERFRTGD GQLNTQANRY VLQLGGDLAQ
WSSTQDRWHL GVMAGYANQH SNTQSNHVGY KSDGRISGYS AGLYATWYQN DANITTLACG
GMSVCN
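
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following sketch illustrates that relationship under stated assumptions: it uses the standard codon assignments (which translation table 11 shares for sense and stop codons), reads the first frame, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a whitespace-formatted DNA sequence up to the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on bases other than A, C, G, T
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGAATAGAA TCTATCGC"))       # MNRIYR
print(translate("GGAATGTCGG TGTGCAACTA G"))   # GMSVCN
```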