Gene EcSMS35_2384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2384 
Symbol 
ID6144679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2427935 
End bp2431699 
Gene Length3765 bp 
Protein Length1254 aa 
Translation table11 
GC content50% 
IMG OID641617257 
Productadhesin 
Protein accessionYP_001744429 
Protein GI170679991 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat
[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.985186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATTA TTTTTCTACG CAAGGAGTAT TTATCTTTAC TCCCGTCAAT GATTGCATCT 
CTTTTCTCTG TTAACAGTGT CGCGGAGGTT TTGGATTCAT GCCAGGGATA TGATATCAAA
GCGAGTTGTC AGGCCAGCAG GCAAAGCCTT TCAGGCATTA CGCAGGACTG GAGCATCGCC
GATGGGCAAT GGGTGATTTT TTCGGGTATG GCCAATAATG CCAGCGGTGG TGCCGTATTT
TTGCAGCAAA GTGCTGAATT TACGATATCA CCACAAAATG AAACAGGGAT GACCCTGTTT
GCTAATAACT CGATTAGTGG CGAATATAAT AATGGCGGAG CTATCTTTGC TAAAGAAAAC
TCAACGATAA ATCTTGCGAA TGTTATTTTC GACAGTAACG TTGCAGGAGG CTATGGCGGA
GCAATCTATT CTGCGGGAAC AAACGATGCC GGTGATATGG ATTTAAATAT CACTAATGCA
ATATTTGCGA ATAATATCGC CAATGATGGT AAAGGTGGCG CTATTTATTC TATCAATAAT
GATATTTATT TAAGTGATGT CATTTTTGAT AATAATCAGG CATATACATC AACCAGCTAT
AGTGATGGGG ACGGCGGTGC CATTGATGTT ACGGATAATA GTACGGATAA CACACATCTT
TCAGGAAAAA CGATAATTAA TAATACCTCC TTTACCAATA ACTATGCAGA AGGTTATGGC
GGAGCAATTT ATACCAGCAG CACCACATCC CCGTATCTTA TTGATATTTC TGTTGATGAT
AATTATGACC AAAACAATGG TGTGATGATT GATGAAAATA ACAGCGCCTC GGGATATGAC
CATTCGGCGA CGGCAGCGGC GGGGGGCTTT ATGTATATCG GTCATAGTGT GGCTGAATTT
AACATTGCCG CCGATAAAAC GCTGGTAATA GGCAATACCA GCAACGATGG CGCGATTGAT
TCTCTTGCCG GAACGGGAGT CATCGTGAAA GAGGGAGCGG GAGAGTTAGT CCTCAATGCC
GACAATAATG CGTTTACCGG CGAGATGAGT ATCCAGAATG GCGAGGTGAC GCTGGGGCGC
AGCGATGAGT TAATGAATGT CGGCGACACG CACTGTCAGA GCGATCCGCA GGATTGCTTT
GGTCTGATGG TCGGCAGCAC TGTTCATTCT GAGTATCAGG CAGAACTGAA TGTTGGCAAT
ACACAACAAA CGTTTGTGCA CTCATTAACC GGTTTTGCTA ATGGCATTTT GAATATCGAC
GCAGGCGGCA ATGTTACCGT AAACCAGGGC GGTTTTTCCG GCTCGATTCA GGGTGAAGGA
CAGTTGACGG TAGCGCAAGA TGGCAGCTAT CTCCTGACAG GGGAGCAGTC GATGGCATTA
ACCGGCGATA TTGTGGTGGA AGATAACGCC GTATTGTCGC TAGCGGGGAA TCAGGCCGAT
TTACGGGCAA TGCAGAGCGA TCCGCAATCG ATCGTGCTAA ATGGCGGAGT ATTGGATCTC
TCTGATTTCA CCACTTGGGA TGGCGATAGC TCATACAATG ACGGTCTGCA AATCAGCGGC
AGTGGGGGGA CAGTGATTGG CAGTAATGAT GTCGTTGATA TCAGCAGTGG CGATGATTTA
CATATTGGCG GCAGTGATGC CAGCCAGAAT GGCGTTTATG TCGTCATTAA TGCGGGGGAT
CAGCGCGTAA CGCTGGCGAA TAACAACGGT TATCTCGGTA ATACGCAAAT CGCCTCCGGT
ACGCTTGAAG TCAGCGATAA TTCGCAATTG GGGGACACAA GTTACAACCG TTCAGTGATC
TTCACAGATC CCCAACAACA CAGTGAAATG GATGTGACGA CTGATGTTGA CACTCGCTCG
GCTACAACAG GTCAGGGCAG GAATATTGAA ATGCGCGCTG ATGGCGAAAT ACACGTTGAG
GATGGCGTAG ATACGCAATG GGGCGGGTTG ATGGCGGATA GTACCGGGCA ACAACTGGAT
AGCGTAAGCA CGTTGACTAA AAGTGGCGGC GGGACGCTGG AGTTGACCGC CAGCGGCACT
GCGACGTCTG CGGTACGTGT GGAGGACGGA ACGCTAAAAG GTGAAGCGGA GAATATTATT
CCTTATGTTT CTTCACTGTG GGTGGGGGAA GACGGTGTTT TTGAAACCGG GAAAAATCAG
GATATTCGTT CAATCGATGC CACCTCTGGC GGCGATATCG ACATAACCGA TGGTACTGTG
TTGCGATTGA CGCAGCAGGA TACGAACCAG GCGCTGGATG CCTCGCTGTT TAGCGGCGAC
GGTACGTTGG TGAATGCCAC CGATGGTGTG ACGCTGACAG GCGAGCTTAA TACCAACCTT
GAAACTGACA GCCTGACTTA TCTTTCAGAC GTGACGGTCA ATGGTGATCT AACTAATACT
TCCGGTGCTG TCAGTCTGCA AAATGGCGTT GCTGGCGACA CGCTCACGGT AAACGGTGAT
TACACCGGTG GCGGTACGTT GTTTCTCGAC AGCGAATTAA ACGGCGATGA CTCGGCAAGC
GACCAACTGG TGTTGAACGG TAATACTGCT GGCAACACGA CCGTGGTAAT TAATCCCATT
ACGGGTATTG GTGAGCCGAC ATCTACGGGC ATTAAAGTGG TTGATTTCGC AGCCGATCCA
ACGCAATTTC AAAACAATGC GCAGTTCAGT CTGGCGGGCA GCGGCTACGT CAATATGGGA
GCGTATGACT ACACGCTGGT GGAAGATAAC AACGACTGGT ATCTGCGATC GCAAGAAGTA
ACGCCACCAT CGCCACCTGA TCCAGACCCG ACTCCAGATC CTGATCCTAC GCCGGATCCT
GATCCAACAC CCGACCCGGC TCCAGCCCCT ACGCCTGCCT ACCAGCCGGT GCTGAATGCC
AAAGTTGGTG GCTATCTCAA TAACCTACGT GCGGCAAATC AGGCATTTGT GATGGAGCGA
CGCGATCACG CTGGTGGCGA TGGTCAGACG CTGAATTTAC GTGTTATCGG CGGAGATTAT
CATTACACAG CAGCGGGGCA ACTGGCTCAG CATGAAGACA CTTCTACGGT GCAGCTTAGC
GGCGACTTGT TTAGCGGGCG CTGGGGCATG GATGGCGAGT GGATGCTTGG GGCGGTTGGT
GGCTACAGTG ATAACCAGGG CGACAGCCGC TCGAATATGA CCGGAACTCG CGCCGATAAC
CAGAACCACG GTTATGCCGT TGGTCTGACC TCAAGCTGGT TCCAGCACGG TAATCAGAAG
CAGGGGGCCT GGCTGGATAG CTGGCTGCAA TACGCGTGGT TTAACAATGA TGTTTCCGAA
CATGAAGATG GCGCGGATCA TTACCACTCA TCGGGGATTA TCGCCTCGCT GGAAGCCGGG
TATCAGTGGT TACCGGGGCA TGGAGTGGTG ATTGAACCAC AGGCGCAGGT GATTTATCAG
GGCGTGCAGC AGGATGATTT TACCGCCGCT AACCATGCGC GGGTGTCACA ATCGCAGGGT
GATGATATTC AAACGCGGCT GGGTTTACAC AGCGAATGGC GTACCGCGGT TGGTGTCATA
CCAACATTAG ATCTGAATTA TTATCACGAT CCCCATGCGA CGGAAATTGA AGAAGATGGC
AGCACTATCA GTGACGATGC GGTGAAGCAA CGGGGTGAAA TAAAAGTGGG AATAACGGGC
AATATCAGTC AGCGAGTTTC GCTGCGTGGT AGCGTGGCGT GGCAGAAAGG GAGTGATGAT
TTTGCCCAGA CGGCAGGGTT TTTGTCGATG ACAGTGAAAT GGTAA
 
Protein sequence
MRIIFLRKEY LSLLPSMIAS LFSVNSVAEV LDSCQGYDIK ASCQASRQSL SGITQDWSIA 
DGQWVIFSGM ANNASGGAVF LQQSAEFTIS PQNETGMTLF ANNSISGEYN NGGAIFAKEN
STINLANVIF DSNVAGGYGG AIYSAGTNDA GDMDLNITNA IFANNIANDG KGGAIYSINN
DIYLSDVIFD NNQAYTSTSY SDGDGGAIDV TDNSTDNTHL SGKTIINNTS FTNNYAEGYG
GAIYTSSTTS PYLIDISVDD NYDQNNGVMI DENNSASGYD HSATAAAGGF MYIGHSVAEF
NIAADKTLVI GNTSNDGAID SLAGTGVIVK EGAGELVLNA DNNAFTGEMS IQNGEVTLGR
SDELMNVGDT HCQSDPQDCF GLMVGSTVHS EYQAELNVGN TQQTFVHSLT GFANGILNID
AGGNVTVNQG GFSGSIQGEG QLTVAQDGSY LLTGEQSMAL TGDIVVEDNA VLSLAGNQAD
LRAMQSDPQS IVLNGGVLDL SDFTTWDGDS SYNDGLQISG SGGTVIGSND VVDISSGDDL
HIGGSDASQN GVYVVINAGD QRVTLANNNG YLGNTQIASG TLEVSDNSQL GDTSYNRSVI
FTDPQQHSEM DVTTDVDTRS ATTGQGRNIE MRADGEIHVE DGVDTQWGGL MADSTGQQLD
SVSTLTKSGG GTLELTASGT ATSAVRVEDG TLKGEAENII PYVSSLWVGE DGVFETGKNQ
DIRSIDATSG GDIDITDGTV LRLTQQDTNQ ALDASLFSGD GTLVNATDGV TLTGELNTNL
ETDSLTYLSD VTVNGDLTNT SGAVSLQNGV AGDTLTVNGD YTGGGTLFLD SELNGDDSAS
DQLVLNGNTA GNTTVVINPI TGIGEPTSTG IKVVDFAADP TQFQNNAQFS LAGSGYVNMG
AYDYTLVEDN NDWYLRSQEV TPPSPPDPDP TPDPDPTPDP DPTPDPAPAP TPAYQPVLNA
KVGGYLNNLR AANQAFVMER RDHAGGDGQT LNLRVIGGDY HYTAAGQLAQ HEDTSTVQLS
GDLFSGRWGM DGEWMLGAVG GYSDNQGDSR SNMTGTRADN QNHGYAVGLT SSWFQHGNQK
QGAWLDSWLQ YAWFNNDVSE HEDGADHYHS SGIIASLEAG YQWLPGHGVV IEPQAQVIYQ
GVQQDDFTAA NHARVSQSQG DDIQTRLGLH SEWRTAVGVI PTLDLNYYHD PHATEIEEDG
STISDDAVKQ RGEIKVGITG NISQRVSLRG SVAWQKGSDD FAQTAGFLSM TVKW