Gene ECH74115_1395 details


Gene Information

Locus tag: ECH74115_1395
Symbol:
ID: 6967004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1387736
End bp: 1390585
Gene Length: 2850 bp
Protein Length: 949 aa
Translation table: 11
GC content: 57%
IMG OID: 643385369
Product: pertactin family protein
Protein accession: YP_002269864
Protein GI: 209399047
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
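
The start, end, gene length and protein length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1387736, 1390585)
print(length, protein_length_aa(length))  # 2850 949
```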


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.830364
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.238303
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG 
GTGGTGGCCT CCGAACTGGC CCGCTCACGG GGAAAACGCG CCGGTGTGGC GGTTGCGCTG
TCTCTTGCTG CTGTCACATC AGTCCCGGCA CTGGCTGCTG ACAAGGTTGT ACAGGCGGGA
GAAACCGTGA ACGATGGAAC ACTGACAAAT CATGACAACC AGATTGTCTT CGGTACGGCC
AACGGAATGA CCATCAGTAC CGGGCTGGAA CTGGGGCCGG ACAGTGAAGA AAACACCGGT
GGGCAATGGA TACAGAATGG CGGGATAGCC GGAAACACCA CTGTCACCAC AAATGGTCGT
CAGGTCGTGC TGGAGGGGGG AACAGCCAGT GATACGGTTA TTCGTGACGG CGGGGGACAG
AGCCTGAACG GACTGGCGGT GAACACCACA CTGAATAACA GAGGCGAGCA GTGGGTGCAT
GAGGGCGGGG TTGCCACCGG TACAATTATC AACCGCGACG GTTACCAGAG CGTTAAAAGT
GGCGGGCTGG CAACAGGAAC CATCATCAAC ACCGGCGCAG AAGGCGGCCC TGATTCTGAC
AACTCGTATA CGGGTCAGAA GGTCCAGGGA ACAGCAGAAT CCACCACCAT CAACAAAAAT
GGACGGCAGA TTATCTTATT TTCCGGGCTA GCCCGTGACA CTCTCATTTA CGCAGGTGGT
GACCAGTCGG TACACGGAAG GGCCCTGAAT ACCACACTGA ATGGCGGTTA CCAATATGTG
CACAGGGACG GACTTGCGCT GAACACGGTA ATTAACGAGG GGGGCTGGCA GGTTGTTAAG
GCAGGTGGCG CTGCCGGTAA CACCACCATA AATCAGAACG GTGAACTGAG GGTACATGCC
GGCGGGGAAG CCACTGCAGT CACCCAGAAC ACGGGCGGTG CACTGGTTAC CAGTACTGCT
GCAACTGTCA TCGGCACAAA CCGTCTGGGG AATTTCACGG TGGAAAACGG TAAGGCTGAC
GGTGTTGTTC TGGAATCCGG CGGTCGTCTG GATGTACTGG AGAGCCATTC AGCACAGAAT
ACCCTAGTGG ATGACGGCGG TACCCTGGCA GTGTCTGCCG GCGGTAAGGC GACAAGTGTC
ACCATAACAT CCGGTGGTGC CCTGATTGCA GACAGTGGTG CCACTGTTGA GGGGACCAAT
GCCAGCGGTA AGTTCAGTAT TGATGGCACA TCCGGTCAGG CCAGCGGCCT GCTGCTGGAA
AATGGCGGCA GCTTTACGGT TAATGCCGGG GGACAGGCTG GCAACACCAC TGTCGGACAT
CGTGGAACAC TGACGCTGGC TGCCGGGGGA AGTCTGAGTG GCAGAACACA GCTCAGTAAA
GGCGCCAGTA TGGTACTGAA TGGTGATGTG GTCAGTACCG GCGATATTGT TAACGCAGGG
GAGATTCGCT TTGATAATCA GACGACACCG AATGCCGCGC TGAGCCGTGC TGTTGCAAAA
AGTAACTCCC CGGTAACGTT CCATAAACTG ACCACCACGA ACCTCACCGG CCAGGGCGGC
ACCATCAATA TGCGTGTTCG CCTTGATGGC AGCAATGCCT CTGACCAGCT GGTGATTAAT
GGTGGTCAGG CAACCGGCAA AACCTGGCTT GCGTTTACAA ATGTCGGAAA CAGCAACCTC
GGGGTGGCAA CCACCGGACA GGGTATCCGG GTTGTGGATG CACAGAATGG CGCCACCACA
GAAGAAGGTG CGTTTGCCCT GAGTCGCCCG CTTCAGGCCG GCGCCTTTAA CTACACCCTG
AACCGTGACA GCGATGAAGA CTGGTACCTG CGCAGTGAAA ATGCTTATCG TGCTGAAGTC
CCCCTGTATA CATCCATGTT GACACAGGCA ATGGACTATG ACCGGATTCT GGCAGGCTCC
CGCAGCCATC AGACCGGTGT AAACGGTGAA AATAACAGCG TCCGTCTCAG CATTCAGGGC
GGTCATCTCG GTCACGATAA CAACGGCGGT ATTGCCCGTG GAGCCACGCC GGAAAGCAGC
GGCAGCTATG GCTTCGTCCG TCTGGAGGGT GACCTGCTCA GAACAGAGGT TGCCGGTATG
TCTCTGACGA CAGGGGTGTA TGGTGCTGCA GGCCATTCTT CCGTTGATGT TAAGGATGAT
GACGGTTCCC GCGCCGGCAC GGTCCGGGAT GATGCCGGCA GTCTGGGCGG ATACCTGAAT
CTGGTACACA CATCCTCCGG CCTGTGGGCT GACATTGTGG CCCAGGGAAC CCGTCACAGC
ATGAAAGCGT CATCGGACAA TAACGACTTC CGCGCCCGGG GCTGGGGCTG GCTGGGCTCA
CTGGAAACCG GTCTGCCCTT CAGTATCACT GACAATCTGA TGCTGGAGCC ACAACTGCAG
TACACCTGGC AGGGACTCTC CCTGGATGAC GGCCAGGATA ACGCCGGTTA TGTGAAGTTC
GGGCATGGCA GTGCACAACA TGTGCGTGCC GGTTTCCGTC TGGGCAGCCA CAACGATATG
ACCTTTGGTG AAGGCACCTC ATCCCGTGAC ACCCTGCGCG ACAGTGCAAA ACACAGTGTG
AGTGAACTGC CGGTGAACTG GTGGGTACAG CCTTCTGTTA TCCGCACCTT CAGCTCCCGG
GGTGACATGA GCATGGGGAC AGCCGCAGCC GGCAGTAACA TGACGTTCTC ACCGTCCCGG
AATGGCACGT CACTGGACCT GCAGGCCGGA CTGGAAGCCC GTATCCGGGA AAATATCACC
CTGGGCGTTC AGGCCGGTTA TGCCCACAGC GTCAGCGGCA GCAGCGCTGA AGGCTATAAC
GGTCAGGCTA CGCTGAATAT GACTTTCTGA
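
The gene sequence is shown in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one in the GC content field. Only the first line of the sequence is embedded to keep the example short, so the printed fraction is for that line only. The helper names `join_blocks` and `gc_fraction` are hypothetical.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied as displayed.
first_line = "ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two functions on the full listing should give a value comparable to the reported GC content, assuming that figure is computed over the whole gene.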
 
Protein sequence
MKRHLNTSYR LVWNHITGTL VVASELARSR GKRAGVAVAL SLAAVTSVPA LAADKVVQAG 
ETVNDGTLTN HDNQIVFGTA NGMTISTGLE LGPDSEENTG GQWIQNGGIA GNTTVTTNGR
QVVLEGGTAS DTVIRDGGGQ SLNGLAVNTT LNNRGEQWVH EGGVATGTII NRDGYQSVKS
GGLATGTIIN TGAEGGPDSD NSYTGQKVQG TAESTTINKN GRQIILFSGL ARDTLIYAGG
DQSVHGRALN TTLNGGYQYV HRDGLALNTV INEGGWQVVK AGGAAGNTTI NQNGELRVHA
GGEATAVTQN TGGALVTSTA ATVIGTNRLG NFTVENGKAD GVVLESGGRL DVLESHSAQN
TLVDDGGTLA VSAGGKATSV TITSGGALIA DSGATVEGTN ASGKFSIDGT SGQASGLLLE
NGGSFTVNAG GQAGNTTVGH RGTLTLAAGG SLSGRTQLSK GASMVLNGDV VSTGDIVNAG
EIRFDNQTTP NAALSRAVAK SNSPVTFHKL TTTNLTGQGG TINMRVRLDG SNASDQLVIN
GGQATGKTWL AFTNVGNSNL GVATTGQGIR VVDAQNGATT EEGAFALSRP LQAGAFNYTL
NRDSDEDWYL RSENAYRAEV PLYTSMLTQA MDYDRILAGS RSHQTGVNGE NNSVRLSIQG
GHLGHDNNGG IARGATPESS GSYGFVRLEG DLLRTEVAGM SLTTGVYGAA GHSSVDVKDD
DGSRAGTVRD DAGSLGGYLN LVHTSSGLWA DIVAQGTRHS MKASSDNNDF RARGWGWLGS
LETGLPFSIT DNLMLEPQLQ YTWQGLSLDD GQDNAGYVKF GHGSAQHVRA GFRLGSHNDM
TFGEGTSSRD TLRDSAKHSV SELPVNWWVQ PSVIRTFSSR GDMSMGTAAA GSNMTFSPSR
NGTSLDLQAG LEARIRENIT LGVQAGYAHS VSGSSAEGYN GQATLNMTF
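
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as starts, so the standard codon table is enough for a gene that begins with ATG, as this one does. The sketch below is a minimal illustration on the first line of the gene sequence. `translate` is a hypothetical helper, and alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; "*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAAACGAC ATCTGAACAC CAGCTACAGG CTGGTATGGA ATCACATTAC GGGCACCCTG"
print(translate(first_line))  # MKRHLNTSYRLVWNHITGTL
```

The output matches the first 20 residues of the protein sequence listed above.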