Gene ECH74115_1381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1381 
Symbol 
ID6968450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1376901 
End bp1377965 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content53% 
IMG OID643385359 
Productpotra domain, shlb-type family 
Protein accessionYP_002269854 
Protein GI209399951 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.163797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGTCAC TGACACTGAT GTCTGCTTTA TTATCGCCTT TATCTCTTCA GGCAGCGGAT 
GTCCGGCGTA GCGGAGATGA AGCATTTATC ATTCAGCAGC AGCGTCAGGA AGCCCTTGAG
CAACAACTGA CGCCTTCAGC CCCTGATGTT CGCCTTTCTG CACCTGGCTC TTTTGCCCAT
AAGATTAATT TTCCTGTTGA AACGCCCTGT TTTCAGATTA AACAGACGGA ACTGAAGGGG
GCTGATGCGT TACCACACTG GCTGCCTTTA CAAAAAATCG CCAACGGGGC GGTCGGGCAT
TGCCTGGGGG CGAAAGGAAT TAATCTGCTG ATGAGTACAT TGCAGAACCG TCTGGTCGAT
CATGGTTATG TCACCACCCG TGTTCTGGCA CCTTCGCAGG ATTTAAAAAG CGGTATCCTC
CGGCTGGTTA TTATTCCCGG TGTTGTGCGA CATGTGCGTC TGACACCGGA CAGTGATGAC
TATATTCAGT TGTATTCCTC ATTCCCGGCA CACGAAGGTT CTCTGCTGGA TTTACGGGAC
ATTGAGCAGG GGCTGGATTT AGGTAACAGC CGGATACAGG GACAACATAC TGAGCTGAAT
GCAACCAGTG GAAATCTGTC TACACAGAAT GCGCAACTGA GTGCCGATAC GCTTTCCGCC
CGGACTGCCG GGCAGTTCAG CAGTAATGGC GGTACGATAA ATGCCGACAC ACTGCAGATA
TCGGCACAAA GCCTGTCAAA TCGTAAAGGC AGTCTGATTC AGACGGGAAC AGGGGATTTT
TCGCTGAGTC TGCCGGGAAG CGTGGATAAC CGGGAAGGGC TGCTTGCGGC AAATGGCGCG
GTGCGTCTGG ATGCACTGAG CCTTGATAAT CGCAAGGGGA AAGTGCAGGC GGAGCAGTCA
CCCTCCCTTC AGAAATCCCC GCCCACGTTT CTGAAACCGT TTGTGGCTGG TGTCTGTGCG
GCATTGCTGG CGGTCAGCGT GGCTATTCCG GGATGGCAGT TTCTGACACA GCCATCACCG
GAGGAGCAGC ATTTTACCTG GGGGAATGGT TGTAAAAAGC AGTGA
 
Protein sequence
MLSLTLMSAL LSPLSLQAAD VRRSGDEAFI IQQQRQEALE QQLTPSAPDV RLSAPGSFAH 
KINFPVETPC FQIKQTELKG ADALPHWLPL QKIANGAVGH CLGAKGINLL MSTLQNRLVD
HGYVTTRVLA PSQDLKSGIL RLVIIPGVVR HVRLTPDSDD YIQLYSSFPA HEGSLLDLRD
IEQGLDLGNS RIQGQHTELN ATSGNLSTQN AQLSADTLSA RTAGQFSSNG GTINADTLQI
SAQSLSNRKG SLIQTGTGDF SLSLPGSVDN REGLLAANGA VRLDALSLDN RKGKVQAEQS
PSLQKSPPTF LKPFVAGVCA ALLAVSVAIP GWQFLTQPSP EEQHFTWGNG CKKQ