Gene ECH74115_4887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4887 
Symbol 
ID6971693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4521505 
End bp4523565 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content53% 
IMG OID643388575 
ProductAsmA family protein 
Protein accessionYP_002273003 
Protein GI209398465 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2982] Uncharacterized protein involved in outer membrane biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.647897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG CAGGCAAAAT AACCGCTGCG ATTTCAGGGG CTTTCTTGTT GTTGATTGTC 
GTGGCGATCA TTTTGATTGC AACTTTTGAC TGGAATCGAC TCAAACCGAC CATCAACCAG
AAAGTCTCTG CGGAGTTGAA TCGTCCGTTC GCTATCCGTG GCGATCTGGG CGTGGTGTGG
GAGCGGCAAA AGCAAGAAAC TGGCTGGCGC AGCTGGGTGC CGTGGCCCCA TGTACACGCG
GAAGACATCA TTCTTGGCAA TCCACCGGAT ATTCCCGAAG TCACGATGGT GCATTTGCCA
CGCGTAGAAG CAACGCTGGC CCCGCTGGCG CTGCTGACCA AAACGGTCTG GCTGCCGTGG
ATCAAGCTCG AAAAGCCCGA CGCGCGCCTG ATTCGCCTCT CTGAAAAGAA CAATAACTGG
ACGTTTAATC TCGCCAACGA TGATAACAAA GACGCGAATG CAAAGCCGTC GGCATGGTCG
TTTCGGCTGG ATAATATTCT TTTCGATCAA GGGCGGATCG CCATTGATGA CAAAGTAAGC
AAAGCGGATC TGGAGATTTT TGTTGATCCC TTAGGCAAGC CGCTGCCGTT CAGCGAAGTT
ACTGGATCGA AAGGTAAAGC GGATAAAGAA AAGGTGGGCG ATTACGTTTT TGGCCTGAAG
GCGCAGGGAC GTTATAACGG TGAACCGCTC ACGGGTACGG GAAAAATAGG CGGTATGCTG
GCGCTGCGTG GCGAAGGGAC GCCGTTTCCG GTACAGGCTG ATTTCCGTTC AGGTAATACC
CGTGTTGCTT TTGATGGCGT CGTGAATGAC CCAATGAAGA TGGGCGGTGT CGATTTACGG
CTTAAATTTT CTGGCGATTC ACTGGGTGAT CTCTATGAAC TGACGGGCGT TCTGCTGCCC
GATACCCCGC CGTTTGAAAC GGATGGTCGG CTGGTAGCGA AAATCGACAC TGAAAAATCG
TCGGTCTTTG ATTATCGCGG CTTTAATGGG CGAATTGGCG ATAGCGATAT CCACGGTTCT
CTGGTCTACA CCACCGGCAA GCCACGACCA AAACTGGAAG GTGATGTCGA GTCGCGGCAA
TTGCGGCTGG CGGACCTGGG ACCGTTGATT GGCGTTGATT CCGGGAAAGG GGCAGAAAAG
TCGAAACGGT CTGAACAGAA GAAGGGCGAA AAAAGCGTTC AGCCTGCGGG CAAAGTGCTG
CCTTATGACC GCTTCGAAAC CGATAAATGG GACGTTATGG ATGCCGATGT TCGCTTCAAA
GGGCGGCGCA TTGAGCATGG CAGTAGCCTG CCGATTAGCG ATCTTTCTAC TCATATCATC
CTCAAAAATG CTGACTTGCG CCTGCAACCG CTGAAATTTG GCATTGCGGG TGGCAGCATT
GCGGCGAATA TTCATCTGGA AGGCGATAAA AAGCCGATGC AGGGGCGGGC AGATATTCAG
GCTCGTCGAC TGAAACTGAA AGAACTGATG CCCGATGTGG AACTGATGCA GAAGACGCTG
GGGGAAATGA ACGGTGACGC GGAACTGCGC GGTAGCGGTA ACTCGGTGGC GGCACTTTTA
GGCAACAGTA ACGGCAACCT GAAACTGTTG ATGAATGACG GGCTGGTGAG CCGCAACCTG
ATGGAGATTG TTGGGCTGAA TGTCGGCAAC TACATTGTCG GTGCGATATT TGGTGACGAT
GAGGTGCGGG TGAACTGCGC GGCGGCGAAT CTGAATATTG CCAACGGCGT GGCACGCCCG
CAGATTTTTG CTTTCGATAC TGAGAACGCG TTGATTAACG TTACCGGCAC GGCAAGTTTT
GCTTCGGAAC AGCTGGATTT GACTATTGAT CCGGAGAGTA AAGGTATTCG GATTATCACA
CTGCGTTCGC CGCTGTATGT GCGTGGGACG TTTAAAAATC CTCAGGCTGG GGTGAAAGCC
GGGCCGTTGA TTGCCCGTGG TGCTGTTGCT GCGGCACTGG CAACGCTGGT TACGCCAGCG
GCAGCGTTGC TGGCACTGAT CTCACCTTCC GAAGGGGAGG CTAATCAGTG CCGGACGATT
TTGTCGCAGA TGAAGAAGTG A
 
Protein sequence
MSKAGKITAA ISGAFLLLIV VAIILIATFD WNRLKPTINQ KVSAELNRPF AIRGDLGVVW 
ERQKQETGWR SWVPWPHVHA EDIILGNPPD IPEVTMVHLP RVEATLAPLA LLTKTVWLPW
IKLEKPDARL IRLSEKNNNW TFNLANDDNK DANAKPSAWS FRLDNILFDQ GRIAIDDKVS
KADLEIFVDP LGKPLPFSEV TGSKGKADKE KVGDYVFGLK AQGRYNGEPL TGTGKIGGML
ALRGEGTPFP VQADFRSGNT RVAFDGVVND PMKMGGVDLR LKFSGDSLGD LYELTGVLLP
DTPPFETDGR LVAKIDTEKS SVFDYRGFNG RIGDSDIHGS LVYTTGKPRP KLEGDVESRQ
LRLADLGPLI GVDSGKGAEK SKRSEQKKGE KSVQPAGKVL PYDRFETDKW DVMDADVRFK
GRRIEHGSSL PISDLSTHII LKNADLRLQP LKFGIAGGSI AANIHLEGDK KPMQGRADIQ
ARRLKLKELM PDVELMQKTL GEMNGDAELR GSGNSVAALL GNSNGNLKLL MNDGLVSRNL
MEIVGLNVGN YIVGAIFGDD EVRVNCAAAN LNIANGVARP QIFAFDTENA LINVTGTASF
ASEQLDLTID PESKGIRIIT LRSPLYVRGT FKNPQAGVKA GPLIARGAVA AALATLVTPA
AALLALISPS EGEANQCRTI LSQMKK