Gene ECH74115_2377 details

Gene Information

Locus tag: ECH74115_2377
Symbol:
ID: 6968002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2250877
End bp: 2252133
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 52%
IMG OID: 643386250
Product: hypothetical protein
Protein accession: YP_002270734
Protein GI: 209398905
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID:
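
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2250877
END_BP = 2252133
GENE_LENGTH_BP = 1257
PROTEIN_LENGTH_AA = 418


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length (1257 bp) and protein length (418 aa) agree with the coordinates.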


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000029957
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0111859
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATCTG ATGCGAAAAA CTTGATGAGC GACGGGAATG TGCAAATTGT TAAGACCGGC 
GAGGTCATTG GCGCGACGCA ACTTACTGAA GGCGAGTTGA TTGTTGAAGC TGGCGGAAGA
GCCGAAAATA CCGTGGTCAC GGGGGCTGGC TGGTTGAAAG TGGCAACCGG TGGGATCGCC
AAATGCACAC AGTACGGTAA CAATGGCACG CTATCGGTCA GCGACGGTGC CATTGCCACA
GATATTGTTC AGTCCGAGGG AGGCGCAATT AGTCTCTCTA CGCTCGCTAC GGTTAATGGC
CGCCATCCCG AAGGTGAATT CAGCGTTGAT CAGGGTTATG CCTGCGGTTT GTTGCTGGAA
AATGGCGGTA ACCTGCGTGT ACTGGAAGGA CATCGCGCGG AAAAAATCAT TCTCGATCAA
GAGGGCGGCC TGTTGGTTAA TGGGACAACC TCAGCGGTCG TGGTAGATGA AGGTGGTGAA
TTGTTGGTGT ATCCAGGTGG GGAAGCCAGC AATTGTGAGA TTAATCAGGG CGGCGTTTTT
ATGCTGGCGG GGAAAGCCAA TGATACGTTG CTTGCTGATG GCACCATGAA TAATCTCGGT
GGTGAAGACT CTGACACTAT TGTTGAGAAT GGAGCCATCT ATCGTCTGGG GACGGATGGT
CTTCAGCTCT ACAGTTCCGG TAAGACGCAA AACCTGTCCG TTAATGTGGG TGGTCGGGCT
GAAGTGCATG CCGGTACGCT GGAAAATGCG GTAATACAAG GTGGAACAGT GATCCTGTTG
TCACCCACCA GCGCGGACGA AAATTTTGTC GTAGAGGAAG ATCGCGCACC GGTTGAACTG
ACCGGGAGTG TTGCATTACT GGACGGCGCT TCAATGATTA TTGGCTATGG CGCAGATCTG
CAACAATCAA CGCTTACTGT ACAGCAGGGC GGTGTATTGA TTCTCGACGG CAGTACGGTA
AAAGGTGACA GTGTCACTTT CAGTATTGGT AACATCAATC TGAATGGCGG AAAACTGTGG
CTGATCACTG GTGCGGCAAC GCATGTGCAG CTGAAAGTGA AACGCCTGCG CGGAGAGGGA
GCGATTTGCC TGCAAACCAG TGCGAAAGAA ATTTCACCTG ACTTCATCAA TGTGAAAGGG
GAAGTTAACG GGGATATACG CGTTCAGATA ACAGATGCCA GTCGGCAAAC TCTGTGTAAC
GCACTGAAAC TACAGCCAGA CGAAGACGGG ATTGGCGCAA CGCTCCAGCC TGCGTAA
 
Protein sequence
MGSDAKNLMS DGNVQIVKTG EVIGATQLTE GELIVEAGGR AENTVVTGAG WLKVATGGIA 
KCTQYGNNGT LSVSDGAIAT DIVQSEGGAI SLSTLATVNG RHPEGEFSVD QGYACGLLLE
NGGNLRVLEG HRAEKIILDQ EGGLLVNGTT SAVVVDEGGE LLVYPGGEAS NCEINQGGVF
MLAGKANDTL LADGTMNNLG GEDSDTIVEN GAIYRLGTDG LQLYSSGKTQ NLSVNVGGRA
EVHAGTLENA VIQGGTVILL SPTSADENFV VEEDRAPVEL TGSVALLDGA SMIIGYGADL
QQSTLTVQQG GVLILDGSTV KGDSVTFSIG NINLNGGKLW LITGAATHVQ LKVKRLRGEG
AICLQTSAKE ISPDFINVKG EVNGDIRVQI TDASRQTLCN ALKLQPDEDG IGATLQPA
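
The sequences above are printed in space-separated blocks of ten. The following sketch shows one way to strip that layout, compute a GC fraction, and translate the gene sequence for comparison with the listed protein. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
first_dna_line = "ATGGGATCTG ATGCGAAAAA CTTGATGAGC GACGGGAATG TGCAAATTGT TAAGACCGGC"
first_protein_blocks = "MGSDAKNLMS DGNVQIVKTG"

assert translate(first_dna_line) == clean(first_protein_blocks)
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.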