Gene ECH74115_0915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0915 
Symbol 
ID6966921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp926222 
End bp927538 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content59% 
IMG OID643384937 
Producttail fiber protein 
Protein accessionYP_002269437 
Protein GI209398128 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCAG TACAAATATC AGGCGTGCTG AAAGATGGTG CGGGAAAACC AATACAGAAC 
TGCACCATTC AACTGAAAGC CAGACGTAAC AGCACCACGG TGGTGGTGAA CACGGTGGCC
TCTGAAAATC CGGATGAGGC AGGGCGTTAC AGCATGGACG TCGAGTATGG TCAGTACAGC
GTCACTCTGT TGGTGGAGGG ATTCCCGCCA TCACATGCCG GGACCATCAC CGTGTATGAA
GATTCTCAAC CGGGGACGCT GAATGATTTT CTCGGTGCCA TGTCGGAGGA TGACGTCCGG
CCGGAGGCAC TGCGTCGTTT TGAACTGATG GTGGAAGAAG CGGCGCGTCA CGCTGAGGAG
GCGAAGAAGA ATGCCGGAGA GGCGGAGACG TCCGCGAGGA ATGCCGGCAT ATCAGCCAGT
CAGGCAGAAG AGAGCGCGGC AAATGCTGAC ACTTCAGCAG GGGATGCATC GGAGTCAGCC
CGGCAGGCGG CAGAAAGTGC AGCCGCTGCA AAGCAGTCAG AGGAGGCGTC CTCGTCCTCG
GCTTCTGCGG CCGCTCAAAA AGCCAGTGAG TCATCACAAA GTGCAGCAGA AGCTGAATTG
TCAAGAAAGA CGGCAGAAAG TGCAGCCGGT AATGCAGCCA GGGATGCAAC GACCGCAACA
GAAAAAGCCC GGGAATCAGC AGAAAGCGCA CAGTCAGCGG AACAAAGCAG AATAGCGGCG
GAAGACGCCG TAAACAGAAT TCCCACCGTG GTGGGGCCTC CCGGACCAAA GGGGGAACCG
GGTCCCGCGG GTCCTCAGGG GCCGAAGGGA GATAAAGGAG AGCGTGGAGA CACCGGTCCG
GCAGGGGCAA CCGGTGAACG GGGACCGGGA GGAGATACAG GTCCGGCAGG TCCGCAGGGG
CCGAAAGGCG ACAGGGGAGA GCGGGGAGAG ACCGGTCTGA CAGGAAGTAC AGGTCCACAG
GGGCCAAAGG GAGATACCGG GGCAACAGGT CCGGCAGGAC CGCAGGGACC GAAAGGGGAA
ACAGGTGCGG CTGGCCCGGT GGGGGCTACC GGACCTCAGG GGGCGAAGGG CGACCCGGGG
GAGACACAAA TACGGTTCCG TCTGGGGCCG ATGAGAATTA TTGAGACAAA CAGCTATGGC
TGGTTCCCGG GTACAGATGG TGCGCTCATC ACCGGACTGA CCTTTCTTGA CCCCAAAGAT
GCCACACAGG TTCAGGGGAT GTTTCAGCAT TTGCAGGTCA GATTTGGTGA CGGGCCATGG
CAGGATGTTA AGGGACTGGA TGAAGTGGGC AGTGATACAG GCAGAACTGG AGAATGA
 
Protein sequence
MAAVQISGVL KDGAGKPIQN CTIQLKARRN STTVVVNTVA SENPDEAGRY SMDVEYGQYS 
VTLLVEGFPP SHAGTITVYE DSQPGTLNDF LGAMSEDDVR PEALRRFELM VEEAARHAEE
AKKNAGEAET SARNAGISAS QAEESAANAD TSAGDASESA RQAAESAAAA KQSEEASSSS
ASAAAQKASE SSQSAAEAEL SRKTAESAAG NAARDATTAT EKARESAESA QSAEQSRIAA
EDAVNRIPTV VGPPGPKGEP GPAGPQGPKG DKGERGDTGP AGATGERGPG GDTGPAGPQG
PKGDRGERGE TGLTGSTGPQ GPKGDTGATG PAGPQGPKGE TGAAGPVGAT GPQGAKGDPG
ETQIRFRLGP MRIIETNSYG WFPGTDGALI TGLTFLDPKD ATQVQGMFQH LQVRFGDGPW
QDVKGLDEVG SDTGRTGE