Gene ECH74115_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1643 
Symbol 
ID6970828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1587551 
End bp1589101 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content51% 
IMG OID643385603 
Productprophage tail fibre domain protein 
Protein accessionYP_002270097 
Protein GI209400441 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCTTG AGGATGCGAG CACGACGAAA AAGGGGATAG TACAGCTCAG CAGTGCGACT 
AACAGCACTT CCGAGTCACT GGCGGCAACG CCAAAAGCCG TTAAGGCCGC GTATGAGCTG
GCTAACGGGA AATACACCGC ACAGGATGCA ACGACAGCAC AGAAAGGGAT AGTTCAGCTT
AGCAACGCGA CCAACAGCAC ATCTGAAATG CTGGCGGCAA CGCCAAAGTC GGTAAAGGCA
GCCTATGACC TTGCTAACGG GAAATATACT GCTCAGGACG CTACGACAGC ACAAAAAGGA
ATTGTCCAGC TCAGTAGTGC AACCAACAGC GCATCTGAAA CGCTTGCCGC GACACCGAAA
GCAGTGAAAG CAGCTAATGA TAATGCGAAT GGTCGGGTAC CTTCTGCCCG TAAGGTGAAT
GGTAAGGCGC TTTCATCGGA TATAACACTG ACGCCGAAAG ATATTGGTAC GCTTAACTCA
ACAACAATGT CATTCAGCGG TGGTGCTGGT TGGTTCAAAT TAGCAACGGT AACCATGCCA
CAGGCGAGTT CTGTTGTTTC AATTACGTTG ATTGGTGGCG CGGGATTTAA CGTGGGGTCA
CCTCAACAGG CAGGTATATC TGAACTTGTT TTGCGTGCAG GTAATGGTAA TCCGAAGGGG
ATTACTGGTG CTTTATGGCA GCGCACATCG ACAGGGTTTA CAAATTTTGC CTGGGTCAAT
ACATCTGGTG ATACTTACGA TATTTACGTT GCAATCGGAA ATTATGCGAC TGGTGTAAAT
ATTCAATGGG ATTATACCAG TAATGCCAGC GTGACGATTC ATACGTCACC AGCATATTCT
GCTAATAAGC CGGAAGGGTT AACGGACGGT ACAGTTTATT CACTCTATAC GCCATCAGAG
CAGTTTTATC CGCCTGGCGC ACCAATCCCG TGGCCATCAG ATACCGTTCC GTCTGGCTAT
GCCCTGATGC AGGGGCAGAC TTTTGACAAA TCTGCATACC CGAAACTTGC AGCCGCTTAT
CCGTCAGGCG TGATCCCTGA TATGCGTGGC TGGACGATTA AGGGCAAACC TGCCAGTGGT
CGGGCCGTAT TGTCTCAGGA ACAGGACGGC ATTAAATCGC ACACCCACAG CGCCAGCGCA
TCCAGTACGG ATTTGGGGAC GAAAACCACA TCGTCGTTTG ATTACGGCAC TAAATCCACG
AATAACACCG GGGCGCACAC GCACAGTGTG AGCGGTACAG CCGCAAGTGC CGGAAACCAT
ACTCATAGTG TCACAGGCGC ATCAGCAGTC AGCCAGTGGT CACAAAATGG GTCAGTACAT
AAGGTAGTGT CTGCGGCCAG TGTGAATACA AGTGCTGCAG GAGCGCACAC TCATAGTGTC
AGCGGCACAG CTGCATCTGC AGGTGCTCAC GCACATACTG TCGGTATTGG TGCTCATACG
CACTCTGTTG CGATTGGCTC ACATGGACAC ACCATCACCG TTAACGCTGC GGGTAACGCG
GAAAACACTG TCAAAAACAT CGCATTTAAC TACATTGTGA GGCTTGCATA A
 
Protein sequence
MALEDASTTK KGIVQLSSAT NSTSESLAAT PKAVKAAYEL ANGKYTAQDA TTAQKGIVQL 
SNATNSTSEM LAATPKSVKA AYDLANGKYT AQDATTAQKG IVQLSSATNS ASETLAATPK
AVKAANDNAN GRVPSARKVN GKALSSDITL TPKDIGTLNS TTMSFSGGAG WFKLATVTMP
QASSVVSITL IGGAGFNVGS PQQAGISELV LRAGNGNPKG ITGALWQRTS TGFTNFAWVN
TSGDTYDIYV AIGNYATGVN IQWDYTSNAS VTIHTSPAYS ANKPEGLTDG TVYSLYTPSE
QFYPPGAPIP WPSDTVPSGY ALMQGQTFDK SAYPKLAAAY PSGVIPDMRG WTIKGKPASG
RAVLSQEQDG IKSHTHSASA SSTDLGTKTT SSFDYGTKST NNTGAHTHSV SGTAASAGNH
THSVTGASAV SQWSQNGSVH KVVSAASVNT SAAGAHTHSV SGTAASAGAH AHTVGIGAHT
HSVAIGSHGH TITVNAAGNA ENTVKNIAFN YIVRLA