Gene ECH74115_3035 details


Gene Information

Locus tag: ECH74115_3035
Symbol:
ID: 6969184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2816426
End bp: 2817625
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 52%
IMG OID: 643386867
Product: phage Tail Collar domain protein
Protein accession: YP_002271335
Protein GI: 209395954
COG category: [R] General function prediction only
COG ID: [COG5301] Phage-related tail fibre protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.00110884
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAGTGA AATACTACGC CATTCTGACT AATCAGGGCG CGGCACGGCT GGCTAACGCG 
ACGATGCTCG GCAGTAAGCT GAATCTGACG CAAATGGCCG TTGGTGATGC AAATGGTGTC
TTGCCGACAC CTGACCCGGC ACAGACAAAA CTTATTAACC AGAAACGCAT CGCGCCGCTG
AATCTTCTGA GTGTTGACCC GAACAACCAG AGCCAGATTA TTGCGGAGCA AATCATCCCT
GAGAACGAGG GCGGATTCTG GATCCGTGAG ATTGGGCTTT ATGATGATGA AGGCGTACTC
ATTGCGGTGG CGAACTGCCC GGAAACGTAC AAACCGCAGT TACAGGAAGG CAGCGGTCGT
ACCCAGACTA TCCGCATGAT TCTGGTTGTC ACGAATACCG AAGCCATCAC GCTGAAAATC
GACCCGTCGG TGGTACTGGC GACCCGTAAA TACGTGGATG ATAAAGTCCT GGAATTAAGG
CTGTATGTGG ATGACCAGAT GAGAAACCAC ATTGCCGCAC AGGATCCTCA TACCCAGTAT
GCGCAGAAAC ATAATCCGAC ATTTACCGGA GAACCAAAAG CGCCAACGCC TGCCGCAGGA
AATAACACCA CGCGGATTGC GACCACTGAG TTTGTTCAGA CCGCTATTAC CGCTCTGATT
AACGGCGCGC CAGACACGCT GGACACACTG AAAGAAATTG CCGCGGCCAT TAACAATGAC
CCGAAATTCA GCACCACCAT TAACAATGCG CTGTCAGGTA AGCAGCCACT GGATGAGACG
CTGACTCATT TGAGTGGAAA GGATGTTGCC GGTCTTCTCG CATACCTTGG TTTGGGAGAA
GCGGCAAAAC GGGATGTGGG GACAGGGGAA AATCAGATAC CGGACATGGC CTCTTTTGCC
AGTGGTGATG GATGGATGAA ATTACCCAAC GGTAAAATCC TGCAATATGG TCGTGGTGCG
GTTACGCCGA CATTATCGAC GCAAACAATG AGAATTACAT TCAGCATCCC TTTCCCCAAA
AAAGCGGACT GCGCCATGCT TACTCATTCT GGTGATGGCG GTGCGCCTTT AGGCGCTGGG
CGAGGGTTCG TGATGACTGC AGAAGGCCCA ACGTTAACCG GCTTTAATTC TGCTTACAGA
ACGTCATCAA CCAGCGACAC GGTATCGATG AATTACAGTT GGTGGGCTGT TGGTGAGTAA
 
Protein sequence
MTVKYYAILT NQGAARLANA TMLGSKLNLT QMAVGDANGV LPTPDPAQTK LINQKRIAPL 
NLLSVDPNNQ SQIIAEQIIP ENEGGFWIRE IGLYDDEGVL IAVANCPETY KPQLQEGSGR
TQTIRMILVV TNTEAITLKI DPSVVLATRK YVDDKVLELR LYVDDQMRNH IAAQDPHTQY
AQKHNPTFTG EPKAPTPAAG NNTTRIATTE FVQTAITALI NGAPDTLDTL KEIAAAINND
PKFSTTINNA LSGKQPLDET LTHLSGKDVA GLLAYLGLGE AAKRDVGTGE NQIPDMASFA
SGDGWMKLPN GKILQYGRGA VTPTLSTQTM RITFSIPFPK KADCAMLTHS GDGGAPLGAG
RGFVMTAEGP TLTGFNSAYR TSSTSDTVSM NYSWWAVGE
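
The length and GC content fields in the Gene Information section can be cross-checked against the coordinates and the gene sequence. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are consistent with the values listed (1200 bp, 399 aa) but are not stated by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring the whitespace that groups bases into blocks of ten."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(2816426, 2817625)
    print(length)                  # 1200
    print(protein_length(length))  # 399
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a value to compare with the listed GC content; how the source database rounds that figure is not stated here.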