Gene ECH74115_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2167 
Symbol 
ID6967986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2077748 
End bp2080261 
Gene Length2514 bp 
Protein Length837 aa 
Translation table11 
GC content56% 
IMG OID643386062 
Producthypothetical protein 
Protein accessionYP_002270551 
Protein GI209399271 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.981242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTCA ATGCGTACCT GTCACAACAG CGTAAGGCGT GGGACGTTCT CAGTGATTTC 
TGCTCGGCGA TGCGCTGTAT GCCGGTATGG AACGGGCAGA CGCTGACGTT TGTGCAGGAC
CGTCCGTCAG ATGTGGTGTG GCCCTACACC AGCAGTGATG TGGTGGTGGA TGATAACGGC
GTGGGGTTTC GCTACAGCTT CAGCGCCCTG AAGGACCGCC ACACGGCGGT GGAGGTGAAT
TACACCGACC CGCAGAACGG CTGGCAGACC TCCACGGAAC TGGTGGAAGA CCCGGAAGCC
ATACTGCGCT ACGGGCGCAA CCTGCTGAAG ATGGATGCGT TCGGCTGTAC CAGTCGCGGT
CAGGCCCACC GTGCCGGGCT GTGGGTGATA AAGACCGGAC TGCTGGAAAC GCAGACGGTG
GATTTCACGC TCGGGTCACA GGGGCTGCGT CACACACCCG GTGACATTAT TGAAATCTGT
GATAACGACT ATGCCGGGAC CATGACCGGC GGACGTATCC TGTCCATCGA TGCCGCCAGC
CGCACCCTGA CACTGGACCG TGAGGTGACC CTGCCGGAGA CCGGTGCCGC CACGGTGAAC
CTGATTAACG GCAGCGGTAA GCCGGTGAGC GTGGCCATCA CTGCACACCC CGCGCCGGAC
CGGATACAGG TCAGCACCCT GCCGGATGGC GTGGAGACAT ACGGTGTGTG GGGGCTCTCC
CTGCCGTCAC TGCGTCGTCG CCTGTTCCGC TGTGTCTCCA TCCGGGAAAA CACGGACGGC
ACCTTTGCCA TCACGGCAGT GCAGCACGTA CCGGAAAAAG AAGCCATCGT GGATAACGGG
GCGCACTTTG ACGGCGACCA GAGCGGCACC CTGAACAGCG TCATCCCTCC GGCAGTGCAG
CACCTGACGG TGGAGGTGAG TGCAGCTGAC AGCCAGTATC TGGCGCAGGC GAAATGGGAC
ACGCCGCGGG TGGTGAAGGG CGTGCGCTTC AGTCTGCGCC TGACCAGTGG AAGCGGTCAG
GACAGCCGTC TGGTGACCAC CGCCATCACT GCGGATACAG AGCATCGTTT CAGTGGTCTG
CCGCTCGGGG AATACACCCT GACAGTCAGG GCAATTAACA GTTATGGCCA GCAGGGGGAA
CCGGCCATCA CCACCTTCCG GATTAACGCG CCAGCAAAAC CCGCCACCAT TGAACTGACG
CCGGGGTATT TTCAGATAAC GGCGGTACCG GTGCTGGCGG TGTATGACCC GACGGTGCAG
TTTGAGTTCT GGTTTTCGGA AAAACGCATC ACGAACACGG CACAGGTGGA AAAATCTGCC
CGTTATCTGG GGAGCGGCAG TCAGTGGACT GTCCAGGGAA GCCGGATTAA GCCGGGGACG
GATTTCTGGT TTTACGTGCG CAGCGTCAAC CTGGTGGGGA AATCTGCGTT TGTGGAAGTC
AGCGGGCAGC CCAGCAATGA TGGTGAAGGG TATCTGGAAT TTTTCCGGGA AAAAATAGGA
AAACTGCATC TGGCTCAGGG GCTATGGGAG CTGATAGACA ACAGCCAGCT TGCGGATGAG
ATGGCGGAGA TGAAGACCAC CATCACGGAA ACCCGCAATG AAATCACACA GACGGTCAGT
AAAACGCTGG AGAACCAGAG CGCCACTATA CAGCAGATAC AGCGCGTGCA GAAGGACACA
AATGATGACC TGGCTGCGCT GTACATGCTG AAGGTTCAAA AAACGAAAGA CGGCATTCCC
TATGTGGCCG GGATTGGTGC AGGGATTGAG GATACTGATG GCCAGCCACT GAGCAACATA
CTGCTGCTGG CTGACCGTAT CGCGATGATA AATCCGGAGA GCGGCAACAG CACGCCGTTA
TTTGTGGCGC AGGGGAATCA GCTGTTCATG AACGACGTGT TCCTGAAGCG ACTGTTTGCG
GTGAGTATCA CCTCGTCCGG CAATCCCCCG ACGTTTTCCC TGACGCCGGA CGGGCGACTG
ACGGCGAAAA ATGCGGATAT CAGTGGCAGT GTGAATGCGA ACTCAGGGAC GCTCAACAAC
GTCACGATTA ATGAGAACTG TCAGATTAAG GGGAAACTGT CAGCCAACCA GATTGAAGGC
GATATTGTCA AAACGGTCAG CAAGTCTTTC CCCCGCACGA GCACTTATGC CAGTGGCACC
ATCACGGTAA GAATCAGTGA TGATCAGAAG TTTGACCGGC AGGTCATGAT ACCGCCAGTG
TTATTCCGCG GTGGTAAGCA TGAGAATTTC AACAGTAATA ACCAACAGTC ATACTGGTAT
TCAACCTGCC GGTTAAGAGT GACCCGCAAT GGTCAGGAGA TTTTTAATCA GTCCACGACG
GATGCTCAGG GCGTATTTTC CTCAGTTATA GATATGCCTG CCGGACAGGG GACGCTGACA
CTGACATTCA CCGTATCTTC ATCAGGAGCG AATAACTGGA CACCAACAAC CAGTATCAGC
GATCTGCTGG TTGTGGTGAT GAAAAAATCC ACAGCAGGTA TCAGTATCAG CTGA
 
Protein sequence
MTFNAYLSQQ RKAWDVLSDF CSAMRCMPVW NGQTLTFVQD RPSDVVWPYT SSDVVVDDNG 
VGFRYSFSAL KDRHTAVEVN YTDPQNGWQT STELVEDPEA ILRYGRNLLK MDAFGCTSRG
QAHRAGLWVI KTGLLETQTV DFTLGSQGLR HTPGDIIEIC DNDYAGTMTG GRILSIDAAS
RTLTLDREVT LPETGAATVN LINGSGKPVS VAITAHPAPD RIQVSTLPDG VETYGVWGLS
LPSLRRRLFR CVSIRENTDG TFAITAVQHV PEKEAIVDNG AHFDGDQSGT LNSVIPPAVQ
HLTVEVSAAD SQYLAQAKWD TPRVVKGVRF SLRLTSGSGQ DSRLVTTAIT ADTEHRFSGL
PLGEYTLTVR AINSYGQQGE PAITTFRINA PAKPATIELT PGYFQITAVP VLAVYDPTVQ
FEFWFSEKRI TNTAQVEKSA RYLGSGSQWT VQGSRIKPGT DFWFYVRSVN LVGKSAFVEV
SGQPSNDGEG YLEFFREKIG KLHLAQGLWE LIDNSQLADE MAEMKTTITE TRNEITQTVS
KTLENQSATI QQIQRVQKDT NDDLAALYML KVQKTKDGIP YVAGIGAGIE DTDGQPLSNI
LLLADRIAMI NPESGNSTPL FVAQGNQLFM NDVFLKRLFA VSITSSGNPP TFSLTPDGRL
TAKNADISGS VNANSGTLNN VTINENCQIK GKLSANQIEG DIVKTVSKSF PRTSTYASGT
ITVRISDDQK FDRQVMIPPV LFRGGKHENF NSNNQQSYWY STCRLRVTRN GQEIFNQSTT
DAQGVFSSVI DMPAGQGTLT LTFTVSSSGA NNWTPTTSIS DLLVVVMKKS TAGISIS