Gene ECH74115_3314 details

Gene Information

Locus tag: ECH74115_3314
Symbol:
ID: 6969490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3047081
End bp: 3048574
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 51%
IMG OID: 643387126
Product: hypothetical protein
Protein accession: YP_002271590
Protein GI: 209397294
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0795943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.253919
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGCGA TACTGGTGAG CTGCCTTCAG TTTTTAGTGG CCTGGCATAA GCACGAAGTC 
AAATACGACA CACTGATTAC CGACGTACAA AAGTATCTCG ATACCTATTT TGCCGACCTG
AAATCCACTA CTGACCGGCT CCAGCCGCTG ACCTTAGATA CATGCAAGCA AGCTAACCCC
GAACTGACCG CCCGCGCGGC GTTTAGCATG AATGTCCGAA CGTTTGTGCT GGTGAAAGAT
AAAAAAACAT TCTGTTCATC TGCGACCGGT GAGATGGACA TTCCACTCAA TGAATTGATT
CCTGCGCTCG ACATTAATAA AAATGTCGAT ATGGCGATCT TACCCGGCAC GCCGATGGTG
CCGAACAAAC CCGCAATCGT CATCTGGTAT CGCAACCCTT TGCTGAAAAA TAGCGGCGTC
TTTGCCGCTC TGAATCTCAA CCTGACGCCT TCTCTCTTTT ATAGCTCACG GCAGGAAGAT
TACGATGGCG TCGCCCTCAT TATTGGCAAT ACTGCGCTAT CTACCTTTTC TTCACGTTTG
ATGAACGTTA ACGAATTAAC CGACATGCCA GTCCGTGAAA CTAAAATTGC GGGCATTCCT
CTGACCGTTC GGCTTTATGC AGATGACTGG ACATGGAACG ATGTGTGGTA CGCATTTTTA
CTGGGCGGCA TGAGTGGAAC TGTCGTTGGC CTGCTCTGCT ATTACCTGAT GAGCGTACGT
ATGCGCCCCG GCAGAGAAAT CATGACCGCC ATCAAGCGCG AACAATTTTA CGTGGCGTAT
CAACCGGTGG TGGATACACA AGCTTTGCGA GTAACGGGCC TGGAAGTACT GCTACGCTGG
CGGCATCCTG TCGCGGGAGA AATTCCCCCG GATGCCTTCA TTAACTTTGC CGAATCGCAA
AAGATGATTG TGCCGCTGAC TCAGCACCTG TTTGAGTTAA TTGCCCGCGA TGCCGCAGAA
TTAGAAAAAG TGCTGCCGGT AGGCGTCAAA TTTGGCATTA ACATTGCGCC GGACCATCTG
CACAGCGAAA GCTTTAAAGC AAATATCCAG AAACTGCTCA CTTCCCTACC CGCACACCAT
TTCCAGATTG TGCTGGAAAT TACCGAGCGC GATATGCTGA AAGAGCAAGA AGCCACACAA
CTCTTCGCCT GGCTGCACTC GGTCGGCGTA GAAATTGCTA TTGATGACTT CGGCACCGGG
CACAGCGCGC TTATCTATCT TGAGCGTTTT ACGCTCGATT ATCTGAAAAT TGACCGTGGA
TTTATCAACG CCATCGGTAC GGAAACGATC ACTTCACCCG TACTTGACGC GGTGCTGACG
CTGGCGAAAC GCCTCAATAT GCTGACGGTT GCTGAGGGGG TCGAAACGCC GGAACAGGCG
CGATGGCTAA GCGAACGCGG CGTTAATTTC ATGCAAGGCT ACTGGATTAG TCGCCCGTTA
CCGCTGGACG ATTTTGTCCG CTGGCTGAAG AAACCGTATA CGCCGCAGTG GTAA
 
Protein sequence
MIAILVSCLQ FLVAWHKHEV KYDTLITDVQ KYLDTYFADL KSTTDRLQPL TLDTCKQANP 
ELTARAAFSM NVRTFVLVKD KKTFCSSATG EMDIPLNELI PALDINKNVD MAILPGTPMV
PNKPAIVIWY RNPLLKNSGV FAALNLNLTP SLFYSSRQED YDGVALIIGN TALSTFSSRL
MNVNELTDMP VRETKIAGIP LTVRLYADDW TWNDVWYAFL LGGMSGTVVG LLCYYLMSVR
MRPGREIMTA IKREQFYVAY QPVVDTQALR VTGLEVLLRW RHPVAGEIPP DAFINFAESQ
KMIVPLTQHL FELIARDAAE LEKVLPVGVK FGINIAPDHL HSESFKANIQ KLLTSLPAHH
FQIVLEITER DMLKEQEATQ LFAWLHSVGV EIAIDDFGTG HSALIYLERF TLDYLKIDRG
FINAIGTETI TSPVLDAVLT LAKRLNMLTV AEGVETPEQA RWLSERGVNF MQGYWISRPL
PLDDFVRWLK KPYTPQW
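
The coordinates and lengths in this record can be cross-checked with simple arithmetic: End bp minus Start bp plus one gives 3048574 - 3047081 + 1 = 1494 bp, and an unspliced CDS of 1494 bp holds 498 codons, that is 497 residues plus one stop codon. The following is a minimal Python sketch of that check, assuming the sequence is pasted in the 10-base block layout shown above. The helper names (`clean_sequence`, `gc_fraction`, `expected_protein_length`) are hypothetical and are not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# First row of the gene sequence, copied from the record above.
first_row = "ATGATTGCGA TACTGGTGAG CTGCCTTCAG TTTTTAGTGG CCTGGCATAA GCACGAAGTC"
row = clean_sequence(first_row)

print(len(row))                             # 60 (six blocks of ten bases)
print(row.startswith("ATG"))                # checks for the ATG start codon
print(3048574 - 3047081 + 1)                # 1494, the reported gene length
print(expected_protein_length(1494))        # 497, the reported protein length
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content.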