Gene ECH74115_1132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1132 
Symbol 
ID6969020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1161605 
End bp1162795 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content52% 
IMG OID643385137 
Producthypothetical protein 
Protein accessionYP_002269636 
Protein GI209400981 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.250632 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG 
TGGGTCTTTT CCGGGGCCGT TGCCCGTATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC
GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA
ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTTCC
CGCCGTTTGC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCGAC
AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC
GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGCGCAG AATATCAGCG CGCGGCATTA
ATTAGTGCCC TGCAAACGCT GTACCCGGAA TGTGCGATTT ACGATCGCAG CGATGTTGCG
GTACGTAAAA AAGAAGGGAT GGAGCTGACC CAGGGCCTCG TCACCGGTGA GTTGCCGCCT
GCCCTGCTGC CGATTGAAGA ACATGGCATG AAGCTGCTGG TGGACATACA GCACGGACAC
AAAACGGGCT ACTACCTGGA CCAGCGAGAC AGCCGCCTGG CTACCCGCCG CTACGTTGAA
AATAAACGCG TACTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG
GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCACTGGA TATTGCACGG
CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC
TTTAAATTGC TGCGTACCTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC
CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGTGG CTATAAAGAT
ATCAACATGC TGGCGATTCA GCTGCTGAAT GAAGGCGGTA TTCTCCTGAC TTTCTCCTGT
TCCGGTCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CGGATGCCGC AATTGATGCC
GGTCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCGG CCGATCATCC GGTGATCGCT
ACCTATCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
 
Protein sequence
MSVRLVLAKG REKSLLRRHP WVFSGAVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ 
IRARVWTFDP SESIDIAFFS RRLQQAQKWR DWLAQKDGLD SYRLIAGESD GLPGITIDRF
GNFLVLQLLS AGAEYQRAAL ISALQTLYPE CAIYDRSDVA VRKKEGMELT QGLVTGELPP
ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM
GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD
PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGILLTFSC SGLMTSDLFQ KIIADAAIDA
GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM