Gene ECH74115_4986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4986 
Symbol 
ID6967153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4636774 
End bp4638033 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID643388668 
Producthypothetical protein 
Protein accessionYP_002273095 
Protein GI209398044 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG4942] Membrane-bound metallopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.263173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGGG CCGTGAAACC GCGCAGGTTT GCAATCAGGC CCATCATCTA CGCCAGCGTT 
CTTAGCGCTG GCGTATTGTT GTGCGCCTTT TCCGCCCACG CGGATGAGCG TGACCAACTC
AAATCTATTC AGGCCGATAT CGCCGCAAAA GAGCGCGCGG TACGCCAAAA GCAACAACAA
CGCGCAAGCC TGCTCGCACA ATTGAAAAAG CAGGAAGAAG CGATCTCTGA AGCCACCCGT
AAGCTGCGCG AAACGCAAAA CACGCTCAAT CAACTCAATA AACAGATTGA TGAGATGAAC
GCGTCGATTG CCAAACTGGA GCAGCAAAAA GCCGCCCAGG AGCGCAGCCT CGCCGCACAA
CTGGATGCCG CATTCCGTCA GGGCGAGCAT ACCGGTATTC AGCTGATTCT CAGCGGTGAA
GAAAGCCAGC GTGGACAGCG TTTACAGGCT TATTTCGGCT ATCTCAACCA GGCGCGACAA
GAAACCATTG CCCAGTTGAA GCAAACGCGT GAAGAAGTCG CCATGCAGCG TGCTGAACTG
GAAGAGAAAC AGAGCGAGCA ACAAACGCTG TTATATGAGC AGCGCGCCCA ACAGGCGAAA
CTGACTCAGG CGCTGAACGA GCGTAAAAAG ACGCTGGCAG GGCTGGAGTC TTCCATCCAG
CAAGGTCAGC AACAGTTGAG CGAGCTGCGC GCCAACGAAT CCCGTCTGCG TAACAGCATT
GCCCGTGCGG AAGCCGCGGC GAAAGCGCGT GCAGAACGAG AAGCACGTGA GGCCCAGGCG
GTTCGCGACC GCCAGAAAGA AGCGACGCGC AAAGGCACCA CCTACAAACC GACCGAAAGC
GAAAAATCGC TGATGTCCCG AACTGGTGGC CTGGGGGCGC CGCGTGGTCA GGCATTCTGG
CCGGTTCGCG GGCCGACGCT GCATCGCTAT GGTGAACAGC TACAGGGCGA ACTACGCTGG
AAAGGAATGG TTATCGGTGC CTCTGAAGGT ACTGAAGTTA AAGCGATTGC CGATGGTCGG
GTGATTCTGG CTGACTGGCT GCAAGGTTAC GGTCTGGTGG TGGTGGTTGA GCACGGTAAA
GGCGACATGA GTCTTTACGG CTATAATCAG AGCGCACTGG TGAGCGTTGG TTCGCAGGTT
CGCGCGGGCC AGCCAATTGC ACTGGTGGGC AGCAGTGGCG GTCAGGGTCG GCCTTCACTC
TATTTCGAAA TTCGCCGCCA AGGTCAGGCG GTCAATCCAC AGCCGTGGTT GGGAAGATAA
 
Protein sequence
MTRAVKPRRF AIRPIIYASV LSAGVLLCAF SAHADERDQL KSIQADIAAK ERAVRQKQQQ 
RASLLAQLKK QEEAISEATR KLRETQNTLN QLNKQIDEMN ASIAKLEQQK AAQERSLAAQ
LDAAFRQGEH TGIQLILSGE ESQRGQRLQA YFGYLNQARQ ETIAQLKQTR EEVAMQRAEL
EEKQSEQQTL LYEQRAQQAK LTQALNERKK TLAGLESSIQ QGQQQLSELR ANESRLRNSI
ARAEAAAKAR AEREAREAQA VRDRQKEATR KGTTYKPTES EKSLMSRTGG LGAPRGQAFW
PVRGPTLHRY GEQLQGELRW KGMVIGASEG TEVKAIADGR VILADWLQGY GLVVVVEHGK
GDMSLYGYNQ SALVSVGSQV RAGQPIALVG SSGGQGRPSL YFEIRRQGQA VNPQPWLGR