Gene ECH74115_4376 details


Gene Information

Locus tag: ECH74115_4376
Symbol: gcp
ID: 6972032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4052144
End bp: 4053157
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 57%
IMG OID: 643388099
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_002272537
Protein GI: 209400727
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
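
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming a complete CDS with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # All codons except the terminal stop codon encode a residue.
    return cds_length_bp // 3 - 1


length = gene_length(4052144, 4053157)
print(length, protein_length(length))  # prints: 1014 337
```

Under these assumptions the result agrees with the 1014 bp and 337 aa values in the record.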


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000149135
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT 
GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGG
GGCGTTGTGC CTGAACTGGC CTCCCGCGAT CATGTGCGTA AAACCGTACC GTTGATCCAG
GCGGCGCTAA AGGAGTCTGG TTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA
GGCCCTGGAT TAGTCGGCGC ACTGCTGGTT GGCGCGACCG TGGGGCGTTC TCTGGCGTTT
GCCTGGAACG TTCCGGCGAT CCCTGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG
CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTTGCGCTGC TGGTTTCCGG CGGTCATACG
CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT
GCCGCCGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGA
CCGTTACTGT CGAAAATGGC GGCGCAGGGT ACTGCCGGGC GCTTTGTTTT CCCGCGTCCG
ATGACCGACC GTCCGGGGCT GGATTTCAGC TTCTCCGGCC TGAAAACCTT CGCGGCAAAT
ACCATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ACATCGCCCG CGCCTTTGAA
GATGCGGTGG TCGATACGTT GATGATTAAG TGTAAGCGTG CGCTGGATCA GACGGGCTTT
AAGCGACTGG TCATGGCGGG CGGCGTGAGT GCTAACCGCA CGTTACGGGC GAAGCTGGCT
GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAATT TTGTACTGAT
AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT
CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCGGAGT TACCGGCTGC GTAA
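
The GC content field of the record can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks shown above; `gc_percent` is a hypothetical helper name, and how the source database rounds the value is not stated.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so a sequence can be
    pasted in 10-base blocks exactly as it appears in the record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on a short toy fragment (not the full gene):
print(gc_percent("ATGC GGCC"))  # prints: 75.0
```

Passing the full gene sequence to the function and rounding the result allows a comparison with the reported percentage.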
 
Protein sequence
MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ 
AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM
LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG
PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE
DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD
NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
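
The protein sequence follows from the gene sequence by codon translation. The sketch below assumes translation table 11 (bacterial, archaeal and plant plastid code), which uses the same codon-to-amino-acid assignments as the standard code and differs only in the set of permitted start codons. The `translate` function and `CODON_TABLE` name are illustrative, not taken from the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace is stripped first; a trailing partial codon is ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record:
first_line = "ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT"
print(translate(first_line))  # prints: MRVLGIETSCDETGIAIYDD
```

The output matches the first 20 residues of the protein sequence listed above.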