Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4376 |
Symbol | gcp |
ID | 6972032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4052144 |
End bp | 4053157 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643388099 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002272537 |
Protein GI | 209400727 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000149135 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGG GGCGTTGTGC CTGAACTGGC CTCCCGCGAT CATGTGCGTA AAACCGTACC GTTGATCCAG GCGGCGCTAA AGGAGTCTGG TTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA GGCCCTGGAT TAGTCGGCGC ACTGCTGGTT GGCGCGACCG TGGGGCGTTC TCTGGCGTTT GCCTGGAACG TTCCGGCGAT CCCTGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTTGCGCTGC TGGTTTCCGG CGGTCATACG CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT GCCGCCGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGA CCGTTACTGT CGAAAATGGC GGCGCAGGGT ACTGCCGGGC GCTTTGTTTT CCCGCGTCCG ATGACCGACC GTCCGGGGCT GGATTTCAGC TTCTCCGGCC TGAAAACCTT CGCGGCAAAT ACCATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ACATCGCCCG CGCCTTTGAA GATGCGGTGG TCGATACGTT GATGATTAAG TGTAAGCGTG CGCTGGATCA GACGGGCTTT AAGCGACTGG TCATGGCGGG CGGCGTGAGT GCTAACCGCA CGTTACGGGC GAAGCTGGCT GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAATT TTGTACTGAT AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCGGAGT TACCGGCTGC GTAA
|
Protein sequence | MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
|
| |