Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3357 |
Symbol | gcp |
ID | 6145887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3435048 |
End bp | 3436061 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618186 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001745336 |
Protein GI | 170682206 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000189454 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CACGTGCGTA AAACCGTACC GTTGATCCAG GAGGCGCTGA AAGAGTCTGG TTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA GGCCCAGGAT TAGTCGGCGC ACTGCTGGTT GGCGCGACCG TGGGGCGTTC TCTGGCGTTT GCCTGGGACG TTCCGGCAAT CCCGGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG CTGGAAGATA ACCCACCGGC ATTTCCGTTT GTGGCGCTGC TGGTTTCCGG CGGTCATACG CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TTGGCGAGTC TATCGATGAT GCTGCTGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGG CCATTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTCTT CCCGCGCCCG ATGACCGACC GTCCGGGGCT GGATTTCAGC TTTTCTGGTC TGAAAACCTT CGCGGCGAAC ACGATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA GATGCGGTGG TCGATACGTT GATGATTAAG TGTAAGCGAG CACTGGATCA GACGGGCTTT AAGCGACTGG TAATGGCAGG CGGCGTGAGT GCTAACCGCA CGTTACGGGC GAAGCTGGCT GAAATGATGA AAAAACGCCG TGGCGAAGTG TTCTACGCGC GTCCGGAATT TTGTACTGAT AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCAGAGT TACCGGCCGC GTAA
|
Protein sequence | MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ EALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWDVPAIPVH HMEGHLLAPM LEDNPPAFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
|
| |