Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3528 |
Symbol | gcp |
ID | 5588002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3539197 |
End bp | 3540210 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640927155 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001464525 |
Protein GI | 157157805 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000459441 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CACGTGCGCA AAACCGTACC GTTGATCCAG GCGGCGCTAA AGGAGTCTGG TTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA GGCCCTGGAT TAGTCGGCGC ACTGCTGGTT GGCGCAACCG TGGGGCGTTC TCTGGCGTTT GCCTGGAACG TTCCGGCAAT CCCGGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTCGCGCTGC TGGTGTCCGG CGGTCATACG CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT GCCGCCGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGA CCGTTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTTTT CCCGCGTCCG ATGACCGACC GTCCGGGGCT GGATTTCAGC TTTTCTGGTC TGAAAACCTT TGCGGCGAAC ACGATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA GATGCGGTGG TCGATACGTT GATGATTAAG TGTAAGCGAG CGTTGGATCA GACGGGCTTT AAGCGACTGG TCATGGCGGG CGGCGTGAGT GCTAACCGCA CGTTACGGGC GAAGCTGGCG GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAATT TTGTACTGAT AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT CTCGGCGTTA GCGTGCGTCC GCGCTGGCCA CTGGCAGAGT TACCGGCTGC GTAA
|
Protein sequence | MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
|
| |