Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0635 |
Symbol | |
ID | 6065430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 685365 |
End bp | 686378 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641600042 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001723638 |
Protein GI | 170018684 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000117915 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CACGTGCGCA AAACCGTACC GTTGATCCAG GCGGCGCTAA AGGAGTCTGG CTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA GGCCCTGGAT TAGTCGGCGC ACTGCTGGTT GGCGCAACCG TGGGGCGTTC TCTGGCGTTT GCCTGGAACG TTCCGGCAAT CCCGGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTCGCGCTGC TGGTGTCCGG CGGTCATACG CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT GCCGCCGGTG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGA CCGTTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTCTT CCCGCGTCCG ATGACCGACC GTCCGGGGCT GGATTTCAGC TTCTCCGGTC TGAAAACCTT CGCGGCAAAT ACCATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA GATGCGGTGG TCGATACGCT GATGATTAAG TGCAAGCGAG CGTTGGATCA GACTGGCTTT AAGCGACTGG TCATGGCAGG CGGCGTGAGT GCTAACCGTA CGTTACGGGC GAAGCTGGCT GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAGTT TTGTACTGAT AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCGGAGT TACCGGCCGC GTAA
|
Protein sequence | MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
|
| |