Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET1426 |
Symbol | gcp |
ID | 3229245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | + |
Start bp | 1296570 |
End bp | 1297550 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637120986 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_182134 |
Protein GI | 57233782 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000536694 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAC TCGGTATAGA AAGCTCCTGT GATGAAACCG CTGCCGCAGT AGTGGCAGAC GGGGTAAATA TTTTATCCAA CCGGGTATCC TCGCAGATAG ATATCCACTC CCGTTACGGC GGGGTAGTCC CCGAAGTGGC TTCCCGCCAG CACCTGCTTT CCATATTACC GGTCATAAGT GACGCACTTA AGGAAGCACG TACCGGATTT GATGAAATTT CGGCCATAGC TGTAACCAAC GGGCCGGGTC TGGCAGGCTC TCTGATAGTG GGGGTAAATG CCGCCAAAGC CATAGCCGCC GCCCGCGGCA TACCCCTGGT GGCGGTAAAC CACCTGCACG GCCATATCTA TGCCAACTGG CTTTCCGGCA GGATACCGGA ATTCCCCTGC CTGTGCCTGA CTGTCTCAGG CGGGCATACC GACCTGGTGC TGATGAAAGG GCATGGTCAG TATCAGCTGC TGGGACGTAC CCGTGATGAT GCCGCCGGAG AAGCCTTTGA CAAAGCCGCC AGAATACTGG GTTTAAGCTA TCCAGGCGGG CCGGCCATAG ACAGAGCTTC GCAGGACGGT GAGGCAGTAC TGGATTTGCC GCGCTCGTGG ATACCCGGCA GCCATGACTT CAGCTTTAGC GGACTGAAAA CCGCCCTGCT CCGGCTGGTG GAAAACGGCG AAGTCTGTTC GGTAAATGAC GCCGCCGCCA GCTTTCAAAA AGCGGTGGTA GATGTACTGG TAACCAAGAC CCTGAACTGC GCCCATGAGT ACAACGTAAA GCAGATACTG CTGGCAGGCG GAGTGGCCGC CAATAACCTG CTGCGTAAAC AGCTAAGCGA ACAATCCCCT CTGCCGGTTT CCATACCACC CATAGGCTTA TGTACCGACA ATGCCGCCGT AATAGCCTCC TGCGGCTATT TCCGCTTTAT ATCCGGCGGT CAGGACAGGC TGGACATGGA TGTACTGCCG GCGCTGTCCG TTGTTTCCTG A
|
Protein sequence | MKILGIESSC DETAAAVVAD GVNILSNRVS SQIDIHSRYG GVVPEVASRQ HLLSILPVIS DALKEARTGF DEISAIAVTN GPGLAGSLIV GVNAAKAIAA ARGIPLVAVN HLHGHIYANW LSGRIPEFPC LCLTVSGGHT DLVLMKGHGQ YQLLGRTRDD AAGEAFDKAA RILGLSYPGG PAIDRASQDG EAVLDLPRSW IPGSHDFSFS GLKTALLRLV ENGEVCSVND AAASFQKAVV DVLVTKTLNC AHEYNVKQIL LAGGVAANNL LRKQLSEQSP LPVSIPPIGL CTDNAAVIAS CGYFRFISGG QDRLDMDVLP ALSVVS
|
| |