Gene DET1426 details


Gene Information

Locus tag: DET1426
Symbol: gcp
ID: 3229245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dehalococcoides ethenogenes 195
Kingdom: Bacteria
Replicon accession: NC_002936
Strand:
Start bp: 1296570
End bp: 1297550
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 56%
IMG OID: 637120986
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_182134
Protein GI: 57233782
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
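
The coordinate and length fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1296570
end_bp = 1297550

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 981
print(protein_length)  # 326
```

Under these assumptions the computed values agree with the listed Gene Length (981 bp) and Protein Length (326 aa).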


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000536694
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAC TCGGTATAGA AAGCTCCTGT GATGAAACCG CTGCCGCAGT AGTGGCAGAC 
GGGGTAAATA TTTTATCCAA CCGGGTATCC TCGCAGATAG ATATCCACTC CCGTTACGGC
GGGGTAGTCC CCGAAGTGGC TTCCCGCCAG CACCTGCTTT CCATATTACC GGTCATAAGT
GACGCACTTA AGGAAGCACG TACCGGATTT GATGAAATTT CGGCCATAGC TGTAACCAAC
GGGCCGGGTC TGGCAGGCTC TCTGATAGTG GGGGTAAATG CCGCCAAAGC CATAGCCGCC
GCCCGCGGCA TACCCCTGGT GGCGGTAAAC CACCTGCACG GCCATATCTA TGCCAACTGG
CTTTCCGGCA GGATACCGGA ATTCCCCTGC CTGTGCCTGA CTGTCTCAGG CGGGCATACC
GACCTGGTGC TGATGAAAGG GCATGGTCAG TATCAGCTGC TGGGACGTAC CCGTGATGAT
GCCGCCGGAG AAGCCTTTGA CAAAGCCGCC AGAATACTGG GTTTAAGCTA TCCAGGCGGG
CCGGCCATAG ACAGAGCTTC GCAGGACGGT GAGGCAGTAC TGGATTTGCC GCGCTCGTGG
ATACCCGGCA GCCATGACTT CAGCTTTAGC GGACTGAAAA CCGCCCTGCT CCGGCTGGTG
GAAAACGGCG AAGTCTGTTC GGTAAATGAC GCCGCCGCCA GCTTTCAAAA AGCGGTGGTA
GATGTACTGG TAACCAAGAC CCTGAACTGC GCCCATGAGT ACAACGTAAA GCAGATACTG
CTGGCAGGCG GAGTGGCCGC CAATAACCTG CTGCGTAAAC AGCTAAGCGA ACAATCCCCT
CTGCCGGTTT CCATACCACC CATAGGCTTA TGTACCGACA ATGCCGCCGT AATAGCCTCC
TGCGGCTATT TCCGCTTTAT ATCCGGCGGT CAGGACAGGC TGGACATGGA TGTACTGCCG
GCGCTGTCCG TTGTTTCCTG A
 
Protein sequence
MKILGIESSC DETAAAVVAD GVNILSNRVS SQIDIHSRYG GVVPEVASRQ HLLSILPVIS 
DALKEARTGF DEISAIAVTN GPGLAGSLIV GVNAAKAIAA ARGIPLVAVN HLHGHIYANW
LSGRIPEFPC LCLTVSGGHT DLVLMKGHGQ YQLLGRTRDD AAGEAFDKAA RILGLSYPGG
PAIDRASQDG EAVLDLPRSW IPGSHDFSFS GLKTALLRLV ENGEVCSVND AAASFQKAVV
DVLVTKTLNC AHEYNVKQIL LAGGVAANNL LRKQLSEQSP LPVSIPPIGL CTDNAAVIAS
CGYFRFISGG QDRLDMDVLP ALSVVS
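
The gene and protein listings can be cross-checked against each other by translating the nucleotide sequence. The sketch below is a minimal example, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
# Codon table in the compact layout used by NCBI: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group the listing into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence listed above.
first_block = "ATGAAGATAC TCGGTATAGA AAGCTCCTGT"
last_block = "GCGCTGTCCG TTGTTTCCTG A"

print(translate(first_block))  # MKILGIESSC
print(translate(last_block))   # ALSVVS
```

The two fragments translate to the start and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the Protein Length and GC content fields.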