Gene Dgeo_2225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2225 
Symbol 
ID4056900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2345624 
End bp2346886 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content71% 
IMG OID641231268 
Productpeptidase M16-like protein 
Protein accessionYP_605688 
Protein GI94986324 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACTG TTGCCGCACC CGGAACCCAC GTGTGGACCC TGGAGGGAGG ACTGACCGCC 
GCGTTTGAGC GCCGCCGAGG GCCGGGCTTC GCACTGGACC TGCGGGTGCC GGTGGGGAGC
GCCCACGATC CGGTGGGTCG GGAAGGTTCC GCGGGCGTCT TGGAGGAGTG GCTCTACAAG
GGCGCAGGGG GTCGGAACGC CCGCGCTTTC CAGGATGCGC TGGATGACCT GGGCGTGCGC
CGGGGCGGGG GTGTGGGCCC GGAGGCCACT CGCTTCAGCG TGAGTGGCCT GACGGCGGAC
CTTCCCGCCG CGCTCGGGTT GCTGGCGGAC CTGCTTCTGC GGCCTGCCCT GCCGCCAGAG
GAACTGCCGG TCCTGGCGGA TCTCGCCCGG CAGGATCTGG AGGGGCTGGA GGACAGTCCG
CCTGACCTGC TGGCGATCGA AGCGCGGCGG CGGGCCTTTC CCCGTGACCC GGCCTCGCCC
TTCGCGGGCT ACGCTCATCC GGCCAGCGGC ACAGCCGAGG GGCTTTCGAA CCTGACGGCA
CAGAATCTGC GTGCCTTTCT CAACCGCTAC GGCACACGCG GCAGCGTGCT GGGCCTGGTG
GCCGACGCCG ATCCCGGGGA GGTGCGCGGC CTGCTGGAAC GGGCGTTTGC CGGCTGGCAC
CCCGGCGAGA CCGCACCCGT TCCTGCCGAC TTCCATCCTG GCCTGCGCGT GCATGTTCCC
CACGCCGAGG CCGAGCAGAC ACACCTCAGC GTCACCGCGC CGGGTGTCGC GCCGCGCGAT
CCCGACTGGC TGTCTTGGCA GGTGGCGCTG ATGGCGCTCT CGGGTGGGAG TGCCAGCCGC
CTCTTTCATG CAGTCCGCGA GGAACGGGGC TTGGCCTACA GTGTCAGCGC GGCGCCGATC
CTCCTGGGGG GACGGGGTTT CCTGGCCGCC TACGCGGGCA GTACACCCGA GCGCGCGCCC
GAGACGCTGG CTGTGCTGTT GGCTGAACTC GCTCGGCTGC CGCAGGGCCT CACCGAGGCT
GAGTTTGAGC GGGCCCGCCG TGGCCTCACC GCCAGCGTCG TGTTCGGCGC CGAGAGCCTG
CGCGCCCGTG CCAGCAGCCT CACGCGTGAT CTGGCGGTGT TTGGGCGGGT GCGCGGCGTG
GCCGAACACC GTGCCCAGAT TGCGGCCCTC ACCCTGGAGC GGGTGAACGC TTTCCTGGCT
GAGTATGACC CTGTCGCACA GGCGACCATC GTGACGCTTG GCCCGGCGGA GGTACAGGCA
TGA
 
Protein sequence
MTTVAAPGTH VWTLEGGLTA AFERRRGPGF ALDLRVPVGS AHDPVGREGS AGVLEEWLYK 
GAGGRNARAF QDALDDLGVR RGGGVGPEAT RFSVSGLTAD LPAALGLLAD LLLRPALPPE
ELPVLADLAR QDLEGLEDSP PDLLAIEARR RAFPRDPASP FAGYAHPASG TAEGLSNLTA
QNLRAFLNRY GTRGSVLGLV ADADPGEVRG LLERAFAGWH PGETAPVPAD FHPGLRVHVP
HAEAEQTHLS VTAPGVAPRD PDWLSWQVAL MALSGGSASR LFHAVREERG LAYSVSAAPI
LLGGRGFLAA YAGSTPERAP ETLAVLLAEL ARLPQGLTEA EFERARRGLT ASVVFGAESL
RARASSLTRD LAVFGRVRGV AEHRAQIAAL TLERVNAFLA EYDPVAQATI VTLGPAEVQA