Gene Dgeo_2314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2314 
Symbol 
ID4059261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2433137 
End bp2434849 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content65% 
IMG OID641231362 
Productpeptidase M3A and M3B, thimet/oligopeptidase F 
Protein accessionYP_605775 
Protein GI94986411 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02289] oligoendopeptidase, M3 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTG CCCTGAAGGC CGTTGAGCGC GTCTTGCACG TCCCTGACGA ACAGACCCGC 
TGGGAAACCT ACGCGCCGCG TTACGAGCAG CTTCTTCAGG CTGACCTGAC GGCGGAAGGG
GTGCCCGATT GGCTGGCCGA GTGGAGTGCG CTGGACGCGG AGGTGTCGCA GGCGGAGAGC
AAGCTGGCCA CCCATGCCGA TCTGCACACG GATGACCCGG ACATTCAGGC CCGCTACGCG
CGGTTTCTAG AAAACGTGTC ACCTCAGGCA CAGCGGGCCG AGCAGGCCCT GACCAAGAAG
CTGCTGGCAG TCCCCGGCTA CACTCCGGCT CCCGACTTTG CCCTGGCGTA CCGCCGCTTT
CGCGACGCTG CCGCCCTGTT CCGTGAAGCG AATGTCGAAC TGGGTGTCAC GCACGAGGCG
CAGATGAACC GTCACGGGGT GATCACGGGG AATCAGAAGG TGATCCTGAA CGGCGAAGAG
CGGACCGTGC CGCAGGCCAA GCAGCTTCTC GACAGCCCCG ACCGGACCGA GCGTGAGGCG
GCCTGGCAGG CGCTTGCCAC GAGCAACCAG GAGGTCGCCC CTGCGCTCGA CGCGTTGATG
CTGGAGTTGA TCGGCACGCG CCAGCAGCTC GCCTGGAACG CCGACCTGCC GAGCTACCGC
GACTTCATGT GGCGGCGTCT TGACCGGGTG GATTACACGC CGGGGGACTG CCGCGCCTTT
CACGAGGCCG TGCGGGAAGA GGTGGTGCCC CTCGTTGCCG AGATCATGGG CGGGATCGCA
GCGCGGTTGG GCCTGGAGTG CGTGCGCCCC TGGGACTACA ACCGCAACAA CCTGCTCGAC
CCGGCAGGTC GCCCGCCGCT CCAGCCCTTC AGGACAGGCG CCGAACTAGA AGAGCTTGCG
CAGACTGCTT TTGAGGGGTT AGACGGCGAG CTGGCCGCGC GGTTTCGCTC CCTGCGGGCG
GGCCTGCTCG ACCTGGAGTC GCGGCCCGGC AAGATGACGC ACGCCTACTG TCAGTACTTC
CCTGTCGCGA ACGAGCCCTT TGTGCTCATG AACGTGGTGG GCACCTCCGA GGACGTGCGG
GTGCTCTTTC ACGAGGTGGG GCACGCCTTC CACGGCTTCC TGAGTGGGGA CGCGCAGCCG
CTGGTCTGGA ACCGCTGGAG TCCGATCGAG TTCATCGAGA TTCCCTCGAT GGCGATGGAG
TTTCTCACGC TGGACCACCT GGGGCACGTG TTGAACCGGG AAGAACTCGC TCGCTACCGC
GAGAAGCAGC TGCAGGGGGT GGTGGCTTTC CTGCCCTGGG CGGCCCAGAT GGACGCCTTC
CAACACTGGC TGTACGCGGA GGCTGGCCCG AATGTCACCA TCCGTGACCT TGACGCCAAG
TGGCTGGAAC TGGACCGGAC CTTTCATCCC TTTGTGAATT GGGAGGGTCT GGACGAACGC
GTGCGGGCCA AAGGCTGGCA CTACTACCAC ATCTTCCGTG CGCCCTTCTA CTACATCGAA
TACGCGATGT GCTCCCTCGC GGCAGTTGGG ATCTGGCGGG AAGCGCGGCA GCATCCGGCG
CAAGCTCTCG CCCACTACAA GGCCAGTCTG CGCCTGGGCA GCACGGTGCC GGTGCCTGAG
CTGTACCGTG CCGCCGGGGT GGAGTTCCGC TTCGACCGCG AGCATATCCG GGGCTTGATG
GCCTTCCTGA AAGAGCAGTT GGCGGAAGGC TGA
 
Protein sequence
MTTALKAVER VLHVPDEQTR WETYAPRYEQ LLQADLTAEG VPDWLAEWSA LDAEVSQAES 
KLATHADLHT DDPDIQARYA RFLENVSPQA QRAEQALTKK LLAVPGYTPA PDFALAYRRF
RDAAALFREA NVELGVTHEA QMNRHGVITG NQKVILNGEE RTVPQAKQLL DSPDRTEREA
AWQALATSNQ EVAPALDALM LELIGTRQQL AWNADLPSYR DFMWRRLDRV DYTPGDCRAF
HEAVREEVVP LVAEIMGGIA ARLGLECVRP WDYNRNNLLD PAGRPPLQPF RTGAELEELA
QTAFEGLDGE LAARFRSLRA GLLDLESRPG KMTHAYCQYF PVANEPFVLM NVVGTSEDVR
VLFHEVGHAF HGFLSGDAQP LVWNRWSPIE FIEIPSMAME FLTLDHLGHV LNREELARYR
EKQLQGVVAF LPWAAQMDAF QHWLYAEAGP NVTIRDLDAK WLELDRTFHP FVNWEGLDER
VRAKGWHYYH IFRAPFYYIE YAMCSLAAVG IWREARQHPA QALAHYKASL RLGSTVPVPE
LYRAAGVEFR FDREHIRGLM AFLKEQLAEG