Gene Dret_2159 details


Gene Information

Locus tag: Dret_2159
Symbol:
ID: 8420010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2454927
End bp: 2456195
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 58%
IMG OID: 645038753
Product: carboxyl-terminal protease
Protein accession: YP_003199021
Protein GI: 258406279
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
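
The coordinate and length fields above are mutually consistent, which a small sketch can check. It assumes the start and end coordinates are 1-based and inclusive and that the gene length includes one stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2454927, 2456195)
print(length)                  # 1269
print(protein_length(length))  # 422
```

Under these assumptions, the 1269 bp span corresponds to 423 codons, one of which is the stop codon, giving the 422 aa protein listed above.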


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.272376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTGA GCCATTGGCT GGGTATTGTG GCACTCCTGG CCCTTTTGAC CGCCACTCCC 
GGGCCGGGGC AGGCCACGGA CGCCAATCGA TTCGAGTCCC TGAAACGCTT CAGCCAGGTT
ATGGATCTGA TCGAAAAGAG CTATGTCCGG GACATCGACC GGGAAGAACT CATCACCGGG
GCCCTGAAAG GGATGCTCAG CGAGCTCGAC CCCCATTCAG CATACATGTC CCCCGACTCC
TTCCAGGAGA TGCAGGTGGA GACCTCTGGG GAATTCAACG GCATTGGCAT ACAGATTTCC
ATGGAAAACG GACGATTGAC CGTTGTCTCT CCTATCGAGG ACACCCCGGC CTACGAGGCA
GGCTTTGAGG CCGGCGACAT CATCATGGAG ATCAACGGCG AATCGACCCA GGATATCACC
TTGATGGAGG CGGTGAAAAA AATCCGCGGG CCCAAGGGCA GCACGGTCGA TCTCAAGGTC
CTGCACCCCG AGGCCCAGAA ACCGGAAACA ATCACTGTCA AACGGGACAC CATCCCCTTG
GAATCGGTCA AATCCGAACC GCTGGGTGGC GGCTATCTCT ACCTCAGGGT CACCAATTTC
CAGGAAAAGA CCACCGAGGA TCTGCAAAAG GCCTTGCACA AACGCGACGG CCGTCTCGCG
GGCGCGATTT TGGACCTGCG CAACAACCCC GGAGGCCTTT TACCCCAGGC TGTCTCTGTC
GCCGACACCT TTCTCAAGGA AGGGAAGATC GTCTACACCG AGGGCAAGGT CAAAAATGCC
AAAATGGAAT TTTCGGCCCA GAAGCAGCAA TCCGACATCG ATGTGCCGCT GATCGTCCTT
ATCAATCCCG GTTCAGCCTC GGCTTCCGAG ATTGTCGCCG GGGCCCTTCA GGACCAGCAA
CGCGCCGTGA TCCTCGGCGA ACGGTCCTTT GGCAAGGGCT CCGTCCAAAC GGTCATCCCC
CTGACAGACG GCTCGGGCAT CAAATTGACC ACGGCGCTGT ACTACACCCC GAATGGCCGT
TCGATTCAGG CCGAAGGGAT CATTCCAGAC ATCGTGGTTC CCTTTTCTCC GCCACAGGAG
ACCAACGGCA CCGCCCGCCC GATTCTCCGG GAACAGGATC TCACCAGGCA CTTGGAACAA
ACCGAGAATC CGGAACACGA CTCGGTCAAA CGCAGCGACA AGGCCAAAGA GATGCTCGAA
AAAGACAATC AACTCCGAAT GGCGCTGCAA CTCGTCAAGA GCCTGCCCCG CCTCCGGAAA
ATCCAATAA
 
Protein sequence
MRLSHWLGIV ALLALLTATP GPGQATDANR FESLKRFSQV MDLIEKSYVR DIDREELITG 
ALKGMLSELD PHSAYMSPDS FQEMQVETSG EFNGIGIQIS MENGRLTVVS PIEDTPAYEA
GFEAGDIIME INGESTQDIT LMEAVKKIRG PKGSTVDLKV LHPEAQKPET ITVKRDTIPL
ESVKSEPLGG GYLYLRVTNF QEKTTEDLQK ALHKRDGRLA GAILDLRNNP GGLLPQAVSV
ADTFLKEGKI VYTEGKVKNA KMEFSAQKQQ SDIDVPLIVL INPGSASASE IVAGALQDQQ
RAVILGERSF GKGSVQTVIP LTDGSGIKLT TALYYTPNGR SIQAEGIIPD IVVPFSPPQE
TNGTARPILR EQDLTRHLEQ TENPEHDSVK RSDKAKEMLE KDNQLRMALQ LVKSLPRLRK
IQ
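
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks might be joined, the GC fraction computed, and codons translated. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with the standard code (the two differ only in which start codons are permitted); this sketch does not model start-codon handling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; stop codons appear as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_line = clean("ATGCGTTTGA GCCATTGGCT GGGTATTGTG")
print(translate(first_line))   # MRLSHWLGIV
print(translate("ATCCAATAA"))  # IQ*
```

The first thirty bases of the gene sequence translate to the first ten residues of the protein sequence, and the final nine bases give the last two residues followed by the stop codon.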