Gene Dret_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0172 
Symbol 
ID8417976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp218129 
End bp219526 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content58% 
IMG OID645036737 
ProductPeptidase M23 
Protein accessionYP_003197052 
Protein GI258404310 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0593266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTG CCCCGCATTC ACGACACAAG CTCCATTTTC AGCGTCCCCG GAAGACCTCT 
CCGCGGCGGG TCGGCATTGT CGCTGTCCTC GGTCTGATTT TGTTTGCCGC CAGCCTTATT
TCCCTGCCTG AAGACGAACC GCCCAAGCCG AACTTCCAGG ACACATCCTC GCCGGTACAG
GCGGGTCAGC CCCAGCAGAC ATCCAAGGCA CAGCAGACTG CCAGCAAACC AGTTGCCAAA
GAACCCTCCC TGGGCAGCCA AATGGCCGCT CCCCGGGCCG AAGCGGCCTT GCCGCCGTTG
GAAACGACCA AGGGGACCAT CCAGCCCGGC CAGACCGCGA CACAATTGCT GGCAAACTAT
TTCGAAGCAC CCACGATTTA CGCCTTGAGC CGCCAATGCG AGGAGGTCTA TCCCCTGACC
CGCCTCAAAG CCGGACAACC TTACACTATC GCCTCGCGCA ACGGCGAATT CAAACAGCTC
AGTTATGAGA TAAACGAAAC GGCGAAGCTG CTGATCGAAA AGCAGGACGG CGAATTTTGC
GTTGCCAAAC AACCGATCGC CTACGAAACC AGGAAAACCT TGGTCAGTGG AACTATACAC
TCCAGCCTGT ACACGGCAGT CTCTGAAGCC GGAGAAGAGC CGGAATTCGC GTTGTTGCTT
GCAGAAATCT TTGCCTGGGA TGTCGATTTC GTCCGGGATC TGCGCCAGGG AGACCATTTC
ACCGCCCTTG TGGAAAAACG GTACCGCAAA GGAAAGCACA GCGGATACGG CCGGATTCTG
GCCGCCAGTT TCACCAATAA AGGCAAAACG TTTCAGGCCT TCTGGTACGA GGATACCCAG
GGAGAAGGCT CCTATTTCGA TGCCCGCGGA CAATCGGTAC GCAAAGCCTT TTTGAAGGCA
CCGCTTTCCT TCACCCGCAT CTCCTCAGGG TATTCCAACA ACCGCTTGCA CCCCGTGCTC
AAAATCCGCC GCCCCCACCA CGGCATCGAT TACGCGGCCC GAACTGGCAC CCCTATCAAA
ACCGTAGGTG ACGGTGTTAT CATGACTCGC TCCTACGCCA AAGGCGCGGG CCGGTACGTC
AAGGTCCGCC ATCCCAACGG CTATGTCACG GTCTATAACC ATATGAGTCG CTTTGCCAGC
AATCTGCGGA CCGGCCAAAA AGTCCGGCAG GGGGAGGTCA TCGGTTATGT AGGCAGCACC
GGACTCTCCA CCGGCCCCCA TCTCGATTTT CGCATGAAAA AACACGGGAC GTATGTCAAT
CCGCTCAAGG TGGAGTCTCC CCCGTGCGAA CCGGTTCCCA GCGAAGAGAA AGAACGATTC
CAGGCCCACA TCCAACCGTT GCTCGCCCAA CTTCAGCAGG CGGAACAGCA ATACCTGGCC
ACGGCCGAGA CCCCGTAA
 
Protein sequence
MRIAPHSRHK LHFQRPRKTS PRRVGIVAVL GLILFAASLI SLPEDEPPKP NFQDTSSPVQ 
AGQPQQTSKA QQTASKPVAK EPSLGSQMAA PRAEAALPPL ETTKGTIQPG QTATQLLANY
FEAPTIYALS RQCEEVYPLT RLKAGQPYTI ASRNGEFKQL SYEINETAKL LIEKQDGEFC
VAKQPIAYET RKTLVSGTIH SSLYTAVSEA GEEPEFALLL AEIFAWDVDF VRDLRQGDHF
TALVEKRYRK GKHSGYGRIL AASFTNKGKT FQAFWYEDTQ GEGSYFDARG QSVRKAFLKA
PLSFTRISSG YSNNRLHPVL KIRRPHHGID YAARTGTPIK TVGDGVIMTR SYAKGAGRYV
KVRHPNGYVT VYNHMSRFAS NLRTGQKVRQ GEVIGYVGST GLSTGPHLDF RMKKHGTYVN
PLKVESPPCE PVPSEEKERF QAHIQPLLAQ LQQAEQQYLA TAETP