Gene Dret_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1778 
Symbol 
ID8419619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2048528 
End bp2049874 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content59% 
IMG OID645038362 
ProductPeptidase M23 
Protein accessionYP_003198640 
Protein GI258405898 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000277073 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGGC GAAAAGGACA AGGATCCTCA CGTAAAGGCA CGTTTCTCGT TTTTCTGCTT 
CTGCTTCTCC TGGCAGGCGG AGGAACGCTG TATTTCTTGA ATGCGGAAGG CACTCCCCCG
CAAATCACCC TGACACCGCA AACCTCGTAT ATCGGGCAGG ATGCCCAACT CGAAGTCGCG
TTGCAGGATC CGGACAGTGG GTTACGGCAG GTGACCGTCA CGGCTGTCCA GAATGGGACC
CGGATTCAGG TCCTGCCCCC GACGGACATT CGAGGCGGAC AATGGCAGCA AGCCTTCTCT
TTGAAGAATC TCGGGTTGCG CGAGGCCCCG TTCGAGTTGC AGGTCGTGGC CACGGATCGC
TCCTGGCGCC GGCTGGGCAA GGGGAATATC GCCCAAGTGC AGCACCAATT TACCCTGGAC
ACGACCATGC CCTCGGTCGG GGTGCAGTCT GTGCACCATA ACCTGAACCA GGGTGGCTCG
GGGATGGTTA CCTTTTCGGC CAGCGAACCC CTCGCTCGGG CTGGGGTGCA GATCGGGGAG
AGCTTTTTCC CGGCCTACCA GACCGAAAAC GACGACTGGA TCTGTCTGTT TGCCTTCCCC
TATTTCGCCA CACCAGGAGA GGACCACCCC GTCCTTGTGG CTGAAGACCG GGCCGCGAAC
AAATTCAGGA CCGGGTTTAC GTATCACGTC AATGCCCGCA ATTTCCCCCA GGACCGCATT
CGGGTTTCCG ATGCCTTTTT GCAGGCCAAG ATGCCGCAAT TTCAGAATGA TTTTCCGGAG
GCCGGGACCT TGCTTCAGGT TTTTCTGCAG GTCAACGGAC CCATGCGGGA TCAAAACCGG
TCCTGGTTGC GGGAGGTCGG TCAAAAAACG GTGGCCAAAC GGTTGTGGGA GGGCAAGTTC
CTGCGCCTGC CGAACGCCGC CCGCAGGGCA TCGTTCGGCG ACCAACGGAC GTATGTCCAT
GGTGGAGAGG CCATCGACCG AGCGACCCAT CTGGGCGTGG ATCTGGCTTC AGTGGCGCGA
GCCGAAGTCC CGGCCGCGAA TTCAGGTCGG GTGGTCTTCA GCGATTTTCT GGGAATCTAT
GGCAATGTGG TGGTCATCGA TCACGGTTTC GGTTTGCAAA GCCTGTACGC CCACTTGAGC
AAGAGTATGG TCCAGGAAGG TCAAGAGGTG ACCAAGGAGC AGATTATCGG CAAGACCGGC
GCCACGGGGC TGGCCGGCGG CGACCACCTC CATTTTGCTA TGCTGGTCTC CGGGCAGCCG
GTCAATCCGG TGGAATGGTG GGACACCAAC TGGATCGCGA ACAATATTCT CTCCAAGTGG
GAATATCTGC AAGAAAACGC TTCGTAG
 
Protein sequence
MARRKGQGSS RKGTFLVFLL LLLLAGGGTL YFLNAEGTPP QITLTPQTSY IGQDAQLEVA 
LQDPDSGLRQ VTVTAVQNGT RIQVLPPTDI RGGQWQQAFS LKNLGLREAP FELQVVATDR
SWRRLGKGNI AQVQHQFTLD TTMPSVGVQS VHHNLNQGGS GMVTFSASEP LARAGVQIGE
SFFPAYQTEN DDWICLFAFP YFATPGEDHP VLVAEDRAAN KFRTGFTYHV NARNFPQDRI
RVSDAFLQAK MPQFQNDFPE AGTLLQVFLQ VNGPMRDQNR SWLREVGQKT VAKRLWEGKF
LRLPNAARRA SFGDQRTYVH GGEAIDRATH LGVDLASVAR AEVPAANSGR VVFSDFLGIY
GNVVVIDHGF GLQSLYAHLS KSMVQEGQEV TKEQIIGKTG ATGLAGGDHL HFAMLVSGQP
VNPVEWWDTN WIANNILSKW EYLQENAS