Gene Dret_2401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2401 
Symbol 
ID8420261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2745972 
End bp2747219 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID645039002 
ProductSte24 endopeptidase 
Protein accessionYP_003199261 
Protein GI258406519 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTCT GGCTTTTGGG CCTCATGGGC CTGCTCCTTC TCCATTATTG TATCCATTGC 
GGTGTGGAAT GGCTCAACCT GCGTTCTCTG TCCACAGCCG TGCCAGTGGC GGTCCAAGAC
ACCATCGACG CGTCGACGTA TGCCAGATCG CAGGCCTATA CCACCTCCCG CACCCGTTTG
GACATCACAA CCGCCAGCCT GGACCTCGGA GTGTGGCTGC TCCTTTTGGG CAGCGGCGTG
CTCGGGGATC TCGACGCCTG GATCGGTGCC GCCGGCTTGG GAGAGACAGT CTCCGGGCTG
GTCTTTTTCG CCGCCCTGGG CCTGGGATTG TATCTCGTCC ATCTCCCTGT ACACATCTAC
GCCACCTTTC GCATCGAGCA GCGCTACGGT TTCAACACCA CCACCGCCGG GGTGTTCTGG
GCCGATCAGC TCAAAACCCT TGTCCTGACC GCACTCCTCG CAGGGGTCCT GCTCAGCACC
GTCCTGCTCT TTTTCCAGGC CTTTCCCCGT ACAGGCTGGC TCTGGGCTTG GCTGAGCATC
AGCCTGGTGG TGCTTCTGCT TCAGGTCGTG ACGCCACGCT GGATCCTGCC CCTGTTCAAC
CGGTTCACCC CCCTTGAGGA GGGTCCCTTG CGGCAGCAGT TGACCGACTT GGCCCATGCG
GCAGGCTTCC GCCTGGCCTC CATTGCGGTC ATGGACGGCT CCAAACGCTC GACCAAGGCC
AACGCCTTTT TCGCCGGCCT GGGCAAAACG AAGCGTATCG CGCTGTTCGA CACCCTGGTG
CAGACCTTGA CTCCCCGGGA AGTCGCCGCG GTGCTGGCCC ATGAAATCGG GCACAATGTC
TTGGGGCATA TTCCCCGCCT GATCGGTGGT ACGGTACTCA AAATCGGTCT TTTTCTCGCT
TTGTTCGCTT TGCTCAAGGA CCACCAGGGA TTGATCCAGG GCGCCGGTTT CGAAGAGGCC
AGCCTCCACG CCGGGTTGAC CGTCTTTTTC CTTGTACTGA CCCCGGTGGG ACTCCTGCTT
GGAGCCTGGC ACAATACCCG TGCCCGGCGC TACGAATTCG AGGCCGACCG CTATGCGGCC
CGGTTGACCG AGGCGCCCCA GGACCTTATC TCGGCCTTGA AGCGGTTGGC CGCCCACAAT
ATGGCCAACC TCACCCCCCA TCCCTGGCAT GTGGCCTTGT ACGCTTCCCA CCCTCCGCTG
CTCAAACGCC TCGAAGCCCT GGACCAAGAG GCCGGCGACC ATGTCTGA
 
Protein sequence
MSFWLLGLMG LLLLHYCIHC GVEWLNLRSL STAVPVAVQD TIDASTYARS QAYTTSRTRL 
DITTASLDLG VWLLLLGSGV LGDLDAWIGA AGLGETVSGL VFFAALGLGL YLVHLPVHIY
ATFRIEQRYG FNTTTAGVFW ADQLKTLVLT ALLAGVLLST VLLFFQAFPR TGWLWAWLSI
SLVVLLLQVV TPRWILPLFN RFTPLEEGPL RQQLTDLAHA AGFRLASIAV MDGSKRSTKA
NAFFAGLGKT KRIALFDTLV QTLTPREVAA VLAHEIGHNV LGHIPRLIGG TVLKIGLFLA
LFALLKDHQG LIQGAGFEEA SLHAGLTVFF LVLTPVGLLL GAWHNTRARR YEFEADRYAA
RLTEAPQDLI SALKRLAAHN MANLTPHPWH VALYASHPPL LKRLEALDQE AGDHV