Gene Dret_0725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0725 
Symbol 
ID8418538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp857004 
End bp858065 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID645037289 
Productpeptidase M24 
Protein accessionYP_003197595 
Protein GI258404853 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00261455 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.742968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACCC GACACCGCCG CGCCGCTCTG CGCAGCGTCC TTTCTGATCA GGGCCTCGAC 
GCCTTGATCG TCTCCCATGC CGCCAACCGG TACTATCTGA GCCATTTTGA ACTCCACGAC
CCGCAGTGTA ACGAAAGCGC GGGCTGGATC GTGGTTACCG CCCAGGGCCG GGACTGGTTG
CTGACCGATC CCAGGTATAC CGAGGCCGCC AAACAGGTCT GGCCCGCTGA GGACCTCTTC
GTCTATACCG GAAAACGCAA CACCTCCGTC AGTGGCATGC TACGCGATTT GGGGCTGGAG
ACCATCGGCT TCGAGGCCCG TAGCCTGGAC GTGGAAACTT TTCAGGAACT CAGCCGCGAC
CTCAACCTCC GGCCAACCAC CAACTTGGTC GAAAATCTCC GTTTGAGCAA AGACACCCAG
GAAATTGAAT GCCTGCGGCA ATCCTGCAAA CTCAATCACT TTGTTTTCGA ATCTGTGGAG
GCAATCCTCC AACCCGGGCG AACCGAGGCC TGGCTGTCCT GGCAGATCGA AAAACTCTTC
CGGGAAAACG GCGCCACCGA GTTGGCCTTT GCCACCATTG CCGCAGTGGG GCCAAACGCG
GCCCTGCCCC ATGCCATCCC CGGTGAGACT CCCATTACGG AACAATGCCC GGTGCTGATC
GATACCGGCG GACGCAAAAT GCAGTATTGT TCAGATCAGA CCCGCACATT TTGGGTCGGT
CAGACTCCCT CGCAACAATT CCTGCAGACC CGCGAACGGG TCCAGGAGGC CCAGCGCAAG
GCCATCGAGG CCATTGCCCC GGGAATGCCG GTCAAAGACC TGTATACAGT GGCCAAGGAA
ACCTTCCGCG CCCACGGCCA GGAGGACTAT TTCACCCACG CCCTGGGACA CGGCATCGGT
CTGGAAACCC ATGAAGCCCC GAGTCTGAGT CCCTACAGCG AACACCTTCT GCAACCCGGC
ATGGTCATCA CCATTGAACC CGGACTGTAT TACCGGGAAT GGGGCGGCGT ACGCTGGGAA
CACATGGTTC TGGTGACGGA CAATGGCGCC GAAGTCCTCT GA
 
Protein sequence
MPTRHRRAAL RSVLSDQGLD ALIVSHAANR YYLSHFELHD PQCNESAGWI VVTAQGRDWL 
LTDPRYTEAA KQVWPAEDLF VYTGKRNTSV SGMLRDLGLE TIGFEARSLD VETFQELSRD
LNLRPTTNLV ENLRLSKDTQ EIECLRQSCK LNHFVFESVE AILQPGRTEA WLSWQIEKLF
RENGATELAF ATIAAVGPNA ALPHAIPGET PITEQCPVLI DTGGRKMQYC SDQTRTFWVG
QTPSQQFLQT RERVQEAQRK AIEAIAPGMP VKDLYTVAKE TFRAHGQEDY FTHALGHGIG
LETHEAPSLS PYSEHLLQPG MVITIEPGLY YREWGGVRWE HMVLVTDNGA EVL