Gene Dret_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1148 
Symbol 
ID8418976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1344161 
End bp1345411 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content62% 
IMG OID645037723 
Productmetallophosphoesterase 
Protein accessionYP_003198014 
Protein GI258405272 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.809261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTTC GTTTTCTCCA CGCCGCCGAC CTCCATCTCG ATAGCCCGCT GCGGGGCCTG 
GAAACCTATC CCGACGCCCC GGTGGAGCAG ATTCGTAACG CCACCCGCCG GGCCCTGGAC
AATCTGGTCA CCCTGGCCCA GGAGCAGGAG GTCGCCTTTG TCCTTTTGGC CGGGGATATC
TTCGACCAGT CCTGGCGCGA CTTCCACACC GCCCTGTTTT TCGCCCAGTG CATGGGCCGG
CTTCGTGAGG CCGGGATCCC GGTCTATGGG GTCAGCGGCA ACCACGACGC GGCCAACCCC
ATCGGCAAGA CCTTGCGCCC GCCGGACAAT GTCCACTTTT TTTCCGCCAC CAAACCAGGG
TCGGTGACCC TGGAGCACTG CAACACAGTC ATCCACGGCC AGAGCTATTC CAGCCGCGAG
ACCAGCGAGG ACTTGGCCGC CGAGTATCCG CCGGCCGTCG CCGGGGCCTT GAATATCGGC
CTGCTGCACA CCAGTTTGAC CGGACGTCCC GGTCACGAAC CCTACGCCCC GACCCATCCG
GATATCCTGG GCAACAAGGG CTACGACTAC TGGGCCCTGG GCCATGTCCA CGAACGAGAG
GTTGTCACCC GTGATCCCTG GATCGTCTTT CCCGGCACCA TCCAGGGCCG GCACATCCGG
GAAACTGGTC CCAAAGGGTG CAGTCTGGTC GAGGTCGAGG ACGGTCGCAT CAGGGATGTC
GTGCATCAGG ACATTGATGT CCTGCGCTGG TTTCGCGGTA ACGTCGAGTG TGGTCCTTGT
ACCTGCGATG ACGATGTCCG CCACGCAGTC CGCGCAGAAC TGCAGGCCGC CCGGGACGCC
GGTGAGGGCC GCCCGGTCGC GGTCCGCCTG GAATGCACCG GCGCGACCCA AATGCACGCC
CAGCTCCACG ACAGGCAACG CCATTTCCAG GAGGAATGGC GGACACTGGC CGCTGAAATG
GGCGATCTGT GGATCGAACA GATTCGGCTG CACACCCGTC CACCAGAAGA TCAATCCCAG
GAGGTGGATC CGGAATCACC GCTGGGGGAA TTGATGCAGT GTATTGCAGC CCAGGAATTG
CCGGAAAGTT GCACCAATGA GTTGGAAGAT TTGATGAAAC AACTGCCCAA GGAGATCACC
GAGGGCGAGG AGGGTTTCAA TCTCAAGGAT CCACAGCAGT GGCAGCGGAT GCAGGACGAC
GTCCGTGAAC TCCTGCTTGG GCGTTTGCTG CGCCAGGGAG GTCCGCAATG A
 
Protein sequence
MTFRFLHAAD LHLDSPLRGL ETYPDAPVEQ IRNATRRALD NLVTLAQEQE VAFVLLAGDI 
FDQSWRDFHT ALFFAQCMGR LREAGIPVYG VSGNHDAANP IGKTLRPPDN VHFFSATKPG
SVTLEHCNTV IHGQSYSSRE TSEDLAAEYP PAVAGALNIG LLHTSLTGRP GHEPYAPTHP
DILGNKGYDY WALGHVHERE VVTRDPWIVF PGTIQGRHIR ETGPKGCSLV EVEDGRIRDV
VHQDIDVLRW FRGNVECGPC TCDDDVRHAV RAELQAARDA GEGRPVAVRL ECTGATQMHA
QLHDRQRHFQ EEWRTLAAEM GDLWIEQIRL HTRPPEDQSQ EVDPESPLGE LMQCIAAQEL
PESCTNELED LMKQLPKEIT EGEEGFNLKD PQQWQRMQDD VRELLLGRLL RQGGPQ