Gene Dret_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1404 
Symbol 
ID8419233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1637872 
End bp1638903 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content58% 
IMG OID645037979 
Productmetal dependent phosphohydrolase 
Protein accessionYP_003198269 
Protein GI258405527 
COG category[R] General function prediction only 
COG ID[COG3481] Predicted HD-superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.248325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCAA AGAACCTGTT TGTTTCCGAT CTGCAACAGG GCTCGGCCAT TGAAGATCTG 
TTCCTTATTG CCGAAGCCCG AAGCGCGGAA ACACGCAACG GCCAGCCCTA TTGGGATCTC
GTCCTCCAGG ACGCAACCGG GAAGGTGTCG GCCAAGATAT GGGCGCCGTT GAGCCAACAG
GCCAACGGCC TGGCTCCAGG GCAATTTCTT CATGTCCAGG CCAAAGTCGA ACTCTTTCGC
GAAAAATTCC AGCTCAATAT CACTCGGTTT GAGGAAATCG ATCCTGAAGG GGAGACATTG
GACTGGTCGG CATTTGTGCC CCGGACAGCA GAAGCGCCGG AAACCATTTT AGAGGATCTG
GAACAACTCT GCAGAGACGA ATTGCAGCAC AAACCGTGGC GCGCCCTCTG CCGCTCCGTG
CTCCGGGATC CTGAAATCCG GCAACGACTC CTGCAGGCCC CCGCGGCCAA ATCGGTCCAC
CACGCCTATC GCGGTGGCCT TCTGGAACAC ACCCGACAAG TCTGTCGCGT CTGTCTGCAA
TTCGCCGCGC TCTATCCGGA TCTGGACAAG GAATTGCTCT TTGTCGCGGC CTTGTTCCAT
GATTTCGGCA AGGCCTGGGA ACTTGAGGGG CTGGCGACAT GGGATTACAG TGATGCTGGC
CAACTCCTGG GGCATATCCA TCTTGGGCTC GAACGTCTCG AGCCGTTTCT GCGCCGCCAA
AAAGGCCTGG ATCCGGAATT GGCCCTGCAC CTCAAGCACG CCATCCTCAG CCACCACGGG
GAGTTGGAGT TCGGGTCGCC GAAACGACCG AAAACCCCGG AAGCCTTTGC CCTGCACTTT
GCAGACAACC TCGATTCCAA ATTGACAACA GCGTCCGCCG CGCTCAATGA CCTGGGCGAA
CATGATGGGG GGTGGACACC GAAAGTCTGG GCGCTCCAGC GGCAACTGTT CAAACGGACA
CCGACACCGG CTCCGATGGC GACACAACAC CGCCCCAGGG AGGATCAATG TTCATTACCT
TTGAAGGAAT AG
 
Protein sequence
MQSKNLFVSD LQQGSAIEDL FLIAEARSAE TRNGQPYWDL VLQDATGKVS AKIWAPLSQQ 
ANGLAPGQFL HVQAKVELFR EKFQLNITRF EEIDPEGETL DWSAFVPRTA EAPETILEDL
EQLCRDELQH KPWRALCRSV LRDPEIRQRL LQAPAAKSVH HAYRGGLLEH TRQVCRVCLQ
FAALYPDLDK ELLFVAALFH DFGKAWELEG LATWDYSDAG QLLGHIHLGL ERLEPFLRRQ
KGLDPELALH LKHAILSHHG ELEFGSPKRP KTPEAFALHF ADNLDSKLTT ASAALNDLGE
HDGGWTPKVW ALQRQLFKRT PTPAPMATQH RPREDQCSLP LKE