Gene Dret_0736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0736 
Symbol 
ID8418550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp868353 
End bp869594 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content56% 
IMG OID645037301 
Producthypothetical protein 
Protein accessionYP_003197606 
Protein GI258404864 
COG category[R] General function prediction only 
COG ID[COG1896] Predicted hydrolases of HD superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000863321 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGTA TTCGCAAGGG GCTATTGCAG CTTGTTTTTT CCGGATCCTA TATGAAACGC 
TGGAACGACA AGCTCCGGCC CATGGAATTG TGGGAGGTCG ACAAGCAGGC CCACAAAATG
ATCGTGGCCT GGCTGCTTTT TCTCTGCAAC ACCCGCTCCA TGTCTGAGGC CCAGCGAACT
GAGGTGGGCA ATGGAATTAT CGAAGGGGGG CTCTTTGAGT ATTTTTACCG CTTGGTGATC
ACCGATATCA AACCCCCGGT CTTTTACCAG ATCAAAGCCA ACCCGGAACA CTACGAGCAA
TTGACCAAGT GGGTCCTGTC CCAACTCCAT CCCCGGGTTC GGCCCCTGGG CGGGGCATTT
TGGGAACGCC TGGAAGCGTA TTTTCTTTTG CCTTCCGAAC ACACCCTGGC CGAGGACATC
CTCGAGGCTG CCCATATGTG GGCCAGTTCC TGGGAGTTCC AACTCATCAA GGGGGTCAAT
CCATGGGACG ACGAACTCGA GGCCATCGAA GCCAATTTCG CGGAGAAACT GGAAGCCAAA
TCCCATCTCC ACGGGGTCTC GGAGATTACG GCCGGTCCCC ACTCCGCGCT GGGGCGGTTG
GCCCACCTCT GTGGGCAATT GCGTTTTCAA AAACGGTGGT CGCAGATTCC GCGCATCCCT
GAAACATCGG TCCTCGGGCA CATGTTTATC GTGGCCAGTT ATGCGTTTTG CATGAATCTC
GTGTTGGAGA CAGGACAACG GCGGCGGATG AATACGTTCT TTTCCGGTTT GTTTCACGAT
TTGCCCGAAT TGCTGACCCG GGACATCATT TCCCCGGTCA AACGGTCGGT GCAGCCGATC
GGGGAGATGA TCAAGGAGTA CGAAGAGCAG GAATTGACTC GGCGGGTCCT GTCTCCGCTT
CAAAGTGGCG GGCACAGCGA TGTTGCCCAA ACCTTGTCCT ATTATCTGGG GTTGGAGGCC
GGTTCGGAGT TCGCGGATAC GGTGCGCGAA AACGGGGTGG TGCGCAGGGT GGAGTGGGAC
AATTTCGCCA GCCAGTGGGA TCGTGACGAA CTCGACCCCA AGGACGGCCG GGTGGTGAAG
GTCTGCGACC ATCTGGCCGC GTTTATCGAG GCCTATACCT CCACCCGCAA CGGTATCAAC
ACCGACCAAT TACAACAGGC GTTGTGGCGT TTGCGCAGTC AATACAGCCA GGTTTCCCTG
GGCGAACTCC ATATCGGAGC CCTGCTGGCG GATTTCGATT GA
 
Protein sequence
MTSIRKGLLQ LVFSGSYMKR WNDKLRPMEL WEVDKQAHKM IVAWLLFLCN TRSMSEAQRT 
EVGNGIIEGG LFEYFYRLVI TDIKPPVFYQ IKANPEHYEQ LTKWVLSQLH PRVRPLGGAF
WERLEAYFLL PSEHTLAEDI LEAAHMWASS WEFQLIKGVN PWDDELEAIE ANFAEKLEAK
SHLHGVSEIT AGPHSALGRL AHLCGQLRFQ KRWSQIPRIP ETSVLGHMFI VASYAFCMNL
VLETGQRRRM NTFFSGLFHD LPELLTRDII SPVKRSVQPI GEMIKEYEEQ ELTRRVLSPL
QSGGHSDVAQ TLSYYLGLEA GSEFADTVRE NGVVRRVEWD NFASQWDRDE LDPKDGRVVK
VCDHLAAFIE AYTSTRNGIN TDQLQQALWR LRSQYSQVSL GELHIGALLA DFD