Gene Dret_0652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0652 
Symbol 
ID8418464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp778212 
End bp780713 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content58% 
IMG OID645037215 
Producthypothetical protein 
Protein accessionYP_003197522 
Protein GI258404780 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.326198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATCA TTGTGAGTGG AGACTCTGAG TCAGCAAAGC GACCACAACG GGGTTCTTCC 
CGATGCCACT CCCGAGGCCG GGCACTGTGC GCATTAGCGC TCTGGCTTCT TAGCGTGCTT
ATTTTTATCG CTACGGCGCC GTCTGAGGTG CAGGCTCAGG CGGCACATAT CCTAGAACGT
ATGCAGCGAA TTCCTGGAGA CCGGGAGACG GTGCTGCAAT TCGAACTGGA TTCCAAGACG
CAATTTACGG TGAATGCCTT AGGTCAGCGA ATACTGGTCC GCTTTGAGCG CACCGCCCCT
GCCGAGGGGG TGAGTGGGGT CCAGGCTGGC GGGGATGTGG TGCAGAGTCG AGTGGCCCGT
ATCGGGCAAC AGACCGAGGT GACCGTCATT TTGCGACGAG CTCCGGAAGA TGTCGAAACC
CTGACATCAT CGGATCAAAG CCTGGTCCGT GTGGCGCTTC GTTGGTCAGA GCAAGGGCGC
TGGGTGCGGC CCGGGCTGAC CAATAATCCC GGTGATCGCC TGACGATCGT CCGGGAAGGG
GGGGCTGTCG GACGCTATCT TAGTTCAGAG TACAGCGGCC AGTGGGACAA TTTTTTTCAA
ACCTATTCAG CACCTGTTGC CCCCTGGGTG GATATGCAAC TGACACTGGC ACCTTACGCT
GTCTTGGCCG GAGGATTGAA TGTCGCTGAG ATTGATGCCC TTGCGCTGCG AGCAGCAAAG
CAGGGCAATT GGGAGGGGAC CGTCAGCGCG TTGACACCAC GTTACGCGAA TGCCACCTCC
CCTTCCCAGG ACACGTTGGC CGCCCGCGGC TTGTTGGCCA CAGGCGCCCC CCGTAATGCC
TTGGCTCGCC TAGAACGCCT GCCACGCCAC CTGCCTCCGA AGGGACCGTT GGATGTGCGA
GTGCACTACC TGTACTTGCA CTCTTTGATG GCACTTGATC GCCATTACCA GGCGTATACC
GCATACCACG ATATCACCTT GCCTGAGGGG ATCTCCCCAG ACATGCGCTT GTATTGGCGG
GTGTTGGGCG CGGAATTGGG GTTGAAGACC GGGCAGCCGC AAGAGGCCCT TGCTGAGCTG
GATCAACCGC CCCTGGCTGG AGTGGTCTCG GTTCGTTTAG TTGCCGCCCG TCGGGTGCAG
GCTCTTTACG ATCTTGGTCG CCGCGAAGAG GCGTGGCAAG TGAGCCGGAG AACCGAGCTG
CCGTTGGACT TTCTTCGTCG CGTGCCGGCG GCGTTAGAGC GTTACGCCGA TCTTCTTTAT
GGCCGGGGCA AATTTGCCGA GGCTATGCGC ATCTACAACG CTCTAGCTGA TTCATTGCTC
ACAACCGGGC CCAAGGCTAT GGCCTTATGG CGTATGGGCA TGAGTCTGCG CTACAGCGGG
CGTCCACAAC GGTCCAAGCA GGTCATGTCC TTAATTTTGG ACAAATGGCC GCACACGGCT
GCCGGCTACC GTGCCCGCGT CGTTCGCAAC GATATTGCCG TCATGGAGGA GATGGGCGAA
CTCAAACCCA GCCATGTCAC TGCTTACGAG AAGGTCATGA GCCAATCACC TCAGCGTAAG
GTCCGTGGCG AAGCAGCTTT TAAACGCATT CTGGTCATGC GGGCCATGGG CGAGAAGAAA
CGGGCTGTGC GTTGGTTATC TGATTTTCTT GTCGCATTCG GCGCTGGCAA CCTCCAGAAT
GAGGCTCGGG TGCTTATGGG GAGCATGCTC CCATCGGTGG TGCGGAGTTA CTTGGACCAG
GGCGATTATG TCCGAGGGTT GGCTCTTGTG GCCGAACACC GTGATATTCT CGTGCACACG
GATATGCCCC ACCCCTTTTT GGAAACAGTG GGGGACACCT TTGAAAGGCT CGGGTTATAC
GAACGCGCAG CTCGGGTTTT TTTGTACATG TTAGCCCAGA GCAATGATCC CGGGCGACGG
GAACTTTTAT TGCCTCGAAC AGTTCGGTTG TGGCGCCATG TTGGCGATCC TTTGCGCGCC
GCCGAGTATG CGGACATGTA TTTGCAAGAT TTCCCCAACG GGGACCAGGC CGGGGAAGTG
ATCGCTGAAA CTGCAGCGAC ATTTCTGGAC AATGACGAAC CAGAAAAAGC TTTGGACTGG
CTTTTGAGGC CGCAGCGTCC CTATAGTCGC CGCTTGGACG TTTTGACGGC CCGGGCGCTA
TACGCGTTGC AGGAATTCAA CCGCATGGGG CGCTATCTGG AACGGGCTAC ACTGGCGGAG
CATATGCTTT CACCGCGTAC GCGGTACATC TGGGCCGACG GGTGCTACCA GTTGGGGCAA
TACGAACGGG TCCTGCCGCT GTGGCGGACG TTGTTCGACG ATCCGATTTT TGGCACTCGG
GCACTCTACA AGGCCGCAGA TTCCTTGGCG CAGACCGGCA GGTATCGGGA CTCGGCTAAA
CTCTACACCC GTTTGGCCGA TGAGACAGAC AATCAGGTGT GGCGGAAAAT GGCCGAGGAA
AGTCGTGCCA TGGGGCAGGT CCGCGCCACA TTGGCGAACT AG
 
Protein sequence
MFIIVSGDSE SAKRPQRGSS RCHSRGRALC ALALWLLSVL IFIATAPSEV QAQAAHILER 
MQRIPGDRET VLQFELDSKT QFTVNALGQR ILVRFERTAP AEGVSGVQAG GDVVQSRVAR
IGQQTEVTVI LRRAPEDVET LTSSDQSLVR VALRWSEQGR WVRPGLTNNP GDRLTIVREG
GAVGRYLSSE YSGQWDNFFQ TYSAPVAPWV DMQLTLAPYA VLAGGLNVAE IDALALRAAK
QGNWEGTVSA LTPRYANATS PSQDTLAARG LLATGAPRNA LARLERLPRH LPPKGPLDVR
VHYLYLHSLM ALDRHYQAYT AYHDITLPEG ISPDMRLYWR VLGAELGLKT GQPQEALAEL
DQPPLAGVVS VRLVAARRVQ ALYDLGRREE AWQVSRRTEL PLDFLRRVPA ALERYADLLY
GRGKFAEAMR IYNALADSLL TTGPKAMALW RMGMSLRYSG RPQRSKQVMS LILDKWPHTA
AGYRARVVRN DIAVMEEMGE LKPSHVTAYE KVMSQSPQRK VRGEAAFKRI LVMRAMGEKK
RAVRWLSDFL VAFGAGNLQN EARVLMGSML PSVVRSYLDQ GDYVRGLALV AEHRDILVHT
DMPHPFLETV GDTFERLGLY ERAARVFLYM LAQSNDPGRR ELLLPRTVRL WRHVGDPLRA
AEYADMYLQD FPNGDQAGEV IAETAATFLD NDEPEKALDW LLRPQRPYSR RLDVLTARAL
YALQEFNRMG RYLERATLAE HMLSPRTRYI WADGCYQLGQ YERVLPLWRT LFDDPIFGTR
ALYKAADSLA QTGRYRDSAK LYTRLADETD NQVWRKMAEE SRAMGQVRAT LAN