Gene Dret_0954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0954 
Symbol 
ID8418774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1128018 
End bp1129418 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content48% 
IMG OID645037521 
Producthypothetical protein 
Protein accessionYP_003197820 
Protein GI258405078 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.01555e-10 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value5.72086e-08 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGTCTG CTCTAAATCA GAGCGATATT TATAAAGACC GTGCTCATCG GAAGTACTTT 
AGCGATATGA AAAAATTATT TGACTACGTA TACGATATAA TTGACGTACT CGGGATGTCA
CTGTTTGGAA ATATTGAAAA GCTCCGGGTG CATAAGCTAT ACAATATCGA TCCAAAGATG
CTTTCAGCGA TCATTGAGAC CAAGCTAGAG CATTACTTTT CTGGCTTCGA ATGCCCTCTA
TGTGGTTGCA AGTGTCGTCA TAATAACAAA TTAAAGAGAC ATATTGAAAC TACTTACGGA
ACCATATCAT TTGATTCACC TTATTATAAA TGCAAAAATT GCGATAAATT TTATGAACCG
TACGTCTCTG CGTTGAACAT TCGAAAGGGA AAGTACCAAT ACGACGTCCA AAAAATTGTC
GCCAAAGTGG CCTCGTCGGT GCCTTTCGAT GAAACAGCGG AAATCTTGTA CGATACGCAT
GGTCTTTCCC TCTCCCAGGC TACAGTTCAT GAGTTGACCA ATGAACTAGC CGCCCATGCT
CGACTTGAAG AGATCAGCCC AGAAGCAGCA ACTATCCATG AGATTATCGA ACAGGCTACC
CGCCCCAATA AAGCCAGGCC GGTGCTTGTT TTCGCCGCAG ATGGAGCCAT GTCCCCAACT
CGAACAGAAA AGGGCAAGCC CAATGCGTGG AAAGAGGCCA AAGGTATACG GGTCTACTTG
CTTGATGGGC AGCATTTAGC CCAAGTGCTG AGTTGGCATC AGATATGCGA CAAACACCGC
TTCAAATCTT TTTTGCAGCA GATTAAACAG CAAAACCTTT TCCCGGAAGA CAAAGTTCGG
ATCTGCTGTA TAGGCGACGG AGCCGGGTGG ATATGGGAGG CGATGGAAGA GGCTTTCCCA
GACGCGCGGC AAATTTTGGA TTATTACCAT TGCCAAGAAC ATATCCACGA ATTTGCCAAC
GCCCGCTTTG CTGACAAGGC AGCCCGGGAC AAGTGGCTGC GTGAAACGAC AAATCGGCTC
TTTCAAAATA AACTGAGCAG TGTTCTCGCT GGATTGAGGC GAATGAAGCT CACCGGCGAG
GCAGCAGAGA AGCGAGACGC TCTGTGCTCG TATTTGTCAA AGAACAAGGA CCGCATGGAT
TACGGGAAAG CCAAGCGGGG CGGCTATCCT CTGGGCAGCG GGGCCATAGA AAGCGCTAAC
AAATTTATAA GCCACATACG GCTCAAACGC TCAGGAGCCT GGTGGAAAGT CGACTTGGCT
AACAATATCC TGGCTTTGCG CTGTTCGAGG TACAACAAAT CCTTCGATCG CTTCTTTGCC
GCCTATGAAA AAAGCCAAAG GCAGGACTTA GGGCCGGCTC GACCCCACCT ATCGCTTTGT
CAAGGTGCCG GTCGCAAGTA G
 
Protein sequence
MQSALNQSDI YKDRAHRKYF SDMKKLFDYV YDIIDVLGMS LFGNIEKLRV HKLYNIDPKM 
LSAIIETKLE HYFSGFECPL CGCKCRHNNK LKRHIETTYG TISFDSPYYK CKNCDKFYEP
YVSALNIRKG KYQYDVQKIV AKVASSVPFD ETAEILYDTH GLSLSQATVH ELTNELAAHA
RLEEISPEAA TIHEIIEQAT RPNKARPVLV FAADGAMSPT RTEKGKPNAW KEAKGIRVYL
LDGQHLAQVL SWHQICDKHR FKSFLQQIKQ QNLFPEDKVR ICCIGDGAGW IWEAMEEAFP
DARQILDYYH CQEHIHEFAN ARFADKAARD KWLRETTNRL FQNKLSSVLA GLRRMKLTGE
AAEKRDALCS YLSKNKDRMD YGKAKRGGYP LGSGAIESAN KFISHIRLKR SGAWWKVDLA
NNILALRCSR YNKSFDRFFA AYEKSQRQDL GPARPHLSLC QGAGRK