Gene Dret_0398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0398 
Symbol 
ID8418203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp486104 
End bp487504 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content48% 
IMG OID645036960 
Producthypothetical protein 
Protein accessionYP_003197274 
Protein GI258404532 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value7.7648e-11 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.524896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACTG CTCTAAATCA GAGTGATATT TATAAAGAAC GTGCTCATCG GAAGTACTTT 
AGCGATATGA AAGAATTATT TGACAACGTA TACGATATAA TTGACGTACC CGGGCTGTCA
CTTTTTGAAA ATATTGAAAA ACTCTGTGAG CATAAGCTAT ACAATCTCGA TCCAAAAATG
CTGTCTGCGA TCATTGAGAC CAAGCTAGAG CATTACTTTT CTGGCTTCGA ATGCCCTCTA
TGTGGTTGCA AGTGTTCTCA TAATAAAAAA TCAAAGAGAC ATATCGAAAC CACTTACGGA
ACCGTATCAT TTGATTCACC TTATTATAAA TGCAAAAACT GCGATAAATT TTATGAACCG
TACGTCTCTG CGTTGAACCT TCGAAAGGGA AAGTACCAAT ACGACGTCCA AAAAATTGTC
GCCAAAGTGG CCTCGTCGGT GCCTTTCGAT GAAACAGCGG AAATCTTGTA CGATACGCAT
GGCCTTTCCC TCTCCCAGGC TACAGTTCAT GAGTTGACCA ATGAACTAGC CGCCCATGCT
CGACTTGAAG AGATCAGCCC AGAAGCAGCA ACTATCCATG AGATTATCGA ACAGGCTACC
CGCCCCAATA AAGCCAGGCC GGTGCTTGTT TTCGCCGCAG ATGGAGCCAT GTCCCCAACT
CGAACAGAAA AAGGCAAACC CAATGCGTGG AAAGAGGCCA AAGGTATACG GGTCTACTTG
CTTGATGGGC AGCATTTAGC CCAAGTGCTG AGTTGGCATC AGATATGCGA CAAACACCGC
TTCAAATCCT TTTTGCAGCA GATTAAACAG CAAAACCTTT TCCCGGAAGA CAAAGTTCGG
ATCTGCTGTA TAGGCGACGG AGCCGGGTGG ATATGGGAGG CGATGGAAGA GGCTTTTCCA
GACGCACGGC AAATTTTGGA TTATTACCAT TGCCAAGAAC ATATCCACGA ATTTGCCAAC
GCCCGCTTTG CTGACAAGGC AGCCCGGGAC AAGTGGCTGC GTGAAACGAC AAATCGGCTC
TTTCAAAATA AACTGAGCAG TGTTCTCGCT GGATTGAGGC GAATGAAGCT CACCGGCGAG
GCAGCAGAGA AGCGAGACGC TCTGTGCTCG TATTTGTCAA AGAACAAGGA CCGCATGGAT
TACGGGAAAG CCAAGCGGGG CGGCTATCCT CTGGGCAGCG GGGCCATAGA AAGCGCTAAC
AAATTTATAA GCCACATACG GCTCAAACGC TCAGGAGCCT GGTGGAAAGT CGACCTGGCT
AACAATATCC TGGCCTTGCG CTGTTCGAGG TACAACAAAT CCTTCGATCG CTTCTTTGCC
GCCTATGAAA AAAGCCAGCG GCAGGACTTA GGGTCGGCTC AGCCCCACTT GTCGCTTTGT
CAAGGGGGTC GTCGCAAGTA G
 
Protein sequence
MQTALNQSDI YKERAHRKYF SDMKELFDNV YDIIDVPGLS LFENIEKLCE HKLYNLDPKM 
LSAIIETKLE HYFSGFECPL CGCKCSHNKK SKRHIETTYG TVSFDSPYYK CKNCDKFYEP
YVSALNLRKG KYQYDVQKIV AKVASSVPFD ETAEILYDTH GLSLSQATVH ELTNELAAHA
RLEEISPEAA TIHEIIEQAT RPNKARPVLV FAADGAMSPT RTEKGKPNAW KEAKGIRVYL
LDGQHLAQVL SWHQICDKHR FKSFLQQIKQ QNLFPEDKVR ICCIGDGAGW IWEAMEEAFP
DARQILDYYH CQEHIHEFAN ARFADKAARD KWLRETTNRL FQNKLSSVLA GLRRMKLTGE
AAEKRDALCS YLSKNKDRMD YGKAKRGGYP LGSGAIESAN KFISHIRLKR SGAWWKVDLA
NNILALRCSR YNKSFDRFFA AYEKSQRQDL GSAQPHLSLC QGGRRK