Gene Dret_0678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0678 
Symbol 
ID8418491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp808792 
End bp810381 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content57% 
IMG OID645037242 
ProductMammalian cell entry related domain protein 
Protein accessionYP_003197548 
Protein GI258404806 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.828772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGT ATAAAGGGGC AGAGATCAAG GTCGGCATTT TTGTGTGCAT CGCGCTGGTG 
GCCCTCGGCT ATATGTCCAT GCAGGTCGGA CAGGGCCTTT TCGTCGACAA GGACACCAAA
AAAGTCAGTG TTTTTTTTGA CAATGTTTCC GGCCTCAAGC AGGGAGCTCC AGTGGAAATC
GCCGGGATTG AAGTCGGGCA GGTCCATTCC ATCGGTCTGG CTGACGGCCG CGCCGAACTG
ACGCTGGCAC TGGACAAGGG CATTGCTGTT CCGGTGGACG TCAAGGCGGT CATCCGAACG
CGGGGCGTAC TCGGGGATAA ATTCGTCGAA TTGCGCGGCG GTTCGCCGGA GATGCCGAAT
CTGGAGGAAG GGCAACGGAT CACCCGTTCG TCGTCGCCCG CGGATCTGGA CCAGCTGCTG
CAGAAAGTGG GACAGATCGC TGAGGATATC AGCACCGTCA GTCAATCGGT GTCCAATGTG
TTCGGCGGTG AGCAGGGCGA GGCCCAATTG CAGACCATCA TGGACAACGT CCGGGAATTG
ACGGTGACCC TCAATGGTTT GGTCCAGCAG AATGCTGAAA GCGTCGAGCG CATTGTGGCC
AACCTGGACG GGTTTACCGC GGATATGCGG GAAATGACCG CCCGGAACAA GGACCAGATC
AACGATGCTG TTTCCGACCT GGGGGCGTTT GCCGCGAATA TGCGGGAGAT TACCGGAGAG
AATAAAGACG GCATCCAGCG TATTGTGACC AACCTCGACC GGGCCTCCGG CAAATTGCAG
ACCACGATGC AGGCCATGCA GGATGTGCTG GGCAAGGTGG ACGAGGGGCA AGGCACCTTG
GGCACGATGG TCAACGACGA AGAGATGGCC TACGACCTGA AAAAGACAAT GGCCTCCCTG
GAATCGGTTT CGCGCAAAAT CGACGAAGGA CGCGGGACCC TGGGCAAGCT GGTCAACGAC
GACACCACCG GCAAAGAACT GGACAAGGCC CTGGAAGGGG TGAACACCTT TTTGGCCAAG
CAGGAGCAGT TCAAGACTTC CGTGGACTTC ACCTCCGAAT ACCTCGCCGA CAGCGGAGAT
ATGAAATCCT ATCTCAATCT CAAGCTCCAG CCCACCGAGG ACAAGTATTA CCTGCTCTCC
CTGGTCGACG ATCCCAAGGG GCGCACGGAG ACGACCTCTT TGGTCAGGCG GTATAAGAAA
GATGGTGAAC CCTGGAAGAC CTATCGGGAA GAGGAGGAGG AAACCCAGGA GGACGGTCTG
AAATTTTCGG CCCAGATGGC CAAGCGGTGG AACGATTGGG TCCTGCGCGG CGGGATCATC
GAGTCGTCCG GTGGCCTCGG AGTCGACTAC TACCTCTGGG ACGACCGGAT CCGGCTCTTT
GCCGAGGCGT TCGACTTCGA CGACGAGAAT CCGCCGCACC TCAAAGCCGG GGGCAAGCTT
TACTTCCTGA ACAATTTCTA CATCAGTGCC GGCATGGACG ATTTTGCCAG TGACACGGGC
GATGAGTCCT TTTTTACCGG CCTTGGACTG CGATTTTCCG ACGACGATTT GAAATATATC
CTGGGCAAGA CGCCGCTGCC GCAGCAATAG
 
Protein sequence
MSVYKGAEIK VGIFVCIALV ALGYMSMQVG QGLFVDKDTK KVSVFFDNVS GLKQGAPVEI 
AGIEVGQVHS IGLADGRAEL TLALDKGIAV PVDVKAVIRT RGVLGDKFVE LRGGSPEMPN
LEEGQRITRS SSPADLDQLL QKVGQIAEDI STVSQSVSNV FGGEQGEAQL QTIMDNVREL
TVTLNGLVQQ NAESVERIVA NLDGFTADMR EMTARNKDQI NDAVSDLGAF AANMREITGE
NKDGIQRIVT NLDRASGKLQ TTMQAMQDVL GKVDEGQGTL GTMVNDEEMA YDLKKTMASL
ESVSRKIDEG RGTLGKLVND DTTGKELDKA LEGVNTFLAK QEQFKTSVDF TSEYLADSGD
MKSYLNLKLQ PTEDKYYLLS LVDDPKGRTE TTSLVRRYKK DGEPWKTYRE EEEETQEDGL
KFSAQMAKRW NDWVLRGGII ESSGGLGVDY YLWDDRIRLF AEAFDFDDEN PPHLKAGGKL
YFLNNFYISA GMDDFASDTG DESFFTGLGL RFSDDDLKYI LGKTPLPQQ