Gene Information |
Locus tag | Dret_0678 |
Symbol | |
ID | 8418491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 808792 |
End bp | 810381 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 645037242 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003197548 |
Protein GI | 258404806 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
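The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the reported gene length includes the terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(808792, 810381)
print(length)                     # 1590
print(protein_length_aa(length))  # 529
```

Under those assumptions the result agrees with the Gene Length (1590 bp) and Protein Length (529 aa) fields: 530 codons, of which the last is the stop codon.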
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.828772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTGT ATAAAGGGGC AGAGATCAAG GTCGGCATTT TTGTGTGCAT CGCGCTGGTG GCCCTCGGCT ATATGTCCAT GCAGGTCGGA CAGGGCCTTT TCGTCGACAA GGACACCAAA AAAGTCAGTG TTTTTTTTGA CAATGTTTCC GGCCTCAAGC AGGGAGCTCC AGTGGAAATC GCCGGGATTG AAGTCGGGCA GGTCCATTCC ATCGGTCTGG CTGACGGCCG CGCCGAACTG ACGCTGGCAC TGGACAAGGG CATTGCTGTT CCGGTGGACG TCAAGGCGGT CATCCGAACG CGGGGCGTAC TCGGGGATAA ATTCGTCGAA TTGCGCGGCG GTTCGCCGGA GATGCCGAAT CTGGAGGAAG GGCAACGGAT CACCCGTTCG TCGTCGCCCG CGGATCTGGA CCAGCTGCTG CAGAAAGTGG GACAGATCGC TGAGGATATC AGCACCGTCA GTCAATCGGT GTCCAATGTG TTCGGCGGTG AGCAGGGCGA GGCCCAATTG CAGACCATCA TGGACAACGT CCGGGAATTG ACGGTGACCC TCAATGGTTT GGTCCAGCAG AATGCTGAAA GCGTCGAGCG CATTGTGGCC AACCTGGACG GGTTTACCGC GGATATGCGG GAAATGACCG CCCGGAACAA GGACCAGATC AACGATGCTG TTTCCGACCT GGGGGCGTTT GCCGCGAATA TGCGGGAGAT TACCGGAGAG AATAAAGACG GCATCCAGCG TATTGTGACC AACCTCGACC GGGCCTCCGG CAAATTGCAG ACCACGATGC AGGCCATGCA GGATGTGCTG GGCAAGGTGG ACGAGGGGCA AGGCACCTTG GGCACGATGG TCAACGACGA AGAGATGGCC TACGACCTGA AAAAGACAAT GGCCTCCCTG GAATCGGTTT CGCGCAAAAT CGACGAAGGA CGCGGGACCC TGGGCAAGCT GGTCAACGAC GACACCACCG GCAAAGAACT GGACAAGGCC CTGGAAGGGG TGAACACCTT TTTGGCCAAG CAGGAGCAGT TCAAGACTTC CGTGGACTTC ACCTCCGAAT ACCTCGCCGA CAGCGGAGAT ATGAAATCCT ATCTCAATCT CAAGCTCCAG CCCACCGAGG ACAAGTATTA CCTGCTCTCC CTGGTCGACG ATCCCAAGGG GCGCACGGAG ACGACCTCTT TGGTCAGGCG GTATAAGAAA GATGGTGAAC CCTGGAAGAC CTATCGGGAA GAGGAGGAGG AAACCCAGGA GGACGGTCTG AAATTTTCGG CCCAGATGGC CAAGCGGTGG AACGATTGGG TCCTGCGCGG CGGGATCATC GAGTCGTCCG GTGGCCTCGG AGTCGACTAC TACCTCTGGG ACGACCGGAT CCGGCTCTTT GCCGAGGCGT TCGACTTCGA CGACGAGAAT CCGCCGCACC TCAAAGCCGG GGGCAAGCTT TACTTCCTGA ACAATTTCTA CATCAGTGCC GGCATGGACG ATTTTGCCAG TGACACGGGC GATGAGTCCT TTTTTACCGG CCTTGGACTG CGATTTTCCG ACGACGATTT GAAATATATC CTGGGCAAGA CGCCGCTGCC GCAGCAATAG
|
Protein sequence | MSVYKGAEIK VGIFVCIALV ALGYMSMQVG QGLFVDKDTK KVSVFFDNVS GLKQGAPVEI AGIEVGQVHS IGLADGRAEL TLALDKGIAV PVDVKAVIRT RGVLGDKFVE LRGGSPEMPN LEEGQRITRS SSPADLDQLL QKVGQIAEDI STVSQSVSNV FGGEQGEAQL QTIMDNVREL TVTLNGLVQQ NAESVERIVA NLDGFTADMR EMTARNKDQI NDAVSDLGAF AANMREITGE NKDGIQRIVT NLDRASGKLQ TTMQAMQDVL GKVDEGQGTL GTMVNDEEMA YDLKKTMASL ESVSRKIDEG RGTLGKLVND DTTGKELDKA LEGVNTFLAK QEQFKTSVDF TSEYLADSGD MKSYLNLKLQ PTEDKYYLLS LVDDPKGRTE TTSLVRRYKK DGEPWKTYRE EEEETQEDGL KFSAQMAKRW NDWVLRGGII ESSGGLGVDY YLWDDRIRLF AEAFDFDDEN PPHLKAGGKL YFLNNFYISA GMDDFASDTG DESFFTGLGL RFSDDDLKYI LGKTPLPQQ
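The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the pipeline that produced the record: it assumes the sequence is given 5' to 3' on the coding strand, ignores the whitespace that separates the 10-base blocks, and uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs mainly in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated with base order T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGAGCGTGT ATAAAGGGGC AGAGATCAAG"
print(translate(prefix))  # MSVYKGAEIK
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a value that can be compared with the GC content field.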