Gene Dret_0430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0430 
Symbol 
ID8418235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp528748 
End bp529905 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content60% 
IMG OID645036991 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003197305 
Protein GI258404563 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.138711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.304103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTTC GCCGCTCCCC AATGTACCTC TTTTTGCTCG TCCTGACCGT GTCCGTGTGG 
GCTGGGTTTC AGGGCTGGCG GACGCTTTTG AACAATTTCG CCGTTGAGGT CGCCCACCTC
GGCGGGCACC ACATGGGCGT CATCCAATCG GTCCGCGAGG TCCCGGGGTT TCTCGCTCTG
CTGGTCATTT ATATCCTGCT GATCGTCAAA GAACACCGCT TGGCCGCGGT TTCCGTACTC
ATCCTCGGCC TCGGCGTGGT CCTGACCGGA TTTTTTCCCT CGTTCTGGGG CGTTCTGCTG
GCCACTTTGC TCATGTCCTT CGGCTTCCAC TATTTTGAAA CCGTCAACCA GTCCCTGACA
CTGCAATACT TCTCCGTGGG CGACGCCCCG CTCGTTTTCG GTCGATTGCG CGCAATCGGC
GCTGCGACCA GTATCGGCGT CGGCCTCTCC ATCTTCGCCC TGGCCAACTG GCTGCCCTAT
AAGCTCCTTT TTGCCCTGCT GGGCTGCATC AGCATTGCCG GCGCCATGTG GTGCCTGTTC
CAGGACCCCA CGGACACCAA TATGCCGTCG CAGAACAAGC ATATGGTCCT GCGGCGGCGG
TACTGGCTCT TTTACACCCT GACCCTGCTC GCCGGGGCCC GGCGGCAGAT CTTCATCGCC
TTCGCCGTAT TCTTGCTCGT GGAGAAATTC GGACTCAGCG TCCAGGAGAT CACCTTGTTG
TTCGTGGCCA ACCAGGCCCT GAACTACTTT GTCAGCCCCC TGGTCGGACG GGCCATCAAC
CATTTTGGCG AACGCTCGGT CTTGAGCGTG GAATACGCCT CGCTCATCGT CGTCTTCCTG
GTTTACGCCC TCAGCGATTC CCAATGGCTG GTCCTGGCCA TGTATATCGT GGACCACGTG
GTTTTCAATT GCGCCATGGC CATCCGGACC TTTTTCCAGA AAATCGGGGA TCCCGGTGAC
ATCGCCCCGA GCATGGCCGT CGGCTTTACC ATCAACCATA TCGCGGCGGT GCTCATTCCG
GCCGCAGCCG GCCTGATCTG GCTCGTCGAC CCCGCCTGGG TTTTTCTCGG TGGCGTGGGG
TTGAGCCTGT GCTCGCTGCT CCTGGTCCAG GCCATCCCCT GGCAGCTCAA AAGAAGCCGC
ACCGCTTCAT CCGGTTAG
 
Protein sequence
MSFRRSPMYL FLLVLTVSVW AGFQGWRTLL NNFAVEVAHL GGHHMGVIQS VREVPGFLAL 
LVIYILLIVK EHRLAAVSVL ILGLGVVLTG FFPSFWGVLL ATLLMSFGFH YFETVNQSLT
LQYFSVGDAP LVFGRLRAIG AATSIGVGLS IFALANWLPY KLLFALLGCI SIAGAMWCLF
QDPTDTNMPS QNKHMVLRRR YWLFYTLTLL AGARRQIFIA FAVFLLVEKF GLSVQEITLL
FVANQALNYF VSPLVGRAIN HFGERSVLSV EYASLIVVFL VYALSDSQWL VLAMYIVDHV
VFNCAMAIRT FFQKIGDPGD IAPSMAVGFT INHIAAVLIP AAAGLIWLVD PAWVFLGGVG
LSLCSLLLVQ AIPWQLKRSR TASSG