Gene Dret_1838 details


Gene Information

Locus tag: Dret_1838
Symbol:
ID: 8419679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2108462
End bp: 2109781
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 60%
IMG OID: 645038422
Product: major facilitator superfamily MFS_1
Protein accession: YP_003198700
Protein GI: 258405958
COG category: [R] General function prediction only
COG ID: [COG2270] Permeases of the major facilitator superfamily
TIGRFAM ID:
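
The start, end, gene length and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Dret_1838
length = gene_length_bp(2108462, 2109781)
print(length, expected_protein_length_aa(length))  # prints "1320 439"
```

Both values agree with the Gene Length and Protein Length fields of the record.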


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.047376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAAAGC ATATCAGAAT CTTCGCCAAC GATACAACAG ACCAGCAGCG TAACTGGTGG 
GCGTGGTGTT TGTACGATTG GGCCAACTCC GCTTTTGCCA CCGTCATCCT GGCAGCGGTT
TTGCCGGTCT ATTTCGTGAG CCTGGTCCCC TCTGGCGGCG CGCCGGTGCC CGGGATGGAC
CTCCGACTGC CGGCGACAGT GCTTTGGAGT TATGCCGTGG CCCTGTCGAT GGGGTTTCTG
GTCCTGTTCG CCCCCTATCT CGGTGCCTGG GCCGATAGGC ACGGGCGGCA TCTGCCTTTA
CTGGCCGTGT TGTGTTGCAG CGGGGCGCTG GCTACAGCCG GGCTTGCGGC AGTCGATTCG
TATTGGGCTG CGGCGGGGCT GTTCATGTTT GGGAATTGGT GTTTTGCGGC CGGAAACATC
CTCTATAACG CGTATCTTCC AGCTTTGGCC AAAACGGACA GCGAGAAGGA CCTCCTGTCG
GCCCGTGGCT TTGCCGCCGG GTATGCCGGG GGTGGCCTGG CTCTGCTGGT GGTGATGGTG
CTGATACTCA AGCCCGGCTG GTTTGGGTTG CCGGACCCCG GCGCCGCCAC GCGGGCCGGG
TTCGTGCTCA CGGGGCTGTG GTGGATCGGG TTCGCCCTGC CGGTCTTTAC CCGGTTGCGC
GGTGTGGCCA TGCGGGGACC CGATACGGCC CTGCCCCGGG GATTGTCGGG GTATCTGCGG
ATCTTTGGTG AGATTCGGGG CCATGCCAAC CTGTTGCGAT TTTTGGTGGC CTATTTGCTC
TATAACGACG GTATCCAGAC AGTAATCGCT GTTTCTGCGG TCTTTGCCAG GGAGACGGTT
GGCTTGTCTC AGAATACGGT GCTGGGGTGT TTTTTGATGG TCCAGTTCGT GGCCGCTCCT
GGAGCGTTGG TCTTTGGGCG GTTGGCCAAC AGGTGGGGCG TCAAACCGAT GCTGCTCGCA
GCGCTGGTGC TTTTCATGGG GGTCGTGGTC TATGCGGTGC GCCTGGAGAC GGCCGGTCAG
TTCTGGATTT TGGGAGGGGC CGTGGCTCTG GTGCTTGGAG GCAGCCAGGC CTTGTCCCGT
TCGCTTTTCG CCAGGATGGT CCCGGAAGAA AAGAAAGCGG AGTTTTTCGG TTTTTTCAGC
ATTAGCGCCA AATTCGCGGC CATTGTCGGT CCTTTTGTCT TTGCCAGTGT GGCAACCCTA
ACCGGGTCTG TACGCGGCGG CCTGCTCTTT TTGCTGGTCT TCTTTCTTGC CGGTGCAAGC
TTGTTGTGGA CTGTGGAGAC TCCCCGGGCT GAGGCTTCGA ACACAGGAGC GTCTTCTTGA
 
Protein sequence
MRKHIRIFAN DTTDQQRNWW AWCLYDWANS AFATVILAAV LPVYFVSLVP SGGAPVPGMD 
LRLPATVLWS YAVALSMGFL VLFAPYLGAW ADRHGRHLPL LAVLCCSGAL ATAGLAAVDS
YWAAAGLFMF GNWCFAAGNI LYNAYLPALA KTDSEKDLLS ARGFAAGYAG GGLALLVVMV
LILKPGWFGL PDPGAATRAG FVLTGLWWIG FALPVFTRLR GVAMRGPDTA LPRGLSGYLR
IFGEIRGHAN LLRFLVAYLL YNDGIQTVIA VSAVFARETV GLSQNTVLGC FLMVQFVAAP
GALVFGRLAN RWGVKPMLLA ALVLFMGVVV YAVRLETAGQ FWILGGAVAL VLGGSQALSR
SLFARMVPEE KKAEFFGFFS ISAKFAAIVG PFVFASVATL TGSVRGGLLF LLVFFLAGAS
LLWTVETPRA EASNTGASS
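
The record lists translation table 11, and the gene sequence opens with GTG while the protein opens with M. A minimal sketch of how the two sequences relate is given here, assuming that an initiator codon in the first position is read as methionine. The function name `translate_cds` and the `START_CODONS` subset are hypothetical choices for illustration; table 11 allows further initiators that are not listed in the sketch.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (identical for tables 1 and 11)
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a commonly used subset of the table 11 initiator codons
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace is ignored, translation stops at a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence above
print(translate_cds("GTGAGAAAGC ATATCAGAAT CTTCGCCAAC"))  # prints "MRKHIRIFAN"
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would allow a comparison against all 439 residues.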