Gene Dret_2355 details


Gene Information

Locus tag: Dret_2355
Symbol:
ID: 8420215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2686238
End bp: 2687395
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 63%
IMG OID: 645038957
Product: major facilitator superfamily MFS_1
Protein accession: YP_003199216
Protein GI: 258406474
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
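
The start, end, gene length, and protein length listed above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2686238, 2687395)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```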


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTGC TCCACGACCT CAACCTCTGC CTCCTGCTGC TCGTGGTCAT GTTCGCCGTA 
ACCGGCGGCA GCCTGGTCGG CCCGATTCTC CCGGAAATGA TCGCTCCGCT GGGCGCGACC
CAGCAAACCG TTGGCCTGGC CCTGAGCGTC TACACCCTGG GAGCCCTCAT CACCACGCCG
ATCTTTGGGG TCCTGGCCGA CCGCGTGGGC CGCAAACGGA TCATCGTCCC CACCACTCTG
CTCTTTGGTA TCGCAGGGCT GCTGATCACC CTGACCGAAA GCTTCTGGCT CGTCCTTGTC
TACCGGGCCC TGCAAGGCAT CGGTGTCGGC GGGATGATGA ACTCGGTCAT CGTGGCCATT
GGCGACCGCT ATTCTGGCAT CGAGCGCCAA CAGGCCATGG GCTACCGCGT CACTGCCCAG
GGACTCACCA ATGCGGCCGT TCCCTTTCTC TCAGGGGCAC TGGCGACCAT CGCCTGGTTT
CTTCCCTTTT ATATCCATTC CCTGGCCATC GCAGTCGGGC TCCTGGCGGC CTGGAAACTC
GAGGAACCGG TGCAGGCGCG CCCCTCGGCA AATTATCTCA CGCAGGCTCT GGCTGCGGTT
CTCACCCTCC GGGCGTTCTG GCTCTTCTTT TCCAATTTCA TGGGCTTTTT TCTACTCTAC
TGCCTGGTGG TCTACATGCC GCTTTTTGTG GTCAACGAAC TCGGCCACTC CACACTGCAC
GCCGGTCTGG CCCTGTCTGT GGGCGCCGGT GTCAACTCCC TGGTCGCCAC CCAGGCTGGA
CGCCTCCGCC GCCGCTTCAG TGAAGAGACG CTGGTCCTGA CCGGCTTTCT CTGCGCCGGG
ATCGCGCTGC TGGCCTTGGG GCTGAGCCCG ACCTACGGAA CCATGCTCCT GTGCTTCGTG
CTCTGGGGCC TCGGCTTTGG CGCACTCATG CCCACCCTGA ACGCGGCCGC GGCCGGCCTG
GTCTCTGCAG AATTGCGCGG CGGGGTGCTC TCGCTGTTCA CCCTGCTGAT CTACCTGGGC
CAGACCGTCT CCCCTCTCTT TTTCGCCTTG TTTCTCAAAA ACGGAACAGT GCACCACACC
TTTTTTATCG CCAGTGGGCT GACACTTTTG CCGCTGTCCC TGACGCTTCT CGTCCGCAGC
CGCCAAGACA CCACCTGA
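
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of that cleanup, together with a GC-percentage calculation and a check for a terminal stop codon. All function names are hypothetical, and the stop-codon set is the one commonly used with translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def has_terminal_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# The last row of the gene sequence ends in TGA.
print(has_terminal_stop("CGCCAAGACA CCACCTGA"))  # True
```

Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content reported in the Gene Information table.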
 
Protein sequence
MRVLHDLNLC LLLLVVMFAV TGGSLVGPIL PEMIAPLGAT QQTVGLALSV YTLGALITTP 
IFGVLADRVG RKRIIVPTTL LFGIAGLLIT LTESFWLVLV YRALQGIGVG GMMNSVIVAI
GDRYSGIERQ QAMGYRVTAQ GLTNAAVPFL SGALATIAWF LPFYIHSLAI AVGLLAAWKL
EEPVQARPSA NYLTQALAAV LTLRAFWLFF SNFMGFFLLY CLVVYMPLFV VNELGHSTLH
AGLALSVGAG VNSLVATQAG RLRRRFSEET LVLTGFLCAG IALLALGLSP TYGTMLLCFV
LWGLGFGALM PTLNAAAAGL VSAELRGGVL SLFTLLIYLG QTVSPLFFAL FLKNGTVHHT
FFIASGLTLL PLSLTLLVRS RQDTT
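
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    s = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence; the result matches the start of the protein.
first_row = "ATGCGCGTGC TCCACGACCT CAACCTCTGC CTCCTGCTGC TCGTGGTCAT GTTCGCCGTA"
print(translate(first_row))  # MRVLHDLNLCLLLLVVMFAV
```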