Gene Dret_2339 details

Gene Information

Locus tag: Dret_2339
Symbol:
ID: 8420199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2660098
End bp: 2663226
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 60%
IMG OID: 645038941
Product: transporter, hydrophobe/amphiphile efflux-1 (HAE1) family
Protein accession: YP_003199200
Protein GI: 258406458
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
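
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 2660098
end_bp = 2663226
gene_length_bp = 3129
protein_length_aa = 1042

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bp each).
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

All three checks agree with the record: 3129 bp corresponds to 1043 codons, of which 1042 encode amino acids.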


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCCA AAATATTCAT CGAACGCCCC AGGTTCGCGG TCGTTCTCGC GATCCTTATC 
ACCCTGGCTG GAATGATCGC TGTCTATTCC CTGCCCGTGG CTGAACACCC GGACATCACT
CCCCCGGTGA TCCGGGTCAG CGCCGTCTAT CCCGGGGCCA GTTCCGAAGT GGTCCGGGAC
ACCATTGCCG CGCCCATCGA AAAACAGATG AACGGCGTGG AGGACATGCT CTACATGCAG
TCCGAATCCA CCGACGACGG CCGCTACTCC CTGGAAGTAA CCTTTTCCGT AGATTCTGAC
CCGGACATCG ACCAGGTCAA TGTCCAGAAC CGACTCCAGC TGGCCGAGTC CAGCCTGCCG
CAGGCGGTTT TGGATCAGGG TATCGACGTA CGACGTCGCT CCTCGGATAT GCTCGGGGTG
GTCTCTTTCA CCTCCCCGGA CGGCTCCCGT GACCGGTTGT TCATGAGCAA CTATATCAGC
CGGACCATCT CCGACGCTGT GCAGCGCGTC GACGGGGTCA GCGACGTCTT CATCTTCGGC
GAGGCCGAAT ACAGCATGCG TATCTGGGTC GACCCGGACA AGCTCACCGC CCTGGACATG
AATGGCAACG AGGTCATCCA GGCCATCCGG GAACAAAGTG TCCAGGCCAC GCTGGGCTCC
ATCGGGACCG CCCCCACGGT TCCCGGGCAG AAACTCCAAT ACACCCTCAA AGCCCAGGGG
CGGCTCAAGA GCGCCGAAGA ATTCGAAAAC ATCATCATCC GCAGCAACGA CCAGGGCGGT
CAGGTCCGGG TCAAGGACGT GGCCGAGGTC GAACTGGGCA ACAAAACCTA CAGCGCCGCC
GGGAACTTCA ACAACCAGGC CGCGGTCAAT GTCGCCCTGT ACCGCTCCTC GGAAGCCAAC
GCCATGGAGA CCATGGAGGC GGCCCGGGCG GAACTCGAGC GCCAGGCAGA GCTTCTGCCC
GAGGGCATGA CCTACACCAT CCCCTACGAC ACCACCAAAT ACATCCAGGC CACCATTGAT
GAAATCGTCA CCACCCTGGC CCTGGTCTTC ATCCTGGTGG TCTTGGTCAT ATTCATCTTT
TTGCAAAACT TCCGGGCCAC ACTCATTCCG GCGGCCGCCG TGCCCGTGTC CATCATCGGC
ACCTTCGCCT TTTTGCTGGC CATAGGCTTC AACCTGAACA CCATCACCCT TTTTGCCCTG
ATCCTGGCCA TCGGACTGGT GGTCGACGAC GCCATTGTCG TGGTCGAAAA CGTGCACCGC
ATCATGGAAG AAGAGGACCT GAGCCCCAAA CAGGCGTCCA TCAAGGCCAT GGAGCAAGTG
TCAGCACCGA TCGTAGCGAC TTCGCTGGTT CTGCTGGCTA TCTTTATCCC CGTCGCCTTT
ATGCCTGGAA TCACAGGCCT TTTATATAAG CAATTCGGAC TCACCCTGTG CGTGTCCATC
ATCATTTCCT CGTTTTGCGC CCTGACCCTG AGCCCGGCCC TGTGCACCGT GCTGCTTTCC
AAACCCAAGC CCCACAACCG CGGCCCCTTC GGCTGGTTCA ACTCCCTCTT GGGCAAAACC
CGCTTTGGCT ATACCAGCGT GGTCGGCTGG ATGATCCGCC ATCTTGCCGT TGCCCTGGGC
CTTTTCCTGC TTGTGCTCGG CGGCTCCTGG TACTTCTACG GCACGTTGCC CACCAGTTTC
CTGCCTCAAG AGGACAAAGG CGGCTTTCTC ATCGACGTCC AACTCCCGGA AGGGGCTACC
CTGCAGCGCA CGGAAACGGT TACCGAGCGG GCCACCAAAT TGCTTCAGGA ACTCGAAGGC
GTGGAAAATG TCCTGGCCAT CAACGGCTTC AGCCTGATGA CCGGCAGCGC GGAAAACGTC
GGTTTCCTCA TCGCGGACCT CGACCCCTGG CAAGAGCGTC AGGATCCGGA ACTGCACATC
AATGCCCTGG TGGACAAGAC AAACAAAAAG CTCAACGCCA TCACCACCGC GACCATCCGG
TCCTTTGTCC CGCCGCCCAT CCAGGGTCTG GGCCTGACCG GCGGCTTCGA CTTCCGCCTG
CAGGCCACCC AGGGGCAACC GCCCACGGAG CTGGCCGAGG TGGCCCGCGG GCTGGCCGGC
CGTGCCAACC AGGATCCCAA GCTCACCCGG GTGTACACGA CCTTTTCGGC CAACACCCCC
CAAATCAACC TCGAACTAGA CCGCTCCCAG ATGGGACAGC TCGGGGTGCA GGTGAGCCGC
CTGTTCGGGA CATTGAACCA GCAACTCGGC GCCCAGTATG TGAACGATTT CAATCTCTAC
GGCCGCACCT ATCAGGTCAA AATGCGCTCC CAGGCCGAAT ACCGCCAAAG CCGGCAAGAC
ATCCTGGACC TGCACGTGGT CAACGACCGA GGCCAGAACG TGCCCGTGGA GAACTTCGTC
GACCTGTCCA CAACTATCGG GGCCAAGACG GTCAACCGGT ACAATCAGTT CTCCAGCGCG
GCGATCAAAG GCCAGGCCGC ACCGGGGTAC TCCTCTGGGC AGGCCATGGC GGCCATGCAA
CAACTGGCGG ACAGGACCCT GCCCGACGGC TACGCCTTTG AATGGTCCTC CATGTCCTAT
CAGGAACAAA AAGCCAGCGG CACCGTGATC TACCTCTATG CCCTGGCCAT TGTCTTCGCC
TATCTCTTCC TGGTCGCTCT TTACGAGTCC TGGAACCTGC CCTTGTCCAT CGTGCTCTCT
GTGGTCGTGG CCACCCTGGG CAGTTTTGTC GGGCTCTGGG TCACTTCCTA CTCTCTGTCC
ATCTACGCCC AGATCGGGCT GGTGCTGCTG GTCGGACTGG CGGCGAAAAA CGCGATTTTG
ATCGTGGAAT TCGCCCGCAG TAGCAAAACA GAAGGGGCCA CAACGTACAA AGCTGCCATT
GAAGCCGCAG GCGTGCGCTT CCGCCCGGTA CTCATGACTG CCCTGACCTT TATTCTGGGG
GTCGCACCTC TGGTCTGGGC CACCGGAGCT GGTGCAGCCA GTCGGCGCCA TATCGGCATC
GTAGTCTTTT CCGGCATGGT CGCGGCAACC ACCCTGGGTA TTCTGCTCAT CCCGTCCCTG
TACTATTTCT TCCAGCGCAT ACGGGAAAAG GGCAAAGCCT GGCGGGACGC TCTGCGTTCC
AATACGTAA
 
Protein sequence
MFSKIFIERP RFAVVLAILI TLAGMIAVYS LPVAEHPDIT PPVIRVSAVY PGASSEVVRD 
TIAAPIEKQM NGVEDMLYMQ SESTDDGRYS LEVTFSVDSD PDIDQVNVQN RLQLAESSLP
QAVLDQGIDV RRRSSDMLGV VSFTSPDGSR DRLFMSNYIS RTISDAVQRV DGVSDVFIFG
EAEYSMRIWV DPDKLTALDM NGNEVIQAIR EQSVQATLGS IGTAPTVPGQ KLQYTLKAQG
RLKSAEEFEN IIIRSNDQGG QVRVKDVAEV ELGNKTYSAA GNFNNQAAVN VALYRSSEAN
AMETMEAARA ELERQAELLP EGMTYTIPYD TTKYIQATID EIVTTLALVF ILVVLVIFIF
LQNFRATLIP AAAVPVSIIG TFAFLLAIGF NLNTITLFAL ILAIGLVVDD AIVVVENVHR
IMEEEDLSPK QASIKAMEQV SAPIVATSLV LLAIFIPVAF MPGITGLLYK QFGLTLCVSI
IISSFCALTL SPALCTVLLS KPKPHNRGPF GWFNSLLGKT RFGYTSVVGW MIRHLAVALG
LFLLVLGGSW YFYGTLPTSF LPQEDKGGFL IDVQLPEGAT LQRTETVTER ATKLLQELEG
VENVLAINGF SLMTGSAENV GFLIADLDPW QERQDPELHI NALVDKTNKK LNAITTATIR
SFVPPPIQGL GLTGGFDFRL QATQGQPPTE LAEVARGLAG RANQDPKLTR VYTTFSANTP
QINLELDRSQ MGQLGVQVSR LFGTLNQQLG AQYVNDFNLY GRTYQVKMRS QAEYRQSRQD
ILDLHVVNDR GQNVPVENFV DLSTTIGAKT VNRYNQFSSA AIKGQAAPGY SSGQAMAAMQ
QLADRTLPDG YAFEWSSMSY QEQKASGTVI YLYALAIVFA YLFLVALYES WNLPLSIVLS
VVVATLGSFV GLWVTSYSLS IYAQIGLVLL VGLAAKNAIL IVEFARSSKT EGATTYKAAI
EAAGVRFRPV LMTALTFILG VAPLVWATGA GAASRRHIGI VVFSGMVAAT TLGILLIPSL
YYFFQRIREK GKAWRDALRS NT
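
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the DNA in frame. The following is a minimal sketch, not the method used to produce this record. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it assumes the sequence is given in coding orientation. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGTTTTCCA AAATATTCAT CGAACGCCCC AGGTTCGCGG TCGTTCTCGC GATCCTTATC"
last_line = "AATACGTAA"

print(translate(first_line))  # MFSKIFIERPRFAVVLAILI
print(translate(last_line))   # NT
```

The first 20 residues and the final two residues match the start (MFSKIFIERP RFAVVLAILI) and end (NT) of the protein sequence listed above, and the gene ends with the stop codon TAA.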