Gene Dret_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1199 
Symbol 
ID8419027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1410067 
End bp1411614 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content57% 
IMG OID645037774 
ProductBile acid:sodium symporter 
Protein accessionYP_003198065 
Protein GI258405323 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAAAC TTCTATCGAG CATCAGTAAA CATCTGATTA TCGCTATCCC GGTCATGATG 
CTCCTGGGAT TCGTCTTCGG CATTGCTGTT GATGACGCCT CTGGGCTGAA GGGACTGATC
ATCCCGTTCA CCTTCCTCAT GGTGTATCCC ATGATGGTCA ATCTGAAGAT CAAGAAGGTC
TTCGAGGGAG GAGATGTCAA AGCCCAGCTC CTGACTCAGG CCATCAACTT CGGCATCATT
CCCTTTGTCG CCTTCGGGTT GGGGATGCTC TTTTTCCGGG ATAGCCCGTA CATGGCCCTG
GGGATGCTCC TGGCCGGGCT GGTTCCCACC AGCGGCATGA CGATTTCCTG GACCGGTTTT
GCCAAGGGCA ATGTCGCCTC TGCAGTGAAG ATGACCGTTA TCGGCCTCAC CCTGGGTTCG
CTGGCCACAC CGTTTTACGT CCAGTTCCTG ATGGGGGCCA GTCTTGAGGT CAATGTCATG
GCGGTCATGA AACAGATCGT CATCATTGTC TTTATTCCCA TGCTGGCTGG ATTTTTGACC
CAGCAGGGGC TGATCAAACG CTACGGACAG AAGGATTTCC AACAGAGTTG GGCCCCGAAA
TTTCCGGCCC TGTCCACCCT TGGCGTCGTG GGCATCGTCT TTATTGCAAT GGCCCTGAAG
GCCAAGGCTA TTGCCGGCGC TCCCCAGATG CTGCTGTATA TTCTCATCCC GCTGACAATT
ATCTACGCTT TTAACTATGT GCTGAGCACC GTCATCGGCA TCAAATTCCT GTCCCGTGGC
GACGGCATCG CCTTGGTCTA CGGCTCGGTG ATGCGTAATC TCTCCATCGC CCTGGCCATC
GCCATCAACG CCTTCGGCCC CGAGGGGTCC AGCGCCGCCC TGGTCATTGC CGTGGCCTAT
ATCATTCAGG TGCAATCCGC AGCCTGGTAC GTAAAATTTT CCGACGCCAT TTTTGGCGCG
CCCGCTGAAG CCGAAGCGCA GGCCGAAAAA ACCGCCGCTC CGACCCCGGG CAAGGAAACA
GAACACGAGC TGCTGGTGCC GGATTTCAAG AATATCCTCT ACGTCACCGA CCTCTCGCAA
AGCGCAAAGC ACGCCGCACA ATACGCCTGC AGCCTCGGGG TGAAATATTC CGCCCAGGTG
ACGGTTATGC ACGTCGTGCC CGATCAGCTT GAGGAGTATT CGGAAAATGT CGGGGTGGAC
ATCACCCACC GCGTGGATCA GCAGACCCGG ACCGCTTTCA ACGAATCCAG CGTGAGCGAA
GCCCAGCAAG CGATCCGTTC CCGGATCGAA TCTACATCCA AAGAAGTGAC CAAGCAGATC
CCCTACTGTC CGATGACTCC GGAGAATATC CGCATCGAAG TTGGGGATCC CCAGAACAAA
ATTGTCGAAA TCGCCCGCAA GGAGGGCTTC GATCTGATCA TCATCGGCAC CCACGGCCAC
GGCGCGTTCG AAGACGCCTT TCTGGGCAGT GTGGCCCGGG ATGTCATCCG CAAGAGTCCC
GTGCCAGTGC TCTCGGTGCG CCTTGCGGAC GCGGCCCACT CACGGTGA
 
Protein sequence
MWKLLSSISK HLIIAIPVMM LLGFVFGIAV DDASGLKGLI IPFTFLMVYP MMVNLKIKKV 
FEGGDVKAQL LTQAINFGII PFVAFGLGML FFRDSPYMAL GMLLAGLVPT SGMTISWTGF
AKGNVASAVK MTVIGLTLGS LATPFYVQFL MGASLEVNVM AVMKQIVIIV FIPMLAGFLT
QQGLIKRYGQ KDFQQSWAPK FPALSTLGVV GIVFIAMALK AKAIAGAPQM LLYILIPLTI
IYAFNYVLST VIGIKFLSRG DGIALVYGSV MRNLSIALAI AINAFGPEGS SAALVIAVAY
IIQVQSAAWY VKFSDAIFGA PAEAEAQAEK TAAPTPGKET EHELLVPDFK NILYVTDLSQ
SAKHAAQYAC SLGVKYSAQV TVMHVVPDQL EEYSENVGVD ITHRVDQQTR TAFNESSVSE
AQQAIRSRIE STSKEVTKQI PYCPMTPENI RIEVGDPQNK IVEIARKEGF DLIIIGTHGH
GAFEDAFLGS VARDVIRKSP VPVLSVRLAD AAHSR