Gene Dret_2479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2479 
Symbol 
ID8420341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2843510 
End bp2845291 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content58% 
IMG OID645039082 
Productputative sodium symporter protein 
Protein accessionYP_003199339 
Protein GI258406597 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR03648] probable sodium:solute symporter, VC_2705 subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.025526 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGATT CAGTCAAAAT GTGGACCTTC ATCATGGTCT TTTGCACCTT TGGACTCTAC 
CTGTATATCG CCTGGCGTTC GCGGGTCAAA GACACCAAAG GATTTTATGT TGCCGGAGGA
GGCGTTCCGG CGCTGGCCAA CGGCATGGCT ACGGGGGCCG ACTGGATGAG TGCGGCCTCG
TTCATCTCCA TGGCCGGCCT GATTTCCTTC ATGGGCTATG AGGGGTGCAT CTATTTGATG
GGCTGGACAG GAGGATATGT GCTTCTGGCC CTCTTTTTGG CCCCCTATTT ACGGAAATTC
GGTAAATTCA CCGTACCCGA TTTTGTCGGC GACCGGTATT ATTCGACCAC AGCCCGAGTT
GTGGCCCTGA TCTGCGCCAT TTTTGCCTCC CTGACCTATG TCGCAGGGCA AATGCGCGGC
GTGGGCATCG TTTTCAGCCG TTTTTTGGAA GTCGACGTAA CCACAGGGGT GGTCATCGGC
ATGGCCATTG TCTTTTTCTA CGCCAGTCTC GGGGGCATGA AAGGCATCAC CTGGACCCAG
GTCGCCCAGT ATTGCGTGAT GATCGTGGCC TTTCTGATCC CGGCGATCGC CATTTCCCTG
AAAATAACCG GACAGGCGGT TCCGCAACTC GGCTTCGGCG GGACCATCGC CTCGGGAAGC
AACGCTGGTC AGTACCTGCT GCAGACCTTG GACCAGATCA GCACGGATCT CGGATTTCGG
GAATACACCT CGGCCTTTGG CGCAGGGAAC AAATCGATGC TCGATGTGAC CGCTATCGCC
CTGGCCCTCA TGGTCGGTAC CGCCGGTTTG CCCCACGTCA TCGTCCGCTT CTACACCGTG
CCTTCGGTCC GTGCCGCACG GCTGTCCGCC GGATACGCCC TGCTGTTTAT CGCCATCTTG
TACACCACCG CCCCAGCCGT AGCTGGTTTT GCCCGCTACA ACATGATCAA CGCCTTAAAC
GAGACCCCGT ACGCCGAAGC CCCGGCTTGG TTCGCCAATT GGGAGGACTC CGGGCTGGTC
GCTTGGCTGG ACAAAAACGA CGACGGCCGG ATCCAATACC GGGCCGGAGC CGCCTTTTCC
GGCAAACCCG AATTTCGGGA AGCCAGAGGC TCCCATGGCC AGCGCCTGCT GAACAATCCC
CCGGGCCCGA CATCCAATGA ACTCTATGTC GACCGGGACA TCATGGTCCT GGCCAATCCG
GAGATCGCTG ATCTGCCGCC CTGGGTAGTA GCCCTGGTCG TGGCCGGAGG GTTGGCAGCT
GCTCTGTCCA CCGCTTCCGG GCTCCTGCTG GTCATCGCCT CCTCGATCTC TCACGACCTG
TATTACCGGA TCCTCAACAG GCAAGCCACG GAAAAACAGC GGCTGCGGCT GGGACGGATC
ATGATCGGCA TTGCTGTCAT CGTGGCCGGC TATTTCGGGA TCAACCCACC AGGATTTGTG
GCTCAGGTGG TCGCCCTGGC CTTTGGCTTG GCCTGTTCCT CATTTTTTCC GGTCCTGGTT
TTGGGGATCT TCTCCAAGCG TGTTACCAAG CAAGGAGCCA TCGCAGGCAT GATCACCGGG
ATTGGGTTCA CCATTCTCTA TATTGTCCAA ACCAAATTCC TGGGTATGGA TCCCTGGCTG
CTGGGCATCA GCCCGGAAGG GATCGGCAGT GTGGGCATGA TCCTGAATTT CACGGTGACT
CTGGGAGTTT CTGCGGTCAC TCCCCCGCCG CCGCCAGAAA TCCAGGAATT GGTCGAGCAT
GTGCGCACGC CTCGCGGCGC CGGAATGGCC ATCGACCACT AA
 
Protein sequence
MMDSVKMWTF IMVFCTFGLY LYIAWRSRVK DTKGFYVAGG GVPALANGMA TGADWMSAAS 
FISMAGLISF MGYEGCIYLM GWTGGYVLLA LFLAPYLRKF GKFTVPDFVG DRYYSTTARV
VALICAIFAS LTYVAGQMRG VGIVFSRFLE VDVTTGVVIG MAIVFFYASL GGMKGITWTQ
VAQYCVMIVA FLIPAIAISL KITGQAVPQL GFGGTIASGS NAGQYLLQTL DQISTDLGFR
EYTSAFGAGN KSMLDVTAIA LALMVGTAGL PHVIVRFYTV PSVRAARLSA GYALLFIAIL
YTTAPAVAGF ARYNMINALN ETPYAEAPAW FANWEDSGLV AWLDKNDDGR IQYRAGAAFS
GKPEFREARG SHGQRLLNNP PGPTSNELYV DRDIMVLANP EIADLPPWVV ALVVAGGLAA
ALSTASGLLL VIASSISHDL YYRILNRQAT EKQRLRLGRI MIGIAVIVAG YFGINPPGFV
AQVVALAFGL ACSSFFPVLV LGIFSKRVTK QGAIAGMITG IGFTILYIVQ TKFLGMDPWL
LGISPEGIGS VGMILNFTVT LGVSAVTPPP PPEIQELVEH VRTPRGAGMA IDH