Gene Dret_1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1961 
Symbol 
ID8419806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2244434 
End bp2245882 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content59% 
IMG OID645038549 
Productsodium/proline symporter 
Protein accessionYP_003198823 
Protein GI258406081 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0044995 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGAGTG TGCCCACCTT TACCTCCTTT GTCGTCTACT TGATTGTCAT GATGTCCATC 
GGGATTTTTT TCTACTACCG GACCAAAAAC CTCTCGGACT ACATCCTCGG CGGACGCCAA
CTCAGCCCGG CAGTGGCCGC CTTGAGCGCC GGGGCTTCGG ACATGAGCGG CTGGTTGTTG
CTCGGTCTGC CCGGGGCCAT GTACGCCGGA GGGATGAACA ACATCTGGAT CGCTGTCGGC
TTGTCCATTG GCGCCTATCT GAACTGGCAG TTCGTGGCCA AGAAATTACG GACCTATACG
GAAAAGGCCG GTGACGCGAT CACCCTGCCC GACTACCTGG AAAACCGCTT CGGCGATTCC
TCGCGCATCC TGCGGGTCAT CTCGGCCATC GTGATTCTCA TCTTCTTCAC TATCTACGTC
TCTTCCGGCC TGGTCGGCGG GGCCATCCTT TTCGAGAAAA CCTTTGGCCT GAACTACCAG
CTCGCCCTGT GGGTCGGGGC CCTGGTCATC GTGGCCTACA CCTTTCTCGG CGGGTTCATG
GCCGTCAGCT TGACCGACTT TCTCCAGGGA ACGCTGATGT TTATCGCGTT GCTGGTGGTT
CCGGCCGTCG TGATCGCCAA AATGGGCGAT TGGGGCACCG TGGTCGACAA GGTCGCGCAT
GTCGACGCCA AATATGTGGA CGCCTTTTCC GGCATGACCT TGCTGTCGAT TATCTCTCTC
ATGGCCTGGG GACTGGGCTA TTTCGGGCAG CCCCATATTC TGGCCCGCTT CATGGCCATC
CGGCGGGCCA AGGACGTGCC CAAGGCGCAG ATGGTCGGCA TGACCTGGAT GGTTCTGGGC
CTTTTCGGGG CCATTTTCAC CGGTTTTGCC GGCATCGCCT ATTATGTGGG CAGTCCGCTG
GAAAATTCAG AAACCGTCTT CATCGCTTTG ACCCAGGCCC TCTTCAATCC CTGGATCGCC
GGCATCCTGC TGGCCGCCAT TCTTTCGGCC ATCATGTCCA CCGTCGACTC CCAGCTTTTG
GTCTGCTCCA GCGCTATTGC CGAAGACTTT TACAAACAGA TCCTGCGCAA GGAAGCCCCG
CAAAATGAAC TGGTCTGGAT CGGACGGGTG TCCGTGCTCA TCCTGGCCCT GATCGCGACC
TCCCTGGCGG CCGACCCGAA CAGCAAGGTC CTGGACCTGG TGGCCTACGC CTGGGGCGGC
TTTGGCGCCG CCTTCGGCCC GGTGGTCATT TTGTCCCTGT TCTGGCGGCG CATGACCCGC
AACGGGACCC TGGCCGGGAT GATCGTCGGC GCTGTGACCG TCATCGTCTG GAAGCACATG
ACCGGTGGCC TTTTTGATCT CTACGAGATC CTGCCCGGGT TTCTGTTCTG TGCTCTGACC
GTTATCATCG TCAGCCTCCT GGACAAAGCC CCCGGCAAAT CCGTCACTGA GGTTTTCGAC
TCCGTCTGA
 
Protein sequence
MMSVPTFTSF VVYLIVMMSI GIFFYYRTKN LSDYILGGRQ LSPAVAALSA GASDMSGWLL 
LGLPGAMYAG GMNNIWIAVG LSIGAYLNWQ FVAKKLRTYT EKAGDAITLP DYLENRFGDS
SRILRVISAI VILIFFTIYV SSGLVGGAIL FEKTFGLNYQ LALWVGALVI VAYTFLGGFM
AVSLTDFLQG TLMFIALLVV PAVVIAKMGD WGTVVDKVAH VDAKYVDAFS GMTLLSIISL
MAWGLGYFGQ PHILARFMAI RRAKDVPKAQ MVGMTWMVLG LFGAIFTGFA GIAYYVGSPL
ENSETVFIAL TQALFNPWIA GILLAAILSA IMSTVDSQLL VCSSAIAEDF YKQILRKEAP
QNELVWIGRV SVLILALIAT SLAADPNSKV LDLVAYAWGG FGAAFGPVVI LSLFWRRMTR
NGTLAGMIVG AVTVIVWKHM TGGLFDLYEI LPGFLFCALT VIIVSLLDKA PGKSVTEVFD
SV