Gene Dshi_3906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3906 
Symbol 
ID5714435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp133997 
End bp135598 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content58% 
IMG OID641276819 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_001542115 
Protein GI159046444 
COG category[R] General function prediction only 
COG ID[COG4146] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGACG CGAAATTTGA CCTTCACTGG ATCGATTACG CCATCGTTGT CATCTATTTC 
ATCGGCGTGA TCGCGCATGG CGTCTATGTC TCGCGCAAGA ACGAGGAAGG CGCAGACGGT
TACTTCCTCG CGGGACGGTC GCTGCCTTGG TACCTGATTG GGTTTTCGCT TTTCGCGTCG
AACATGTCGG GTTCCAGCTT CGTGGGTTTG ATGGGCGGCG CCTATGCAAA CGGTATCGTT
ATCTTCAACT ACGAATGGAC CGCTGCACTC GTCCTGATCC TGTTCGCAAT CTTCGTGCTG
CCCTCCTTCC TGAAAGCGAA AATCTCCACC GTCCCCGAAT TTCTCGAGCA GCGCTATGAC
GTGCGCTCGC GGCGGGCCTT CTCGATATTT ACCATTCTTG CCATCCTGTT CATCGACACG
GCCGGGGCGC TCTATGCCGG TGGGCTCGTG ATTTCGAACG TGACGGGTTA CCTCAACCTC
TGGACAGCCG TCGCCGTCCT GGCCCTCGTT GCGGGTATCT ATACCATCCT TGGTGGTCTT
TCGGCGGTGG TGGTCACCGA CACTGTGCAG GCAATCCTGC TGATCATCGG CGCCGCGATC
CTATTCTGGC TCGGCCTTGA CGAGATCGGT GGGTGGGAAC AGCTCTTCGT CGACATTCCC
GAAGGCCACG ACCAACTTAT CCTGCCTGCC GATGACGATT TCCTGCCGTG GACGGGGCTG
TGGGGCGTGG TCTTGCTGGG ATTTTATTAC TGGACCATCA ATCAATTCGT GGTGCAGCGC
ACGCTGGGCG CCAAGAACCT CAAGGAAGGG CAGATCGGTG CCCTCTTCGC GGGCTTTCTC
AAACTGCCGA ACATCTTTCT GATGATCATC CCCGGGGTCA TCGCCCTGAA ACTCTATCCC
GAGCTTGAAA CACCTGACCT CGCCTTCCCG ACCCTCGCTT TCGAACTGAT GCCGATTGGT
GTGCGGGGCC TGATCATGGC CGCCCTGATC GCAGCGATCA TGTCCTCACT CGACTCGGCC
ATGAATTCCG CCTCCACCCT TGTGGTCAAA GATTTCGTCG AGCCGATCTG GGAGGTAGAC
GAGGGCAAGC AGGTTTGGAT CGGCCGTTTG GTGACCGGCG CTGTCATGGT CTTCGGCGCG
ATCTATGCGC CTTCCATTGC CGGGTTCGAA AGCCTGTTCA GCTACTTCCA GTCCTCGTTG
AGCTACATAA TCCCCACCAT CGTCGTGGTC TATATCGTCG GCCTTTTCGT GCCATGGCTG
AACGGCAATG GCGCATTCTG GACGATTATC CTTGGCCTCG TGGTGGGCAT TCCTCTGTTC
ATCATGAAAG AGGTGACGGG CGTCTGGGCC GGCATGGGCC TGCCCGAGAT CCACTATACG
ATCATGTCGA CCCTCATGAT GTGTCTGGGC CTGGCCACTC ATTTCGGGAT CTCCGCTCTG
ACCCGAAAGG CCGACAAGGA AAACATCGAG GACCTCGTCT GGTCTGCCGC CGACACAAAG
GCCATTTTCA CCCAATGGGA AGAGCCACTG TGGCAGGACC GTACGATCTG GGCCGGGTTG
CTGATCCTGT CGACCATCGG TTTTGTCGCG TGGTTCTGGT AA
 
Protein sequence
MGDAKFDLHW IDYAIVVIYF IGVIAHGVYV SRKNEEGADG YFLAGRSLPW YLIGFSLFAS 
NMSGSSFVGL MGGAYANGIV IFNYEWTAAL VLILFAIFVL PSFLKAKIST VPEFLEQRYD
VRSRRAFSIF TILAILFIDT AGALYAGGLV ISNVTGYLNL WTAVAVLALV AGIYTILGGL
SAVVVTDTVQ AILLIIGAAI LFWLGLDEIG GWEQLFVDIP EGHDQLILPA DDDFLPWTGL
WGVVLLGFYY WTINQFVVQR TLGAKNLKEG QIGALFAGFL KLPNIFLMII PGVIALKLYP
ELETPDLAFP TLAFELMPIG VRGLIMAALI AAIMSSLDSA MNSASTLVVK DFVEPIWEVD
EGKQVWIGRL VTGAVMVFGA IYAPSIAGFE SLFSYFQSSL SYIIPTIVVV YIVGLFVPWL
NGNGAFWTII LGLVVGIPLF IMKEVTGVWA GMGLPEIHYT IMSTLMMCLG LATHFGISAL
TRKADKENIE DLVWSAADTK AIFTQWEEPL WQDRTIWAGL LILSTIGFVA WFW