Gene Dshi_3556 details


Gene Information

Locus tag: Dshi_3556
Symbol:
ID: 5713787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3739962
End bp: 3741758
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 65%
IMG OID: 641269485
Product: sodium/solute symporter family protein
Protein accession: YP_001534890
Protein GI: 159046096
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily
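
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 3739962, 3741758

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1797 598
```

Both results agree with the Gene Length and Protein Length fields of the record.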


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAGA CAACACTCAA CATTATCGTC GTGGGGCTGT CGTTCGCGCT CTACTTCGGT 
ATCGCGATCT GGGCGCGCGC CGGATCGACT TCGGAGTTCT ATGCCGCCGG ACGCGGTGTC
AACCCGGTGG TCAACGGCAT GGCCACGGCG GCGGACTGGA TGTCCGCGGC ATCCTTCATC
TCCATGGCGG GCCTGATCGC CTTCGTGGGC TATTCCAACT CCAGCTTCCT GATGGGTTGG
ACCGGCGGGT ACGTGCTGAT GGCGCTGCTG CTGGCGCCTT ACCTGCGCAA GTTCGGCAAG
TTCACCGTGC CGGAATTCAT CGGGGACCGG TTCTACTCGA CCAAGGCACG CGTCGTCGGC
GTGATCTGCC TGATCGTCAT CTCGACCACC TACGTGATCG GCCAGATGAC CGGCGCAGGC
GTGGCCTTCT CGCGCTTTCT CGAAGTGAGC AGCACGACGG GCCTGGTCAC CGCGTCCTGC
GTGGTGTTCG TCTACGCGGT TCTCGGCGGC ATGAAGGGCA TCACCTACAC CCAGGTGGCG
CAATATGTCG TTCTGATCAT CGCCTACACG ATCCCGGCGA TCTTCATCTC GCTGCAGCTG
ACCGGCAACC CGATCCCGGG GCTCGGTCTG TTCTCGAACA TGGCGCCGGG CCAGGTCGGG
GCGGGCGAGC CGCTTCTGGT CACCCTGGAC GGCCTGCTGA CGGATCTCGG CTTCAACGCC
TACACCACGG GCAGCTCGCC CTTCCTGATG GCGCTGTTCA CCCTGTCGCT GATGATCGGG
ACCGCGGGTC TGCCGCACGT GATCATCCGC TTCTTCACCG TGCCGCGCGT GGCGGATGCC
CGCATCTCGG CCGGTTGGAC CCTCGTCTTC ATCGCTCTGC TGTATCTGAC GGCGCCTGCG
GTGGGTGCGA TGGCGCGGCT GAACATCACC GACCTGATGT GGCCGGAAGG CACGCAGGGC
GAAGCCGTGA CCATCGAGAT GATCCAGAAC GATCCGGAAT ATGCCTGGAT GAACACCTGG
CAGCAGACCG GCCTTTTGGG TTGGGAAGAC AAGAACGGCG ACGGCCGGAT CCAGTACTAC
AACGACGCCA ACCCGTCGAT GACGGAACGC GCCGAGGCGG CTGGCTGGCA GGGTAACGAG
TTGACCAACT TCAACCGGGA CATCCTGGTT CTGGCCAACC CGGAGATCGC CAACCTGCCG
GGCTGGGTCA TTGCCCTGAT CGCGGCGGGC GGTATCGCGG CCGCCCTGTC GACGGCGGCG
GGCCTGCTGC TCGCCATCTC GTCGGCGATC AGCCACGACC TGATCAAGAC CGTGTTCAAC
CCGTCCATCT CGGAGAAGGG CGAGCTGCTC GCGGCCCGGA TCTCGATGGC CGGTGCGATC
GTGGTGGCGA CCTATCTGGG GCTCAATCCG CCGGGCTTTG CGGCCCAGGT GGTGGCGCTG
GCCTTCGGTC TGGCCGCCGC GACGCTGTTC CCGGCGCTGA TGATGGGCAT CTTCTCCAAG
CGCATCAACG CAAGCGGTGC GACCTGGGGC ATGCTCGTGG GTCTGATCAG CACCTGTGCC
TACCTGTTCA CGTATCTGGG CATCTTCTTC GTCCCGGGGA CCAACTTCCT CGAGCCGACG
GCGTCCAACT ACCTGTTCGG GATCCCGCCG ACCCATTTCG GGCCGATCGG CGCGCTGCTG
AACTTCGCCG TGGCGATCCT GGTGAGCCGT GCCACCGAAG AGCCGCCGCA GGAGATCCAG
GATCTCGTCG AAAGCGTGCG CATCCCCAGG GGTGCCGGCG CGGCCGTCGA TCACTGA
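
The gene sequence is printed in blocks of ten bases, so it needs whitespace removed before any analysis. As a minimal sketch, the helpers below (hypothetical names) clean such a listing and compute the GC fraction, which could then be compared with the GC content field of the record. Only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above; paste the full listing to check the whole gene.
first_line = "ATGGATCAGA CAACACTCAA CATTATCGTC GTGGGGCTGT CGTTCGCGCT CTACTTCGGT"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```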
 
Protein sequence
MDQTTLNIIV VGLSFALYFG IAIWARAGST SEFYAAGRGV NPVVNGMATA ADWMSAASFI 
SMAGLIAFVG YSNSSFLMGW TGGYVLMALL LAPYLRKFGK FTVPEFIGDR FYSTKARVVG
VICLIVISTT YVIGQMTGAG VAFSRFLEVS STTGLVTASC VVFVYAVLGG MKGITYTQVA
QYVVLIIAYT IPAIFISLQL TGNPIPGLGL FSNMAPGQVG AGEPLLVTLD GLLTDLGFNA
YTTGSSPFLM ALFTLSLMIG TAGLPHVIIR FFTVPRVADA RISAGWTLVF IALLYLTAPA
VGAMARLNIT DLMWPEGTQG EAVTIEMIQN DPEYAWMNTW QQTGLLGWED KNGDGRIQYY
NDANPSMTER AEAAGWQGNE LTNFNRDILV LANPEIANLP GWVIALIAAG GIAAALSTAA
GLLLAISSAI SHDLIKTVFN PSISEKGELL AARISMAGAI VVATYLGLNP PGFAAQVVAL
AFGLAAATLF PALMMGIFSK RINASGATWG MLVGLISTCA YLFTYLGIFF VPGTNFLEPT
ASNYLFGIPP THFGPIGALL NFAVAILVSR ATEEPPQEIQ DLVESVRIPR GAGAAVDH
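
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The record specifies translation table 11 (the bacterial, archaeal and plant plastid code), whose amino acid assignments match the standard code; the two differ in which alternative start codons are permitted, which does not matter here because this gene begins with ATG. The following is a minimal sketch with hypothetical names, not the database's own pipeline.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so block-formatted listings can be pasted directly.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate_cds("ATGGATCAGA CAACACTCAA C"))  # MDQTTLN
```

The output matches the first seven residues of the protein sequence listed above.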