Gene Dshi_3904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3904 
Symbol 
ID5714433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp130573 
End bp132153 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content61% 
IMG OID641276817 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_001542113 
Protein GI159046442 
COG category[R] General function prediction only 
COG ID[COG4146] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.757586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGC AGATCGACGC CAATCTCACG ATGCTCGATT ACGGAGTGAT TGCCGTCTAT 
CTGGCGATCG TGATCGCCAT CGGGGTCTGG GTCGCGCGCA AGACCCGGAC GGGAGAGGAT
CTGTTTCTGG CCGGCCGGTC ACTGGGTTGG GCGGCGATCG GGTTTTCGCT CTTCGCCTCC
AACATATCCA CCTCGACCCT CGTCGGCCTG ACCGGCAGCG CTTACACAGG TGGCCTGACG
GTCTCTTCCT ATGAATGGAT GGCCGGGATC CCGCTGCTGT TCATGGCGTT CATCTTCGCG
CCGGTGTTCC TGAAATCGCG CATCTCGACC ACGCCGGAAT ACCTCGAAAA TCGCTATTCC
CGCCGCGTGC GCCTGTATTT CTCGGGCCTG ACTATCGTCT TTACCGTGAT CGTCGATACC
GCTGGCGGGC TTTATGCGGG TGCCGTCGTG CTCAAGGTCT TCTTCCCCGA TCTCGACATC
TGGATGTCTT GTGTGGCGAT TGGCCTCTTC GCGGGCATAT ACACGGCAAC CGGCGGCCTG
CGCGCCGTGG TCTATACCGA TATCCTGCAG GCGGTGGTGC TGATCTGCGG CACTGGCCTG
ACTGCGTTCC TGATGTACCA GTCTGTCGAT TTCTCGTGGG AGTCGGTGCG CAGCCAGGTC
CCCGAGGGCC ATCTGAGCAT CGTGCAGCCC ATCGACGACG ACACCCTGCC CTGGCCGGGG
CTGTTCACCG GTGTCTGGCT GCTGGGCTTT TGGTACTGGG TCACCAACCA GTACATCGTG
CAGCGCGTTC TGGGCGCGAA GGATCTGAGC AATGCGCAGT GGGGCGCCAT CCTGGGCGGT
ATCCTCAAGA TCCTGCCGAC CTTCTTCATC ATCCTGCCCG GGGTCATGGC GCTGGTCACA
CTGCCAGATA TCCAGAACTC GGACCAGGTG TTCCCGATCA TCATCACCGA GGTGCTGCCT
TCGGGCCTGA CCGGGCTTGT CATGGCCGGG TTGATCGCGG CGATCATGTC CACCGTGGAC
TCGACCCTGA ACTCGTCCTC AACCCTCTTG ATCAACGATT TCCTGACGAG GCCCGAAAAA
GAGCCCGACC CCGAGACGGC GAAGAAATGG GGCATGATGG CGACCCTTGG CTTCATGGTG
ATCGCCATCG CCTGGGCGCC GCTGATCCAG TATTTCGGCG GCCTCTGGGC CTACATCCAA
CAGGCCTTCT CGGTGCTGGT CCCGCCGCTT GTGGTGTGCT TCACACTCGG CGCCCTGTGG
TCCCGCGGCA CCGAAAACGC AGCCTTCTGG ACGCTGATCA TCGGCCATAC CCTCGGCCTC
GTGGTCTTCA TGCTGAACCA GTTCGGTATC TGGCCTCTGC ATTACACGAT CAGCGTCACT
ATCATGACCG CCGTCTCCGC CGCGATCTTC GTCGCGCTCA GTCTGCGGGA CGACACACCG
GATGTGCGCG AGGATGCGCT CTGGCAGCGG GCGGACGCCT TCGACACCCC CGCAACCACT
GCACCGGTGC TCAAGAACGT GAAGACCCAT GCGATCCTCC TGATCCTGCT GATGATCGGC
ACGCTGGTGC TGTTTTGGTG A
 
Protein sequence
MEQQIDANLT MLDYGVIAVY LAIVIAIGVW VARKTRTGED LFLAGRSLGW AAIGFSLFAS 
NISTSTLVGL TGSAYTGGLT VSSYEWMAGI PLLFMAFIFA PVFLKSRIST TPEYLENRYS
RRVRLYFSGL TIVFTVIVDT AGGLYAGAVV LKVFFPDLDI WMSCVAIGLF AGIYTATGGL
RAVVYTDILQ AVVLICGTGL TAFLMYQSVD FSWESVRSQV PEGHLSIVQP IDDDTLPWPG
LFTGVWLLGF WYWVTNQYIV QRVLGAKDLS NAQWGAILGG ILKILPTFFI ILPGVMALVT
LPDIQNSDQV FPIIITEVLP SGLTGLVMAG LIAAIMSTVD STLNSSSTLL INDFLTRPEK
EPDPETAKKW GMMATLGFMV IAIAWAPLIQ YFGGLWAYIQ QAFSVLVPPL VVCFTLGALW
SRGTENAAFW TLIIGHTLGL VVFMLNQFGI WPLHYTISVT IMTAVSAAIF VALSLRDDTP
DVREDALWQR ADAFDTPATT APVLKNVKTH AILLILLMIG TLVLFW