Gene Dshi_2067 details


Gene Information

Locus tag: Dshi_2067
Symbol:
ID: 5713062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2187413
End bp: 2189017
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 65%
IMG OID: 641267989
Product: sulfate transporter
Protein accession: YP_001533405
Protein GI: 159044611
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
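
The Start bp, End bp, Gene Length, and Protein Length values in the record are mutually consistent, which a short calculation can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the gene is a stop codon. Both are assumptions, not statements made by the record, and the function names are illustrative only.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
START_BP = 2187413
END_BP = 2189017


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, ASSUMING the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1605 534
```

Under these assumptions the computed values match the listed gene length (1605 bp) and protein length (534 aa).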


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.701494
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCCGAG CTCTGCTGGC ATCCTTTGCC AATCGTATCG CCTTTTCCGC CCCGACCGCC 
GATGAAACCC TCAGCATCTC GCGCATTCGG ATCGAGTTGT TGTCCGGCCT GACCGTGGCG
CTGGCCCTCG TGCCCGAGGC CGTGGCGTTT GCCTTCGTGG CGGGGGTGCA TCCGCTGGTG
GGACTTTACG CGGCCTTCAT CGTGGGTCTG ATCACGGCGC TGATCGGGGG GCGGCCGGGC
ATGATCTCGG GCGCGACGGG CGCGCTGGCC GTGGTCATGG TGGCGCTGGT GGCCGAGCAC
GGGGTCGAGT ACCTGTTCGC CACGGTGGTG TTGATGGGGA TCCTCCAGAT CCTCTTCGGC
ATCTTCAAGC TGGGCAAGTT CATCCGGCTG GTGCCGCATC CGGTCATGCT GGGCTTCGTC
AACGGGCTGG CCATCGTGAT CTTCCTGGCG CAGCTGACCC AGTTCAAGGT GCCCAACGAC
GCTGGAGAGA TGGTCTGGAT GACCGGCTGG CCGCTGGTGA TAATGCTGGG TCTGGTGGCG
CTGACCATGG CGATCATCTG GGGCATGCCC AAGATCACAC GCGTTATTCC CGCGCCGCTG
GCGGGGATCG GGATCGTGGC GGTTCTGGTC ATCGCCTTCG GGATCGACGT GCCGCGGGTG
GGGGATCTCG CCTCCATCGC GGGGGGCTTG CCGAGCCTGC ATATCCCCAT GGTGCCGCTG
AACATGGAGA CGCTTCAGAT CATCGCGCCC TATGCCTTCA TACTCGCGGC CATCGGCCTG
ATCGAGAGCC TGCTGACCCT GAACCTGGTG GGGGAGATCA CCGGCAAGCG GGGCGGCGCG
AGCCAGGAGT GCATCGCCCA AGGCGTCGCC AATACCGTGA CCGGGTTTTT CGGCGGCATG
GGCGGCTGCG CGATGATCGG CCAGTCGATG ATCAACGTGA AATCCGGTGG GCGGACGCGG
ATCGCCGGGG TGGCGGCGGC GCTCTTTCTG TTGCTGTTCA TCGTGGCGGC CTCGCCGCTG
ATCGAGCAGA TCCCGCTCGC GGCCCTCGTG GGCGTAATGT TCATGGTGGT GATCGGCACC
TTCGCCTGGC AATCCCTGAC GATCCTGCGC CGGGTGCCGT TGACGGATGC GCTGGTTATC
GTGCTGGTGA CGGTGGTCAC GGTGCTGACA GACCTTGCCA TCGCGGTGGT GGTGGGGGTG
ATCGTCTCGG CGCTGGCCTA TGCCTGGAAT AACGCCTCGC GCATTCACGC CAAGACCTAC
ACCACCCCCG AGGGGGCGAA GGTGTACCAG GTGCAGGGGC CGCTCTTTTT CGGCTCGTCG
GCCGGGTTCG TCGAGCTGTT CGATGTGACC CATGATCCGG GTCAGGTCAT CGTGGACTTC
GCCGACAGCC GGGTGGTCGA CCAATCCGCG CTGACCGCCA TCGAAGCCAT GGCCGCAAAA
TACGCCGATG CGGGCAAGAA CCTGCAACTG CGCCACCTGA GCCGGGACTG TCACCAGTTG
CTGACCAAGG CGGGTCAGTT GATGATCGAC AGCGACGATG ACCCCGACTA CGCCATCGCC
GCCGACTACC AGGTCAAGAC CGGTATCCTT GGCGGGGGAC ACTGA
 
Protein sequence
MPRALLASFA NRIAFSAPTA DETLSISRIR IELLSGLTVA LALVPEAVAF AFVAGVHPLV 
GLYAAFIVGL ITALIGGRPG MISGATGALA VVMVALVAEH GVEYLFATVV LMGILQILFG
IFKLGKFIRL VPHPVMLGFV NGLAIVIFLA QLTQFKVPND AGEMVWMTGW PLVIMLGLVA
LTMAIIWGMP KITRVIPAPL AGIGIVAVLV IAFGIDVPRV GDLASIAGGL PSLHIPMVPL
NMETLQIIAP YAFILAAIGL IESLLTLNLV GEITGKRGGA SQECIAQGVA NTVTGFFGGM
GGCAMIGQSM INVKSGGRTR IAGVAAALFL LLFIVAASPL IEQIPLAALV GVMFMVVIGT
FAWQSLTILR RVPLTDALVI VLVTVVTVLT DLAIAVVVGV IVSALAYAWN NASRIHAKTY
TTPEGAKVYQ VQGPLFFGSS AGFVELFDVT HDPGQVIVDF ADSRVVDQSA LTAIEAMAAK
YADAGKNLQL RHLSRDCHQL LTKAGQLMID SDDDPDYAIA ADYQVKTGIL GGGH
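
The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal, and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The following is a minimal sketch of that rule, not the database's own pipeline: the helper name `translate_cds` is hypothetical, and only the first 60 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon-to-amino-acid assignments for translation table 11 (identical to the
# standard code), laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by table 11; any of them encodes M at position 1.
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"  # an initiation codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record (60 bases).
first_block = "GTGCCCCGAG CTCTGCTGGC ATCCTTTGCC AATCGTATCG CCTTTTCCGC CCCGACCGCC"
print(translate_cds(first_block))  # MPRALLASFANRIAFSAPTA
```

The output matches the first 20 residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.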