Gene Smed_5404 details


Gene Information

Locus tag: Smed_5404
Symbol:
ID: 5319706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 366745
End bp: 368478
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 57%
IMG OID: 640777170
Product: sulphate transporter
Protein accession: YP_001314102
Protein GI: 150377507
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID: [TIGR00377] anti-anti-sigma factor
            [TIGR00815] high affinity sulphate transporter 1
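
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


length = gene_length_bp(366745, 368478)
print(length, "bp")                      # gene length from the coordinates
print(protein_length_aa(length), "aa")   # residues, excluding the stop codon
```

With the values from this record, the computed lengths agree with the reported 1734 bp and 577 aa.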


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.428288
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.878672
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGTAT CGACAGACAG ACAGAGCGAA GCGCCGGGCT TTGCTGAACT CTACACTCCA 
AAATTTCTTA CAATCCTGCG CGAAGGCTAC GGATTGCCCC ACTTGCGCGC CGATGCAATA
GCGGGACTTA CCGTCGCCAT CGTAGCGTTG CCACTGTCGA TGGCCATCGC CATTGCCTCG
GGCGCAACGC CCGCGCAGGG GCTATATTCG GCAATCGTCG GTGGGTTCTT CGTCTCATTG
TTCGGAGGAT CGCGATTTCA GATAGGCGGC CCGGCCGGGG CCTTTATCGT GTTGGTCGCC
GCAACAGTCG CTCAACACGG CATGGATGGC TTAATCCTGG CAACCTTCCT TTCAGGTTTG
ATGCTCACCG CAGTTGGCCT CCTGCGGTTC GGAACATTCA TAAAGTTCAT TCCCTTCCCG
GTAACGGTGG GCTTTACGGC AGGGATTGCC GTGATCATCT TTGCCAGCCA GATCAAGGAC
CTTTTCGGTC TGACTTTGGA CCATGAACCA GGAGAGCTCC TTCAGAAGCT CCCAGTGCTG
TGGGGAGCAA AGGACAGCGT TGCGTCTGGC GCGGTCGGGA TCTCTATCAC TACCATTGCG
GTCATTCTCG GATTACGCCG CTGGCGACCG CGCTGGCCGG GGATGCTGAT CGCTGTTGCC
CTGAGCTCAA CCGCAACAGC ATTGCTTGCC CTGCCTGTCG AGACGATTGG AACCAAGTTT
GGCGGAATCC CCTCTGCACT GCCTTTCCCG CAACTGCCTG ATCTCTCGAT GGACAGGATT
GTAGCGGTTC TGCCAGCCGC ACTGTCCTTC ACCTTGCTAG GGTCCATAGA ATCGCTGCTA
TCAGCCGTAG TTGCGGATGG CATGACTGGT CGACGACACC GTTCCAACTG CGAGCTTGTT
GCTCAGGGTG CGGCAAATAT CGGTGCCTCG CTGTTCGGCG GCTTCTGTGT GACGGGCACC
ATTGCCCGTA CCGCTACCAA CATACGCGCT GGAGCGCACG GGCCAGTCGC CGGAATGTTG
CATTCAGCAT TTCTTCTTAT TTTCATGGTG ATCGCGGCAC CGCTCGCTGC ATATATCCCA
TTAGCAGCTC TGGCGGGCGT GCTGGCAGTC GTTGCCTGGA ACATGATCGA AAAGCCTGCA
ATCGCGATCT TGCTGCGCTC GGGTTGGGGC GAAGCAACCG TATTGGGGGC GACATTCTTC
CTGACAGTAT TCCGGGATTT GAGTGAGGCG ATTGTCGTTG GGTTCGCTCT TGGCTCAGTG
CTCTTTATTC ACCGTATGAG CCGAACAACC GCAGTCACCA CCCATGTACC GTTCGTGGCG
CGCGACGAAG CCGATGACGC CCATCCACGC AAAGCCTACA ATGAATCTGC TGCTGCGAAC
CCCGATGTTG TCGTCTACCG GATAACCGGT GCTCTCTTCT TCGGCGCGAC AGCATCAATC
GGCTCTGTTC TCGACCGCAT TCAGGACAAC CAAAAGGCAC TGGTCGTCGA TTTCTCCGCG
GTGCCGTTTC TGGACTCGAC CGGCGCCAAC ATGATCGAGG GCCTAGCGCA CAAGGCTCAC
AAGCATGGTG TGACCCTGTG GCTCACCGGA ACCACTCGCG ATATCCGGCG TGTCTTGCTG
AAGCATGAAA TAAAGAGGCC ACTGGTACGT TACGCGGCCA CAGTCGAAGC TGCCATATCA
GCTTTGCAAC GCGCGCGTAC AACCTTGAGG GCAGGACCAA CACAAGCAGC ATGA
 
Protein sequence
MKVSTDRQSE APGFAELYTP KFLTILREGY GLPHLRADAI AGLTVAIVAL PLSMAIAIAS 
GATPAQGLYS AIVGGFFVSL FGGSRFQIGG PAGAFIVLVA ATVAQHGMDG LILATFLSGL
MLTAVGLLRF GTFIKFIPFP VTVGFTAGIA VIIFASQIKD LFGLTLDHEP GELLQKLPVL
WGAKDSVASG AVGISITTIA VILGLRRWRP RWPGMLIAVA LSSTATALLA LPVETIGTKF
GGIPSALPFP QLPDLSMDRI VAVLPAALSF TLLGSIESLL SAVVADGMTG RRHRSNCELV
AQGAANIGAS LFGGFCVTGT IARTATNIRA GAHGPVAGML HSAFLLIFMV IAAPLAAYIP
LAALAGVLAV VAWNMIEKPA IAILLRSGWG EATVLGATFF LTVFRDLSEA IVVGFALGSV
LFIHRMSRTT AVTTHVPFVA RDEADDAHPR KAYNESAAAN PDVVVYRITG ALFFGATASI
GSVLDRIQDN QKALVVDFSA VPFLDSTGAN MIEGLAHKAH KHGVTLWLTG TTRDIRRVLL
KHEIKRPLVR YAATVEAAIS ALQRARTTLR AGPTQAA
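
The protein sequence can be derived from the gene sequence with translation table 11, in which an initiating GTG is read as methionine. The following is a minimal sketch of that relationship, not the database's own pipeline. The name `translate_cds` is hypothetical, and `START_CODONS` lists only a common subset of the start codons allowed by table 11, which is a simplification.

```python
from itertools import product

# Amino acid assignments in TCAG order; table 11 shares these with the
# standard code and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a common subset of table 11 start codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is always read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with the page's spacing.
print(translate_cds("GTGAAAGTAT CGACAGACAG ACAGAGCGAA"))  # MKVSTDRQSE
```

The printed fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should yield the full protein, provided the sequence contains only the letters A, C, G and T.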