Gene Smed_3578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3578 
Symbol 
ID5318082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp5353 
End bp6639 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content62% 
IMG OID640775393 
Productextracellular solute-binding protein 
Protein accessionYP_001312326 
Protein GI150375730 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.608798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACTCA ACAGACGCGG ATTCATGGCC GGCGCGGCTG GTGCCGCGGC AGGTGTCGCT 
CTCGGGTCCC GAACGACCCT CGCGGCCGAA AGCGTGCAGT TGCGCGCCAT GTGGTGGGGA
TCCAACGACC GCTCGAAGCG AACATTGGCA GTCGCCAAGC TTTTCGCGGA AGCCAACCCG
GATATCAGGA TCATCGGCGA AAGCTTGAGC GGCGACGGCT ACTGGACGAA GCTTGCGACC
CAGATGGCCG GTCGGTCCAT TGCCGATATC TTCCAGCTCG AGCCCAGCAC GATCTCGGAT
TACTCCAAGA GGGGCGCCTG CATGGCGCTC GACCCCTTCA TCTCTTCGGC GTTGGACGTC
GACGCATTCG GCAAGGACGT CCTGAAGCTG ACGACGGTGG ATGGAAAGCT CTGGGGTGTG
GGGCTTGGCC TCAATTCTTT CGCGCTGTTC TACGATGCCG ACGCTTTTGC CAGAGCGGGC
ATCGATCCGC CGGGAATTGA CACCACCTGG GCGGAATATG CTGAGATCGC CGTCGAAATG
ACCAAGGCGG TCGGGAAGAA AAGTGCCGGG GGCGGTCCCT ACGGAGCCCG CTACGCCTAT
GTGTTCGATG CCTGGCTCCG TCAGCGAGGA AGCAGCCTCT ATACCGATAG CGGCCTCGGT
TTCGGGGTCG AGGAGGCGAA GGAATGGTAT GCCTATTGGG AAGAGTTGCG CAAGCGCGGC
GGCACCGTCG GAGCGGACAT CCAGACGCTC GACCAGAACA CGATCGACAC CAACTGCCTG
GCGCTCGGCT ATTCGGCGAT GGGCATGGCC TATTCCAACC AGATGGTCGG CTATCAGCTC
ATCATGAAAA GCAAGCTTGG CATCGGCATG CTGCCCCGTG CCGAGAAAGG AGGTCCCTCC
GGCCATTACT ACCGGCCGGC GCTGATCTGG AGCATCGGTG CGTCGACGGA GCACGGCGAA
GAGGCCGCGA AATTCATCAA CTTCTTCGTC AATGACGTGG AGGCCGGCAA GATCCTCGGC
GTGGAGCGCG GCGTGCCCAT GTCGCCGACC GTTCGCGAAG CCATCCTGCC GTCACTCAAC
CCGACCGAAA CGGAAACGGT GAAATATATC AACGGCCTCA AGGATCAGGT GGGGAGCTAT
CCGTCGCCGG CGCCGCTTGG AGCGACCGAG TTCGACCAGC GCGTGCTGCG GCCGATTGCC
GATGAACTCG CCTTCGAGCG GATATCGATC GGAGACGCGG CGACACGGCT GGTGGAGGAA
GGCAGGGCCA CGGTCCGAGC CGGCTGA
 
Protein sequence
MLLNRRGFMA GAAGAAAGVA LGSRTTLAAE SVQLRAMWWG SNDRSKRTLA VAKLFAEANP 
DIRIIGESLS GDGYWTKLAT QMAGRSIADI FQLEPSTISD YSKRGACMAL DPFISSALDV
DAFGKDVLKL TTVDGKLWGV GLGLNSFALF YDADAFARAG IDPPGIDTTW AEYAEIAVEM
TKAVGKKSAG GGPYGARYAY VFDAWLRQRG SSLYTDSGLG FGVEEAKEWY AYWEELRKRG
GTVGADIQTL DQNTIDTNCL ALGYSAMGMA YSNQMVGYQL IMKSKLGIGM LPRAEKGGPS
GHYYRPALIW SIGASTEHGE EAAKFINFFV NDVEAGKILG VERGVPMSPT VREAILPSLN
PTETETVKYI NGLKDQVGSY PSPAPLGATE FDQRVLRPIA DELAFERISI GDAATRLVEE
GRATVRAG