Gene Smed_3710 details


Gene Information

Locus tag: Smed_3710
Symbol:
ID: 5318430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 151577
End bp: 152872
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 59%
IMG OID: 640775523
Product: extracellular solute-binding protein
Protein accession: YP_001312456
Protein GI: 150375860
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
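
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 151577
END_BP = 152872
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```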


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00279907
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGTGA TTTCCATTTC CACGACAATG GCCGTCGTCG GCTTCGCTGT TCAGGCGCAG 
GCCGCCACCG AACTGCAGTG GTGGCATGCA ATGACGGGCG CCAACAACGA AATGATCGAG
GAGCTCACCA AGGAGTTCAA CGCGAGCCAG AGCACCTACA AAGTGGTGCC TGTCTTCAAG
GGCACCTATC CCGAAACTCT GAATGCGGGG ATTGCGGCCT TCCGCTCGAA GCAGCCGCCG
GCGATCATTC AGGTATTCGA TGCCGGCAGC GGCACGATGA TGGCGGCCGA GGGCGCGATC
GTGCCGGCCG CCGAGATCCT CCAGAAGGGC GGCTTCACCT TCGATAAATC GCAGTATCTT
CCCGGGATCG TTGCCTATTA TTCGAAGCCG GATGGAACGA TGCTGTCCTT TCCGTATAAC
TCTTCCTCGC CGATTCTTTA CTACAATAAG GACGCTTTTC AGAAAGCAGG CTTGAACGTA
GACAATCCGC CGAAGACATG GCCGGAAGTC TTCGAAGCCG CGAAGAAGAT CAAGACGAGC
GGTGCGGCAC CTTGCGGGAT GACGTCGACC TGGTTGACCT GGATCCAGAC GGAGAACTTC
GCCGCCTGGA ACAATATGCC CTACGGAACC AATGAAAACG GGCTCGGCGG CACCGATGTG
CAGCTGAAGA TCAACGCGCC CCTTTACGTG GAGCATTTCC AGGCCATAGC GAACCTCGCC
AAGGACGGCG CCTTTCGTTA TGGGGGGCGC ACCTCCGAGG CAAAGCAGCT CTTTACATCA
GGCGAATGTG CCATCCTGAC CGAATCCTCG GGCGGTCTCG GCGACATCGC CAAGAGCGGC
GTCAACTACG GGATCGGTCA ACTGCCCTAT TACGAGGGTC ACGGTCCGCA GAACACGATC
CCCGGTGGAG CGAGCCTCTG GGTGTTCGCC GGCAAGTCCG ACGAGGAATA CAAGGGCATT
GCCGAGTTCT TCAACTTCCT TTCACAGACA GAAATCCAAG CCAAGCTGCA TCAGGTCTCG
GGTTATATGC CGGTCACGAT GGCTGCCTAC GAGGAAACCA AGAAGTCCGG CTTCTACGAG
AAGAACCCCG GGCGTGAGAC GCCACTCCTG CAGATGATGG GGAAGGCGCC GACCGAAAAC
TCGAAGGGTG TCCGGCTGGT CAACCTGCCG CAGGTTCGGG ACATCCTCAA CGAGGAGTTC
GAAGCGATGC TGTCGGGACA ACAGGATGCC AAGACGGCGC TCGATAAAGC GGTCGAGCGG
GGCGACGCCG CGATCGCAGC AGCAATCAGC AATTGA
 
Protein sequence
MRVISISTTM AVVGFAVQAQ AATELQWWHA MTGANNEMIE ELTKEFNASQ STYKVVPVFK 
GTYPETLNAG IAAFRSKQPP AIIQVFDAGS GTMMAAEGAI VPAAEILQKG GFTFDKSQYL
PGIVAYYSKP DGTMLSFPYN SSSPILYYNK DAFQKAGLNV DNPPKTWPEV FEAAKKIKTS
GAAPCGMTST WLTWIQTENF AAWNNMPYGT NENGLGGTDV QLKINAPLYV EHFQAIANLA
KDGAFRYGGR TSEAKQLFTS GECAILTESS GGLGDIAKSG VNYGIGQLPY YEGHGPQNTI
PGGASLWVFA GKSDEEYKGI AEFFNFLSQT EIQAKLHQVS GYMPVTMAAY EETKKSGFYE
KNPGRETPLL QMMGKAPTEN SKGVRLVNLP QVRDILNEEF EAMLSGQQDA KTALDKAVER
GDAAIAAAIS N
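
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, estimate GC content, and translate the gene sequence for comparison with the protein sequence. It assumes the standard codon assignments, which translation table 11 shares for ordinary codons; the table's alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names `clean_sequence`, `gc_fraction`, and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCGAGTGA TTTCCATTTC CACGACAATG"
print(translate(first_blocks))  # MRVISISTTM
```

Pasting the full gene sequence into `translate` and comparing the result with the cleaned protein sequence is a quick consistency check; `gc_fraction` can be compared with the GC content field in the same way.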