Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3710 |
Symbol | |
ID | 5318430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 151577 |
End bp | 152872 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640775523 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312456 |
Protein GI | 150375860 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00279907 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGTGA TTTCCATTTC CACGACAATG GCCGTCGTCG GCTTCGCTGT TCAGGCGCAG GCCGCCACCG AACTGCAGTG GTGGCATGCA ATGACGGGCG CCAACAACGA AATGATCGAG GAGCTCACCA AGGAGTTCAA CGCGAGCCAG AGCACCTACA AAGTGGTGCC TGTCTTCAAG GGCACCTATC CCGAAACTCT GAATGCGGGG ATTGCGGCCT TCCGCTCGAA GCAGCCGCCG GCGATCATTC AGGTATTCGA TGCCGGCAGC GGCACGATGA TGGCGGCCGA GGGCGCGATC GTGCCGGCCG CCGAGATCCT CCAGAAGGGC GGCTTCACCT TCGATAAATC GCAGTATCTT CCCGGGATCG TTGCCTATTA TTCGAAGCCG GATGGAACGA TGCTGTCCTT TCCGTATAAC TCTTCCTCGC CGATTCTTTA CTACAATAAG GACGCTTTTC AGAAAGCAGG CTTGAACGTA GACAATCCGC CGAAGACATG GCCGGAAGTC TTCGAAGCCG CGAAGAAGAT CAAGACGAGC GGTGCGGCAC CTTGCGGGAT GACGTCGACC TGGTTGACCT GGATCCAGAC GGAGAACTTC GCCGCCTGGA ACAATATGCC CTACGGAACC AATGAAAACG GGCTCGGCGG CACCGATGTG CAGCTGAAGA TCAACGCGCC CCTTTACGTG GAGCATTTCC AGGCCATAGC GAACCTCGCC AAGGACGGCG CCTTTCGTTA TGGGGGGCGC ACCTCCGAGG CAAAGCAGCT CTTTACATCA GGCGAATGTG CCATCCTGAC CGAATCCTCG GGCGGTCTCG GCGACATCGC CAAGAGCGGC GTCAACTACG GGATCGGTCA ACTGCCCTAT TACGAGGGTC ACGGTCCGCA GAACACGATC CCCGGTGGAG CGAGCCTCTG GGTGTTCGCC GGCAAGTCCG ACGAGGAATA CAAGGGCATT GCCGAGTTCT TCAACTTCCT TTCACAGACA GAAATCCAAG CCAAGCTGCA TCAGGTCTCG GGTTATATGC CGGTCACGAT GGCTGCCTAC GAGGAAACCA AGAAGTCCGG CTTCTACGAG AAGAACCCCG GGCGTGAGAC GCCACTCCTG CAGATGATGG GGAAGGCGCC GACCGAAAAC TCGAAGGGTG TCCGGCTGGT CAACCTGCCG CAGGTTCGGG ACATCCTCAA CGAGGAGTTC GAAGCGATGC TGTCGGGACA ACAGGATGCC AAGACGGCGC TCGATAAAGC GGTCGAGCGG GGCGACGCCG CGATCGCAGC AGCAATCAGC AATTGA
|
Protein sequence | MRVISISTTM AVVGFAVQAQ AATELQWWHA MTGANNEMIE ELTKEFNASQ STYKVVPVFK GTYPETLNAG IAAFRSKQPP AIIQVFDAGS GTMMAAEGAI VPAAEILQKG GFTFDKSQYL PGIVAYYSKP DGTMLSFPYN SSSPILYYNK DAFQKAGLNV DNPPKTWPEV FEAAKKIKTS GAAPCGMTST WLTWIQTENF AAWNNMPYGT NENGLGGTDV QLKINAPLYV EHFQAIANLA KDGAFRYGGR TSEAKQLFTS GECAILTESS GGLGDIAKSG VNYGIGQLPY YEGHGPQNTI PGGASLWVFA GKSDEEYKGI AEFFNFLSQT EIQAKLHQVS GYMPVTMAAY EETKKSGFYE KNPGRETPLL QMMGKAPTEN SKGVRLVNLP QVRDILNEEF EAMLSGQQDA KTALDKAVER GDAAIAAAIS N
|
| |