Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3578 |
Symbol | |
ID | 5318082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 5353 |
End bp | 6639 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775393 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312326 |
Protein GI | 150375730 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.608798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACTCA ACAGACGCGG ATTCATGGCC GGCGCGGCTG GTGCCGCGGC AGGTGTCGCT CTCGGGTCCC GAACGACCCT CGCGGCCGAA AGCGTGCAGT TGCGCGCCAT GTGGTGGGGA TCCAACGACC GCTCGAAGCG AACATTGGCA GTCGCCAAGC TTTTCGCGGA AGCCAACCCG GATATCAGGA TCATCGGCGA AAGCTTGAGC GGCGACGGCT ACTGGACGAA GCTTGCGACC CAGATGGCCG GTCGGTCCAT TGCCGATATC TTCCAGCTCG AGCCCAGCAC GATCTCGGAT TACTCCAAGA GGGGCGCCTG CATGGCGCTC GACCCCTTCA TCTCTTCGGC GTTGGACGTC GACGCATTCG GCAAGGACGT CCTGAAGCTG ACGACGGTGG ATGGAAAGCT CTGGGGTGTG GGGCTTGGCC TCAATTCTTT CGCGCTGTTC TACGATGCCG ACGCTTTTGC CAGAGCGGGC ATCGATCCGC CGGGAATTGA CACCACCTGG GCGGAATATG CTGAGATCGC CGTCGAAATG ACCAAGGCGG TCGGGAAGAA AAGTGCCGGG GGCGGTCCCT ACGGAGCCCG CTACGCCTAT GTGTTCGATG CCTGGCTCCG TCAGCGAGGA AGCAGCCTCT ATACCGATAG CGGCCTCGGT TTCGGGGTCG AGGAGGCGAA GGAATGGTAT GCCTATTGGG AAGAGTTGCG CAAGCGCGGC GGCACCGTCG GAGCGGACAT CCAGACGCTC GACCAGAACA CGATCGACAC CAACTGCCTG GCGCTCGGCT ATTCGGCGAT GGGCATGGCC TATTCCAACC AGATGGTCGG CTATCAGCTC ATCATGAAAA GCAAGCTTGG CATCGGCATG CTGCCCCGTG CCGAGAAAGG AGGTCCCTCC GGCCATTACT ACCGGCCGGC GCTGATCTGG AGCATCGGTG CGTCGACGGA GCACGGCGAA GAGGCCGCGA AATTCATCAA CTTCTTCGTC AATGACGTGG AGGCCGGCAA GATCCTCGGC GTGGAGCGCG GCGTGCCCAT GTCGCCGACC GTTCGCGAAG CCATCCTGCC GTCACTCAAC CCGACCGAAA CGGAAACGGT GAAATATATC AACGGCCTCA AGGATCAGGT GGGGAGCTAT CCGTCGCCGG CGCCGCTTGG AGCGACCGAG TTCGACCAGC GCGTGCTGCG GCCGATTGCC GATGAACTCG CCTTCGAGCG GATATCGATC GGAGACGCGG CGACACGGCT GGTGGAGGAA GGCAGGGCCA CGGTCCGAGC CGGCTGA
|
Protein sequence | MLLNRRGFMA GAAGAAAGVA LGSRTTLAAE SVQLRAMWWG SNDRSKRTLA VAKLFAEANP DIRIIGESLS GDGYWTKLAT QMAGRSIADI FQLEPSTISD YSKRGACMAL DPFISSALDV DAFGKDVLKL TTVDGKLWGV GLGLNSFALF YDADAFARAG IDPPGIDTTW AEYAEIAVEM TKAVGKKSAG GGPYGARYAY VFDAWLRQRG SSLYTDSGLG FGVEEAKEWY AYWEELRKRG GTVGADIQTL DQNTIDTNCL ALGYSAMGMA YSNQMVGYQL IMKSKLGIGM LPRAEKGGPS GHYYRPALIW SIGASTEHGE EAAKFINFFV NDVEAGKILG VERGVPMSPT VREAILPSLN PTETETVKYI NGLKDQVGSY PSPAPLGATE FDQRVLRPIA DELAFERISI GDAATRLVEE GRATVRAG
|
| |