Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0329 |
Symbol | |
ID | 5321162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 358968 |
End bp | 360563 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640789264 |
Product | extracellular solute-binding protein |
Protein accession | YP_001326022 |
Protein GI | 150395555 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.889875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0616043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGC TCACTACTCT TTTAGCGGCG ACGGCGCTCG CCACGCTTAT GGCCGGCACC GCCTGGTCGA AAACGTTCGT CTTCTGCTCG GAAGGTTCGC CGGAGGGCTT TGATCCTGGC CTCTATACGG CCGGCACCAC GTTCGACGCT GCCGCCCACA CCGTTTACAG CCGTCTTCTC GAGTTCAAGA AGGGCACGAC CGAAACGGAA CCCGGGCTCG CCGAAAGCTG GACGATCTCC GACGACGGCC TCGAATACAC CTTCAAGCTG CGTCCCGGCG TCAAGTTTCA GACGACCGAA TACTTCACTC CGACCCGTGA ATTGAACGCC GACGATGTCG TCTTTTCGAT CGAGCGCCAG TGGAAATCGG ACCATCCATG GCATGGCTAT GTGACCGGCG GTTCCTGGGA ATATTTTGCC GGCATGGGGC TGCCGGAACT GCTCGAGTCT GTCGAGAAGG TCGACGACAT GACCGTCAAG ATCAAGCTGA AGCGCAAGGA AGCGCCGTTC CTGGCCAATC TTGCCATGCC CTTCGCGTCG ATCATGTCGA AGGAATATGC CGACAAGCTG CAGGCCGAAG GCAAGATGAA CCAGCTCAAC CAGATGCCGC TCGGCACCGG TCCCTTCGCC TTTGTCGCCT ATCAGCAGGA CGCGGTCATC CGCTACAAGG CGCATCCGGA ATTCTGGGGC GGAAAGCAGA AGATCGACGA TCTGGTCTTT GCGATCACCA CGGACGCGGC CGTTCGCTTC CAGAAGCTGC AGGCCGGCGA ATGCCACCTG ATGCCCTATC CGAACGCGGC GGATGTCGAG GCAATGAAGG CCGATCCGAA CCTCAAGGTG ATGGAGCAGG CCGGCCTCAA CGTCGCCTAT CTCGCCTATA ACACGACGCA GCCGCCCTTC GATAAGCTCG AGGTGCGCAA GGCGCTGAAC AAGGCGATCA ACAAGGAGGC GATCGTCGAC GCGGTCTTCC AGGGACAGGC GCAACCGGCG ACCAATCCGA TTCCGCCGAC CATGTGGTCC TATAACGAGC AGATCGAAGA CGACACCTAT GATCCGGAAG CGGCGAAGAA GATGCTCGAG GATGCCGGCG TGAAAGATCT TTCGATGAAG GTCTGGGCGA TGCCAGTGGC GCGTCCCTAC ATGCTCAACG CCCGTCGCGC CGCCGAACTG ATGCAGGCCG ACTTCGCCAA GGTCGGCGTC AAGGTCGAAA TCGTCTCCTA CGAATGGGCC GAATATCTCG AGAAGTCCAA GGCGAAGGAC CGCGACGGTG CCGTGATCCT CGGCTGGACA GGCGACAACG GCGATCCGGA CAACTTCCTC GACACGCTGC TCGGTTGCGA CGCCGTCGGC GGCAACAACC GCGCGCAATG GTGCAACCAG GAGTTCGACG AACTCGTCAC GAAGGCGAAG GAAGCATCCG ACGTCGCAGA GCGCACCAAG CTCTATGAAG AGGCGCAGGT CGTCTTCAAG CGCGAAGCCC CCTGGGCTAC GCTCGACCAC TCGCTCTCCA TCGTCCCGAT GCGCAAGAAT GTCGAAGGCT TCGTGCAGAG CCCGCTCGGC GACTTTGCTT TCGACGGCGT TGATATTGTA GAGTAA
|
Protein sequence | MKKLTTLLAA TALATLMAGT AWSKTFVFCS EGSPEGFDPG LYTAGTTFDA AAHTVYSRLL EFKKGTTETE PGLAESWTIS DDGLEYTFKL RPGVKFQTTE YFTPTRELNA DDVVFSIERQ WKSDHPWHGY VTGGSWEYFA GMGLPELLES VEKVDDMTVK IKLKRKEAPF LANLAMPFAS IMSKEYADKL QAEGKMNQLN QMPLGTGPFA FVAYQQDAVI RYKAHPEFWG GKQKIDDLVF AITTDAAVRF QKLQAGECHL MPYPNAADVE AMKADPNLKV MEQAGLNVAY LAYNTTQPPF DKLEVRKALN KAINKEAIVD AVFQGQAQPA TNPIPPTMWS YNEQIEDDTY DPEAAKKMLE DAGVKDLSMK VWAMPVARPY MLNARRAAEL MQADFAKVGV KVEIVSYEWA EYLEKSKAKD RDGAVILGWT GDNGDPDNFL DTLLGCDAVG GNNRAQWCNQ EFDELVTKAK EASDVAERTK LYEEAQVVFK REAPWATLDH SLSIVPMRKN VEGFVQSPLG DFAFDGVDIV E
|
| |