Gene Information |
Locus tag | Smed_3308 |
Symbol | |
ID | 5324192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3500639 |
End bp | 3502225 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640792260 |
Product | extracellular solute-binding protein |
Protein accession | YP_001328965 |
Protein GI | 150398498 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.705056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCG GAAACAAACG CGGATTGCAG TTTGCCACGG CATTGCTGTC TTGCATGGCA GGTTTTGCGG GCCCGGCTTC GGCTCAAAGC TCTGATCTCG TCGTCAACAG CGCGGTGGCG CCCAGCACGC TGGATCCGGC CTGGGCCTGC GGGTTGCAGG AAATCAGTTT TCTGCAGAAT TTCTATGTGC GTCTGGTTCA GAACGGCACC GCGGAAGGGC CCGAGGGCAC AGGGGTCGTC GATTATTCCA AGATCGAGCC CTATCTCGCC AAGTCCTGGG AGGTCAGTGA GGATGGGTTG GTCTACACGT TCCACCTCAA GGAAGGCTAC ACGTTCGAAA GCGGCAAGCC GGTCGATGCG GATGCTGTTC GCTATTCCCT ACAGCGCGTG CTTGACATGG CCGGCTGCGG CCGGTTCTTC CTGACAGACG GTCATATCGA TCCTGTCATT TTCAAGTCCA TCGAAGCGGT CGATCCGCTG ACGGTCGAGA TCACGCTCAA CAAGCCAAAC GGCAACATGC TGGGCGATCT GGCCACCCAT GCGGCATCGA TCGTCGATCC CTCCATCGTC GAAGCCAATG GCGGCGTCAC GCCCGGTCAG CCGAATGAGT ACATGGCGGC AAATGTAACC GAATCCGGAC CGTTCCTTTT GGATTCCTAT ACGGCCAACC AGAATGCTCG CCTAGTGGCC AATCCGGCTT TCGCAGGCGA AGCGCCGGCA TCGAAGGCCA TCAACGTCAA CTGGATCACC GCCTTGCCGA CGCTTTTGCT GCAGGCACGT ACCGGGCAGG CCGATATCAC CTTCGGCCTT GCCAAGCAGG CCGTCAAAAC CATGGTCAGC AATGCCGGCA CGCGGGTGAT CGCCTATTCC AGTCCATTTG CCCAGCAGAT GATTCTGCCC AATACCAAGG CACCCTGGAA CAACAAGCTT TTCCGACAAG CCGTCGCCCA TGCCGTGCCC TATGAAAGCA TCGTTTCGCG CGTCGCCTAT GGTTACGGCA CGCAATATTA CGGACCGATT CCGCCCAGCC TGCCCGGCTT CAACGCCGAG CTCAGCAAGC CGGTCCCCTT CGATCTCGAT CGGGCAAAAG ATCTGATCGC CGAAAGCGGC GTGGCAACAC CCGTCGATGT CGAGGTGATG ATCCAGGAAG GCGACGCCAC CCAGCAGCAA CTGGCCACCA TTCTGCAAAG CACTTGGAAA GATCTCGGCA TCACCCTCAA GATCCGCGTG GCGCCGGCTG CCGAATTCCA GGACCTGTCA CAGGGTCATC GGGTTCAGTC GCTGATGCGT CTCGACGGTC CCGGCGTGTT CGAGGTCGGC TATTACTGGG GCTATGACGC GGTTTGCGGC AATCCCAACA ATCTCACCGA ATATTGCAAC AAAGACATGG ATGCGCTGAT CGAGAGACTG CGCGCCTCTT CCGACCCGGC CGAGCGACAG ACGATCATGG ACCAGGCCAC GGAAATCTGG CGTGCGGATT ATCCGAAGAT CCTGTTCTTC GAGGATCAGC CGGTCGTCGT TCTCTCCGAT GCCGTGACCA AGTTCACCTT CTCGCCGCTC CCCGACTATC GCTACTGGGC GAAATAA
|
Protein sequence | MRIGNKRGLQ FATALLSCMA GFAGPASAQS SDLVVNSAVA PSTLDPAWAC GLQEISFLQN FYVRLVQNGT AEGPEGTGVV DYSKIEPYLA KSWEVSEDGL VYTFHLKEGY TFESGKPVDA DAVRYSLQRV LDMAGCGRFF LTDGHIDPVI FKSIEAVDPL TVEITLNKPN GNMLGDLATH AASIVDPSIV EANGGVTPGQ PNEYMAANVT ESGPFLLDSY TANQNARLVA NPAFAGEAPA SKAINVNWIT ALPTLLLQAR TGQADITFGL AKQAVKTMVS NAGTRVIAYS SPFAQQMILP NTKAPWNNKL FRQAVAHAVP YESIVSRVAY GYGTQYYGPI PPSLPGFNAE LSKPVPFDLD RAKDLIAESG VATPVDVEVM IQEGDATQQQ LATILQSTWK DLGITLKIRV APAAEFQDLS QGHRVQSLMR LDGPGVFEVG YYWGYDAVCG NPNNLTEYCN KDMDALIERL RASSDPAERQ TIMDQATEIW RADYPKILFF EDQPVVVLSD AVTKFTFSPL PDYRYWAK
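
The coordinates, lengths, and sequences in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. The constant and function names (`span_length`, `translate`, and so on) are hypothetical, and it assumes the gene sequence is listed 5' to 3' on the coding strand and that its final codon is a stop codon that is not counted in the protein length. Translation table 11 assigns the same amino acids to codons as the standard table, so one compact codon table serves here.

```python
# Values copied from the Gene Information table of this record.
START_BP = 3500639
END_BP = 3502225
GENE_LENGTH_BP = 1587
PROTEIN_LENGTH_AA = 528

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same codon-to-amino-acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start_bp, end_bp):
    """Length of an inclusive, 1-based coordinate span."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate a coding-strand DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 9 nucleotides of the gene sequence listed above.
first_30_nt = "ATGCGCATCG GAAACAAACG CGGATTGCAG"
last_9_nt = "GCGAAATAA"

print(span_length(START_BP, END_BP))   # compare with Gene Length
print((GENE_LENGTH_BP - 3) // 3)       # compare with Protein Length
print(translate(first_30_nt))          # compare with the start of the protein sequence
print(translate(last_9_nt))            # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence followed by a terminal `*`, provided the assumptions above hold.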