Gene Information |
Locus tag | Smed_5662 |
Symbol | |
ID | 5319964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 628249 |
End bp | 629883 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640777394 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314326 |
Protein GI | 150377731 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
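The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, though both agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Smed_5662.
length = gene_length(628249, 629883)
print(length)                  # 1635
print(protein_length(length))  # 544
```

Both results match the Gene Length (1635 bp) and Protein Length (544 aa) fields, which is what one would expect for a gene that is neither spliced nor a pseudogene.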
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.76313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACATCAA GACAGACTGG TACTTTGAAG CCGACGCGCC GCGCGTTTTT GCTTTCCTCC GTTGCACTCG CGACCGCTAT TGCCGTGCCA TTGCTGCCGG TACCTTCGTT CGCCGCCGCC GAAGAGCCGG CGCGCGGCGG CAGCGTGTCG ATCAATATTG GCACCGAGCC GCCGGTTCTG GTGCTCATCG CTCACAGTGC CGGCGCTGCC TATTACATTA GCGGCAAGGC AACCGAGAGC CTGCTGACCT ATGACAAGGA GTTCAATCCT CAGCCGTTGC TCGCCACCGA GTGGACGGTG AGCGAAGACG GCCTGCGCTA CTGGTTCAAG CTGCGCCAGG GCGTTCGTTG GCACGACGGC AAGGATTTCA CCGCCGAAGA CGTCGCCTTC TCGATCTTGG CGCTGAAGGA AAACCATCCG CGCGGCCGCG CGACCTTTGC CCATGTGAAA GAGGCCAATG TTCTCAATTC GCATGAGGTG GAACTGGTCC TTGCCAAGCC GGCACCCTAC CTGCTGACGG CATTTGCAAG CTTTGAGGCG CCGATCGTTC CCAAACATCT CTACGAGGGC ACGCAGATTG CGGAGAACCC GCACAACGTC GCGCCGATTG GCACCGGCCC CTACAAGTTC GTCGAATGGG TTCGCGGCAG CCACGCACTT TTCGTGCGCA ACGAAGACTA TTGGGGTTCG CCCAAGCCTT ACCTCGACCA GATCATCTTC CGGTTCATCG TCGATCCCGC CGCAGCTGTG GCCGCGATTG AGACCGGCGA GGTGCAGGTC TCGACGGCCA ACCTGCCGCT GACCGATATC GACCGCCTGA AGGCCAATCC GAACCTCGTC GTCGATACCG ACCCGGCGCC GTATTCACCG AGCATTGCCC GCGCGGAGTT TAACCTTGAG AACAAGTATC TGGCCGACAT CAAGGTGCGA CATGCGATCG CGCATGCGAT CGACAAGGAT TTCATCGTCA ACACCGTCTA CCTCGGCTAC GCCACCCGCC TCGACGGACC GGTCAGCCCC GACCTTGCGA AGTTCTACTC GCCGGACCTT CCCAAGTACG AGTTCGACCC GGCGAAGTCC GAGAAGCTTT TGGATGAGGC AGGTTACTCG CGCGGCGCCG ATGGTGTTCG CTTCAAGCTA TTCATCGATC CGACCCAGCC GTCCGGTCCT CCCAAGCAGA CCGCGGAATA CATCGCACAG GCGCTTGCCA AGGTCGGCAT CAAGGTGGAA CTGCGAACGC AGGACTTCGC GACTTTCGTC AAGCGCGTCT TCACTGATCG AGACTTCGAC ATCGCCATCG AGGGCATGAG CAATCTCTAC GACCCGACCG TCGGCGTCCA GCGCCTCTAC TGGTCGAAAA ACTTCAAGCC GGGCGTCCCC TTCACCAATG GTTCCAAATA TTCGAATCCC GAAGTCGACC GGCTGCTCGA AACGGCCGCC GTCGAGATCG ACCCGAAGAA GCGGTTGGAA CTCTTCAACG AGTTCCAAAA GCTGGTGGTG GAGGACTTGC CGACGCTCGA TATCGTCACG CCGGCGGTGA TCACCGTCTA CGACAAGCGG TTGAAGAACC TGAAACTTGG TGTCGAGCAT CTCTGGGGCA ACGGGGCGGA CATCTATCTC GACGGACAAT CGTAA
|
Protein sequence | MTSRQTGTLK PTRRAFLLSS VALATAIAVP LLPVPSFAAA EEPARGGSVS INIGTEPPVL VLIAHSAGAA YYISGKATES LLTYDKEFNP QPLLATEWTV SEDGLRYWFK LRQGVRWHDG KDFTAEDVAF SILALKENHP RGRATFAHVK EANVLNSHEV ELVLAKPAPY LLTAFASFEA PIVPKHLYEG TQIAENPHNV APIGTGPYKF VEWVRGSHAL FVRNEDYWGS PKPYLDQIIF RFIVDPAAAV AAIETGEVQV STANLPLTDI DRLKANPNLV VDTDPAPYSP SIARAEFNLE NKYLADIKVR HAIAHAIDKD FIVNTVYLGY ATRLDGPVSP DLAKFYSPDL PKYEFDPAKS EKLLDEAGYS RGADGVRFKL FIDPTQPSGP PKQTAEYIAQ ALAKVGIKVE LRTQDFATFV KRVFTDRDFD IAIEGMSNLY DPTVGVQRLY WSKNFKPGVP FTNGSKYSNP EVDRLLETAA VEIDPKKRLE LFNEFQKLVV EDLPTLDIVT PAVITVYDKR LKNLKLGVEH LWGNGADIYL DGQS
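The sequences above are printed in blocks of ten characters, so they need to be normalised before any analysis. The following is a minimal sketch, not part of the original record, that strips the grouping whitespace, computes the GC fraction, and checks that a sequence looks like a complete CDS. The helper names are hypothetical, and the start codon set (ATG, GTG, TTG) is an assumption covering only the most common bacterial start codons, not the full set allowed by translation table 11.

```python
# Assumed for illustration: common bacterial start codons and the standard stops.
START_CODONS = ("ATG", "GTG", "TTG")
STOP_CODONS = ("TAA", "TAG", "TGA")


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and ends
    with a recognised start and stop codon, respectively."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGACATCAA GACAGACTGG")
print(fragment)               # ATGACATCAAGACAGACTGG
print(gc_fraction(fragment))  # 0.45
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field, and `looks_like_complete_cds` against the ATG start and TAA stop visible at either end of the sequence.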