Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4593 |
Symbol | |
ID | 5319009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1091446 |
End bp | 1093347 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776394 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313326 |
Protein GI | 150376730 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.996188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACCC GCAGAACCGC GCTCGGCCTT CTCGCAACGG CGGCCTTTCC TAAAACCTTG CTTGCGGCTG CAGGCGGGGC CGACCCGCTC GCTGCCCTCG TTCAGGAGGG CAAACTTCCC CCGGTCGGCG AAAGGCTGCC GAAGACGCCG CGGGTGATCA ATGTCGCGGG GATGGGCCGC AAAGCCGGGC GGCACGGCGG CACCATCCGC AGCCTGATCG GCAGTGCGAA GGACATTCGC CTGATGACGA TCTATGGCTA TGCACGCCTG GTCGGGTACG ACGAGGAACT GAACCTACAG CCTGACGTTC TCGAAAGCTA CGAAACGGTC GAGGACCGGA TTTTCACCTT CCACCTGCGC GAGGGCCATA GGTGGTCCGA TGGCACGCCG CTGACCGCCG AGGACTTCCG TTACTGCTTC GAAGACGTTC TGCTCAACGA GGACCTTTCC CCGGCAGGCC TGCCGACCTC GATGGTGATG GATGGTCAAG CGCCCAAGCT CGAAATCGTC GACGAACGGA CGGTCCGCTA TTCCTGGCCG ATGCCAAATC CGGTCTTCCT GCAGGAACTC GCCGCGCCGC AGCCGCTGAT CGTGGCGATG CCTTCGGCCT ATCTCAAGCA GTTCCACAAA AAATATCAGG AAGAGGACAA GCTGAAGACC TTACTGAAAG AACAGCGGGT CAAGAGGTGG AGCCAGCTCC ACATGCGCAT GGCACGGTCC TACCGGCCGG AGAACCCGGA CCTGCCCACC CTCGACCCCT GGCGCAACAC GACGCCGTTG CCTGCCGAAC AGTTCGTCTT CGAGCGCAAT CCCTATTACC ACCGGGTGGA CGAAAACGGC CTGCAGCTGC CTTACATCGA CCGGTTCGTT CTCAGCGTGA GCTCGTCCGC GCTTATCCCG GCAAAGACCG GCACGGGCGA GAGCGACCTG CAGGCAAATG GGATCGATTT CGTCGACTAT ACCTATCTTA AGGATGCGGA GAAGCGGTAT CCGGTCGAGG TGAAGCTCTG GAAGAAGACC TCGGGATCGC GCCTGGCATT GCTGCCGAAC CTCAACTGTG CCGACCCGGT GTGGCGGGCG TTGCTAAGAG ACGTGCGCGT CCGTCGGGCT CTGTCGCTCG CAATCGACCG GCGCGAGATC AACATGGCCG CCTTCTACGG GCTGACGAAG GAGAGCGCCG ATACTGTGCT GCCGGAAAGC CCTCTCTTCC GTCCCGAATT CGCAAGTGCC TGGATCGCAC ATGACCCGGA GCAGGCGAAC ACCCTGCTCG ACGCCGCAGG GCTGGCCAAA CGCGGCAGCG ACGGGATCCG TATTCTTCCC GACGGCCGGA AGGCGCAGAT CGTCGTCGAG ACGGCCGGCG AGAGCACACT CGATACGGAC GTGCTGCAGC TGATCACCGA CTATTGGCGT GAAGTCGGCA TTTCGCTCTT CATCCGTACC TCTCAGCGCG ATACCTTCCG CAGCCGTGCG GTGGGGGGCG AGATCATCAT GTCGATATGG TTCGGTATCG ACAACGGCGT GCCGACGGCT GACATGAGTC CTCACCAGCT CGCGCCGACA GCGGACGACC AGCTGCAATG GCCCGTCTGG GGCTTGAACT ATATCTCCCA CGGCGAAATG GGAGAAACGC CAGACCTTCC GGCCGTGGTC GAACTTCTGG AGCTGCTGAA GCGCTGGAGG CATTCGGCCG ACGATGCGGA GCGGGCGGAC ATCTGGAGAA AGATGCTGTC CATCTATACG GACCAGGTCT TCTCGATCGG GCTGGTCAAC AGTTCCCTGC AGCCTATCCT CGTGACCAAG AAGCTGCGCA ATTTCCCGGA AGAGGGCCTT TGGGGTTTCG ACCCGACGAG CTATTTCGGC GCCTACAAGC CCGACACCTT CTGGCTGGAA CAGGACGGCT GA
|
Protein sequence | MITRRTALGL LATAAFPKTL LAAAGGADPL AALVQEGKLP PVGERLPKTP RVINVAGMGR KAGRHGGTIR SLIGSAKDIR LMTIYGYARL VGYDEELNLQ PDVLESYETV EDRIFTFHLR EGHRWSDGTP LTAEDFRYCF EDVLLNEDLS PAGLPTSMVM DGQAPKLEIV DERTVRYSWP MPNPVFLQEL AAPQPLIVAM PSAYLKQFHK KYQEEDKLKT LLKEQRVKRW SQLHMRMARS YRPENPDLPT LDPWRNTTPL PAEQFVFERN PYYHRVDENG LQLPYIDRFV LSVSSSALIP AKTGTGESDL QANGIDFVDY TYLKDAEKRY PVEVKLWKKT SGSRLALLPN LNCADPVWRA LLRDVRVRRA LSLAIDRREI NMAAFYGLTK ESADTVLPES PLFRPEFASA WIAHDPEQAN TLLDAAGLAK RGSDGIRILP DGRKAQIVVE TAGESTLDTD VLQLITDYWR EVGISLFIRT SQRDTFRSRA VGGEIIMSIW FGIDNGVPTA DMSPHQLAPT ADDQLQWPVW GLNYISHGEM GETPDLPAVV ELLELLKRWR HSADDAERAD IWRKMLSIYT DQVFSIGLVN SSLQPILVTK KLRNFPEEGL WGFDPTSYFG AYKPDTFWLE QDG
|
| |