Gene Information |
Locus tag | Smed_5067 |
Symbol | |
ID | 5319369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 13381 |
End bp | 14868 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640776847 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313779 |
Protein GI | 150377184 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
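
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the Gene Length counts the stop codon (the gene sequence in this record ends in TAG, which is consistent with that). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 13381
END_BP = 14868
GENE_LENGTH_BP = 1488
PROTEIN_LENGTH_AA = 495


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the recorded coordinates, gene length, and protein length agree with one another.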
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.68424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA AGATCCGGAG AATGACTGCC GGCGTTGCGA TGCTGCTGGC ATCCACACTT GCCTCTTCGC CCGCATGGGC CCAATCCATC ACCATCGCCA TCGGGTCGGA GCCTTCGACG CTCGACCCGC AGCTCAGGGA TGACGGCGGC GAGCGGCAGG TCAACGACAA CATCTATGAG ACGCTGATGG CACGGACGCC GACCGGCGAA CTCGTGCCGG GCCTTGCCGC GGAGGCTCCA AAACAGGTCG ATGCCACGAC CTGGCAGTTC AAATTGCGGG AGGGCGTCAA GTTCCACAAC GGCGAACCCT TCAATGCCGA TGCAGTGGTC GCTTCCGTCG CGCGTGTAAT TGATCCCGCG AACAATTCCG AGCAGATGGC GTATTTCGGC ACGATCAAGG CGGCCGAAAA GGTCGATGAC CTGACCGTCA ACCTTGTCAC GACAGGCCCG GATCCGATCC TGCCTTCGCG CATGTACTGG ATGAAGATGA TCGCGCCGGG CTATTCCAAG GACGGCGATC TTGCCGGTGC GCCGGTCGGG ACAGGCCCTT ACAAGTTCGA AAGCTGGAAC CGGGGCACGG ACCTGAAGCT CGTCGCGAAC AGCGAATACT GGGGAGGGGA ACCGCAGATC GACGACGTCA CTTACCGCTT CGTGACGGAG CCGGGCACGC GCCTTTCCGG ACTGCTTTCC GGCGAATTCG ACGTGATTAC GAACCTTCTG CCGGAGTTCA CGACGAATGT GCCGAAGTTC GCCGCCGTTC CTGGCCTCGA GACATCCGTC TTCGTTCTGG GCACGGACAA TGAGGTAACG AAAGACCCCA AGGTACGCGA GGCGCTCAAC CTCGCCATCG ACCGCAAGGC CATGGTCGAG GGCCTTTTCA TGGGTTACGC GACGATCGCC AAAGGGTCGC ATATCAATCC GGCCGCCTTT GGCTTCAACG AAAAGCTGGA GCATTATCCC TACGACATCG AGAAGGCGCG GGCGCTGATC AAGGAGGCAG GCGCCGAGGG CAAGCCTCTC GTCGTCGTCG GCGAATCCGG CCGCTGGCTG AAGGACCGTG AGCAGATCGA GGCAGTTGCG GGTTACTGGG CCGAGACCGG ACTGAACGTC ACGACCGACA TACAGGAGTT CTCGCAATAT CTCGACAGCC TGATGGGCGA CGGGCCCCGT CCCGACGCGA TCTTCATCGC CAATTCCAAC GAGCTGCTCG ATGCCGACCG GGAAATGTCC TTCATCTACC ACAAGGACGG TGCTGCGGCC TCGAATTCGG ACGCCGAGAT GGCCACCTTG ATCGAGGCGG CGCGCCTCGA AACGGATACG GCCAAACGCA AGGCGCTTTA CGACGAGATC CAGAAGAAGG GGCATGACCT GAACTACACG GTGCCGCTGT TTAATCTTCA GGACATCTAC GGAATGTCGG AACGAATGGA ATGGCAGCCA CGTGTCGACG CGAAGATGAT GGTGAGCGAA ATGAAGGTCA CCGAATAG
|
Protein sequence | MKKKIRRMTA GVAMLLASTL ASSPAWAQSI TIAIGSEPST LDPQLRDDGG ERQVNDNIYE TLMARTPTGE LVPGLAAEAP KQVDATTWQF KLREGVKFHN GEPFNADAVV ASVARVIDPA NNSEQMAYFG TIKAAEKVDD LTVNLVTTGP DPILPSRMYW MKMIAPGYSK DGDLAGAPVG TGPYKFESWN RGTDLKLVAN SEYWGGEPQI DDVTYRFVTE PGTRLSGLLS GEFDVITNLL PEFTTNVPKF AAVPGLETSV FVLGTDNEVT KDPKVREALN LAIDRKAMVE GLFMGYATIA KGSHINPAAF GFNEKLEHYP YDIEKARALI KEAGAEGKPL VVVGESGRWL KDREQIEAVA GYWAETGLNV TTDIQEFSQY LDSLMGDGPR PDAIFIANSN ELLDADREMS FIYHKDGAAA SNSDAEMATL IEAARLETDT AKRKALYDEI QKKGHDLNYT VPLFNLQDIY GMSERMEWQP RVDAKMMVSE MKVTE
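
The relationship between the two sequence fields can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used by the database: it assumes the Gene sequence field is already given in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The names `clean_sequence` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in TCAG order; the amino acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string codon by codon.

    Stop codons are rendered as '*'. Trailing bases that do not form a
    full codon are ignored.
    """
    dna = clean_sequence(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First three ten-base blocks of the Gene sequence field.
    print(translate("ATGAAGAAGA AGATCCGGAG AATGACTGCC"))
```

Passing the full Gene sequence field to `translate` and dropping the trailing `*` would be one way to compare the result against the Protein sequence field.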