Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2098 |
Symbol | |
ID | 5322958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2158833 |
End bp | 2159864 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640791036 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327766 |
Protein GI | 150397299 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0603168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.126386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGA TCAAGCTTCT CACGGCCGGT GTTTTTGCCG GTCTCGCCGT TACCACAGGT CAGGCGTCTG CCTCCGTCCT CGACACCGTC AAGCAGCGCG GCACACTGAA CTGCGGCACC GACAACACCG CTCCCGGTTT CGGCTACCTC AATACGACCA CAGGCCAGAT GGAAGGGCTG GACGTAGACT TCTGCAGGGC GGTTGCAGCG GCGGTCCTCG GCGACGCGTC CAAGGTCAAA TTCGTCACCG TAACGGACAA AAGCCGCTTC GACGCCGTCC TGACGAACCA GGTGGACGTC GTCTTCGCAC ACACGACCAT GAAGCCGGCC CGCGAATCCT CGATCGCCAT AGATTTTCTG CCGGTCAACT TCTATGACGG CACGGGTATC ATGGTGAAGA CGGATTCCGA GGTGGTGCAG TTCGCCGACC TCGAAGGTGC GACGTTCTGC ACGACTCAGG GTTCCGTGAC CGAAACCGTG CTTACCAGCG CTTTCAAGGC CAATGGATGG CAGGGCTCCA AGGTTCTCAC CTACGAAAAC CTCGAAAAGC TGTTCGCCGC GCTCAACTCC GGTCGCTGCA ACGCGATGAG CACAGACAAG TCCGCGCTTG CGGCCTGGGC CGGCAACTCG CCGAAGCCGT CCGATTATCT CATCCTGCCG GAAACCCTCG ACAAGTCGCC CTTCGCTGGT TTCGTCGCGG CCAATGATTC CAAATGGCGC AATGCGCTGC GCTGGATCAC CTACGGCCTG TTCCAGGCGG AGGAGTCCGA CATCACACAG GCCAATCTCG AAGAAAAGCT GAAGAGCGAC GACCCGTTCG TTCAGAAGTT TCTCGGCGTG GGAGGCGGCT ACGGCAAGGA CTTCGGCCTT CCGGACGATT TCGTTGCGCA GGCCATAAAG GCGATGGGCA ATTACGGCGA GATTTACGCC CGCAATCTCG GTCCGGATAC GAGCATGTTC CTTGACCGGA AAGGTACGCC TAACGCTCTT TGGACAGAGG GCGGCGCGAT CTACTCGCCC CTCTGGAACT GA
|
Protein sequence | MKLIKLLTAG VFAGLAVTTG QASASVLDTV KQRGTLNCGT DNTAPGFGYL NTTTGQMEGL DVDFCRAVAA AVLGDASKVK FVTVTDKSRF DAVLTNQVDV VFAHTTMKPA RESSIAIDFL PVNFYDGTGI MVKTDSEVVQ FADLEGATFC TTQGSVTETV LTSAFKANGW QGSKVLTYEN LEKLFAALNS GRCNAMSTDK SALAAWAGNS PKPSDYLILP ETLDKSPFAG FVAANDSKWR NALRWITYGL FQAEESDITQ ANLEEKLKSD DPFVQKFLGV GGGYGKDFGL PDDFVAQAIK AMGNYGEIYA RNLGPDTSMF LDRKGTPNAL WTEGGAIYSP LWN
|
| |