Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4557 |
Symbol | |
ID | 5318419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1043256 |
End bp | 1044494 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776358 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313290 |
Protein GI | 150376694 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.302395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAT CCATCCGCAA CGCTTTGATC GGCGCGACCC TTGTCGGCGC CGGCTTCACC GGCCATGCCC AGGCCGAAAC GACTCTGAAC GCGCTTTTCA TGGCGCAGGC CGCCTATAGC GAGGCAGACG TCCGCGCCAT GACGGATGCC TTCGCCAAGG CCAATCCCGA CATCAAGGTC AATCTCGAAT TCGTTCCCTA TGAGGGACTG CACGACAAGA CGGTGCTCGC GCAGGGTTCC GGCGGCGGCT ATGACGTCGT CCTTTTCGAC GTCATCTGGC CGGCGGAATA TGCAGCCAAC AATGTTCTCC TCGACGTGAC GGACCGCATC ACGGACGAGA TCAATCAAGG CGTCCTGCCC GGCGCCTGGA CGACGGTGGA ATATGACGGC AAACGTTACG GCATGCCGTG GATCCTCGAC ACGAAGTACC TGTTCTACAA CAAGGAAATC CTTGAGAAAG CCGGCATCAA GGAACCGCCG AAAACCTGGG ACGAGCTTGC GGAACAAGCC AAAGCCATCA AGGACAAGGG ACTGCTCGAA AACCCGATCG CCTGGAGCTG GTCTCAAGCG GAAGCGGCCA TCTGCGACTA CACCACTCTG GTCAGTGCCT ATGGCGGAAA ATTCCTCGAT AGCGGCAAGC CGGCCTTCGC CAGCGGCGGC GGGCTCGATG CGCTGAACTA CATGGTGACG AGCTACACCT CCGGGCTCAC CAACCCGAAT TCCAAGGAGT TCCTCGAGGA GGATGTCCGC AAGGTCTTCC AGAACGGCGA GGCCGCCTTC GCGCTCAACT GGACGTACAT GTACAACCTC GCCAACGATC CCAAGGAGAG CAAGGTAGCC GGCAAGGTCG GCGTCGTTCC GGCTCCCGGC GTTGAAGGCA AAAGCGAGGT TTCGGCCGTC AACGGCTCCA TGGGCCTCGG CATCACGACG ACCAGCAAGC ACCCCGAAGA AGCATGGAAA TATATCGTCC ACATGACCTC GCAGGAGACG CAGAACGCCT ATGCCAAGCT GAGCCTCCCG ATCTGGGCAT CTTCCTATGA AGACCCCGAT GTGACCAAGG GCCAGGAGGA ACTCATCGCT GCGGCGAAGC GCGGGCTGGC CGCCATGTAT CCACGCCCAA CGACGCCGAA ATACCAGGAG CTTTCGGCTG CCCTGCAGCA GGCCATCCAG GAGGCGCTGC TCGGCCAAGC CTCTGCGGAA GACGCGCTGA AGAGCGCCGC TGAGAACAGT GGCTTGTGA
|
Protein sequence | MSKSIRNALI GATLVGAGFT GHAQAETTLN ALFMAQAAYS EADVRAMTDA FAKANPDIKV NLEFVPYEGL HDKTVLAQGS GGGYDVVLFD VIWPAEYAAN NVLLDVTDRI TDEINQGVLP GAWTTVEYDG KRYGMPWILD TKYLFYNKEI LEKAGIKEPP KTWDELAEQA KAIKDKGLLE NPIAWSWSQA EAAICDYTTL VSAYGGKFLD SGKPAFASGG GLDALNYMVT SYTSGLTNPN SKEFLEEDVR KVFQNGEAAF ALNWTYMYNL ANDPKESKVA GKVGVVPAPG VEGKSEVSAV NGSMGLGITT TSKHPEEAWK YIVHMTSQET QNAYAKLSLP IWASSYEDPD VTKGQEELIA AAKRGLAAMY PRPTTPKYQE LSAALQQAIQ EALLGQASAE DALKSAAENS GL
|
| |