Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4286 |
Symbol | |
ID | 5319122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 779607 |
End bp | 780854 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776091 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313024 |
Protein GI | 150376428 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.683579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.881811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA CGGCATTCTT CGCATCGGCC GCTACCATCA TGCTGTCGGG CCTCGTAGCG GTTCAGGCCG CACGGGCCGA GTGCGGCATA GAAAAGGGCA CGGTGCGCAT CCTCTCGAAC GATTTCGAAG CCCTGCGTCT CGTGGTGTCG GAAGCCGAGA AATGCGCCTC GGCGACGGTC AAGGTCAGCA AGAACCAAAC GAGCGAGCAC AAGAACCTGC AAGTGCCGGC GCTCAAGATC AATCCGGCCC AATATACTGT CGCGGTGATT GCCAACAATT CGATCGTTCC GCTGCTGAAC GAAGGTCTGC TGCGCCCCCT TGACGATCTC GTTGCCAAGT ACGGCCAGGA TCTCCAGCCG ACCCAGCTCA TCAAACTCGA CGGCAAGGTT ATGGCGATCG CCTTCATGGG CAACTCGCAG CATCTATTCT TCCGCAAGGA CATCCTCGAA AAGGCCGGGC TTCAACAGCC GAAGTCCTAT GAGGACGTGC TCGCAGCGGC CAAGGCGATC AAGGAGAAGG GGCTGATGCA GTATCCGCTC GCCGCCTCCA ACAAGGCCGG CTGGGATCTC GCGGCCGAAT TCGTCAACAT GTATCTCGGC TATGGCGGAG AGCTCTTCGC GGCCGGTTCG GCAGCCCCCG CCATCAACAA CGAGAAGGGT CTCGCCACTC TGAAGACGAT GAAGGCGATG ACCGAGTACA TGAACCCCGA CTACATGACC TACAATGCCG ACGAGATCGT CAAGGGCTAT GCGGCCGGCA AGACGGCGAT CATCAATGCG TGGGGCTCGC TTGCGGGCGG CGTGATCGAT CCGGTCAAAA CGCCGGCCGA GATCGCCGAC AACACGGTTC TCGGCGCAGC GCCGACCGTT GGGGGCGGCT CGATCCCTGC AGCGGCCCTC TGGTGGGACG GTTTCTCCAT CGCCAAGAAC ATTTCCGACG AAGATGCGGA GGCATCTTTC CGCGCCATGG TCCATGGCAT CCGGCCGGAA GTTGGCCAGA AACATCCGGC TCTTGCGACC TGGCTGACCA AGGGGTACCA GCCTGGACCC AACGCGGTCG GGGTTCTCGC GACGGCGAAT GGCGGCGCGA AGCCCTATCC GATGCTGCCT TATATGGGCC TGCTGCACTC GGCTCTCGGC TCCGAACTGG CGGAGTACAT GCAGGGGCGT GAAAGTGCAG AGGAGGCGCT GAAAGATGTC GAGGCATCCT ACAGCGCGGC AGCGAAAGAG GGCGGATTCC TGAAATGA
|
Protein sequence | MKKTAFFASA ATIMLSGLVA VQAARAECGI EKGTVRILSN DFEALRLVVS EAEKCASATV KVSKNQTSEH KNLQVPALKI NPAQYTVAVI ANNSIVPLLN EGLLRPLDDL VAKYGQDLQP TQLIKLDGKV MAIAFMGNSQ HLFFRKDILE KAGLQQPKSY EDVLAAAKAI KEKGLMQYPL AASNKAGWDL AAEFVNMYLG YGGELFAAGS AAPAINNEKG LATLKTMKAM TEYMNPDYMT YNADEIVKGY AAGKTAIINA WGSLAGGVID PVKTPAEIAD NTVLGAAPTV GGGSIPAAAL WWDGFSIAKN ISDEDAEASF RAMVHGIRPE VGQKHPALAT WLTKGYQPGP NAVGVLATAN GGAKPYPMLP YMGLLHSALG SELAEYMQGR ESAEEALKDV EASYSAAAKE GGFLK
|
| |