Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4467 |
Symbol | |
ID | 5318169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 947971 |
End bp | 949008 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776268 |
Product | D-xylose ABC transporter, periplasmic substrate-binding protein |
Protein accession | YP_001313200 |
Protein GI | 150376604 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0159284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCCA TTTTGAAACT GATGGCCGGG GCTGCCATCA TAGCGTCCAT GCATTCCGCG GCGATCGCCA AGGATCTCGT CATCGGCGTT TCCTGGTCAA ACTTCCAGGA GGAGCGCTGG AAGACCGATG AGGCCGCCAT CAAGACCGCG CTGGAAGCCT CGGGCGACAA ATACATTTCG GCCGATGCAC AGTCTTCAGC AGCAAAGCAG CTCACCGACA TCGAATCGCT AATCGCCCAG GGCGCCAACG CACTGATCGT GCTTGCGCAG GACTCCGATG CGATCGGTCC GGCTATCGAG AAGGCCGCTG CCGAGGGGAT CCCGGTCGTC GGCTATGACC GCCTGATCGA AAACCCGGCC GCCTTCTACA TCACCTTTGA CAACAAGGAA GTCGGCCGCC TGCAAGCGAG CGAGGTGTTC AAGCAGAAGC CGGAAGGCAA CTACGTCTTC ATCAAGGGCT CCTCCGCCGA TCCGAACGCC GACTTCCTTT TCTCAGGACA GATGGAAGTC CTGAAGGATG CCATCGATGC GGGCAAGATC AAGAATGTCG GCGAGGCCTA TACCGATGGC TGGAAGCCGG AAAACGCCCA GAAGAACATG GAACAGTTCC TGACGGCTAA CGACAACAAG GTCGATGCGA TCGTGGCCTC GAACGACGGG ACCGCCGGCG GCGCGATCGC GGCACTCGAC GCCCAGGGCC TTGCCGGTTC GGTTCCTGTG TCCGGCCAAG ATGCCGACAA GGCAGCGCTG AACCGCGTCG CTCGCGGCAC GCAGACGGTT TCGGTGTGGA AGGACTCCCG CGAACTCGGT AAGAAAGCGG CAGAGATTGC CGCGGCGCTT GCCGCCGGCA AGACCATGGA TGAAATCGAA GGCGTCCAGA CCTTTGACGG CGGCCCCAAG GGCGTGGCCA TGAAATCCGT TTTCCTGGCA CCGCTGGCGA TCACCAGGGA CAATCTCAAT GTCGTCATCG ATGCCGGCTG GATTGCCAAG GAAGAGACCT GCCAGGGCGC CAAGGACGAC GTGGCTGCGT GCAAGTAA
|
Protein sequence | MKSILKLMAG AAIIASMHSA AIAKDLVIGV SWSNFQEERW KTDEAAIKTA LEASGDKYIS ADAQSSAAKQ LTDIESLIAQ GANALIVLAQ DSDAIGPAIE KAAAEGIPVV GYDRLIENPA AFYITFDNKE VGRLQASEVF KQKPEGNYVF IKGSSADPNA DFLFSGQMEV LKDAIDAGKI KNVGEAYTDG WKPENAQKNM EQFLTANDNK VDAIVASNDG TAGGAIAALD AQGLAGSVPV SGQDADKAAL NRVARGTQTV SVWKDSRELG KKAAEIAAAL AAGKTMDEIE GVQTFDGGPK GVAMKSVFLA PLAITRDNLN VVIDAGWIAK EETCQGAKDD VAACK
|
| |