Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4991 |
Symbol | |
ID | 5318712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1504933 |
End bp | 1506063 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640776773 |
Product | hypothetical protein |
Protein accession | YP_001313705 |
Protein GI | 150377109 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.723293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00320461 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAACCTCG TCCTCAAGGG AATGACCTGG AACCACCCGC GCGGCTACGA CCCGATGGTG GCCTGCTCGC GGGCCTGGCA GGAGACGTCG GGCGTCGAGA TCCAATGGGA GAAGCGGTCG CTTCAGGACT TCGAGACGTT TCCGGTGGAA GTTCTTGCAA GGGATTACGA CCTGATCGTC ATAGATCATC CGCATGTCGG CCAGATCACC AGCGAAAACT GCCTGCTTCC GCTGGACGTG CCCGGGCGTG AGGCCGATCG GCAGGCTCTC TCGGCGGCAA GTGTCGGCCC CTCCTATCGC AGCTACGAAT GGAACGCCCG CCAATGGGCC TTTCCGATCG ATGCCGCGAC CCAGGTGCAG GCCTGGCGGC CGGATCGGAC CGAGCGCCTC CGGACCTGGC GGGAGGTGCT GGACCTGGCG CGTTCAGGCG GCGTCGTTCT GCCGCTTCGC CCGCCCCATT CGCTGATGAG CTTCTTCACC CTCTGCGGCA ATCTGGGGCG GCCCTGCCGC AGCAACGGGC AGGGCGAGCT CGTCGATGCC GAAACGGGTG CCGCTGCAAT CGAGCTGCTG AAGGAGATCG CAGCCCTGGT CGACCCGGAC TGCTTCGACA TGGATCCGAT CGCAGCCTTC GAGGCGATGG CGGAAAAGGG GTCCGCCTTT GCCTGCGCGC CGCTCATCTA CGGCTATGTC AGCTACTCGA TGGCGGGCTT TCGGCCGGCG CTCATCCGCT TCGGCGATAT TCCTGAAATC GGCGCGGCAG GACCGGTCGG CTCGGCTCTC GGCGGGACGG GCATTGCGGT ATCCGCCTTT TCGAAGGCGC CGGAGCAGGC GATCGATTTT GCCTATTGGG TGGCGAGCGG CGACGTGCAG CGGGGTATCT ACGCCGCCTG CGGCGGCCAG CCCGGCCATG GCGCCGCCTG GCAGGACGAG ACGGTCAATG CGGCGACGCA TGATTTCTAC CGCGCGACCC GGGCGACGCT CGAGGCTGCC TGGCTCCGGC CGCGCCATGA CGGCTATATG GCATTCCAGC AGGCGGGTTC GGATCGCCTG AACGAAGGGC TCAAGCGCGG CGAGAGGCCA AGCCTCGTGG CCGAAGAACT CAATCGGCTG TTCTGCGAGA GTTTTCGCTG A
|
Protein sequence | MNLVLKGMTW NHPRGYDPMV ACSRAWQETS GVEIQWEKRS LQDFETFPVE VLARDYDLIV IDHPHVGQIT SENCLLPLDV PGREADRQAL SAASVGPSYR SYEWNARQWA FPIDAATQVQ AWRPDRTERL RTWREVLDLA RSGGVVLPLR PPHSLMSFFT LCGNLGRPCR SNGQGELVDA ETGAAAIELL KEIAALVDPD CFDMDPIAAF EAMAEKGSAF ACAPLIYGYV SYSMAGFRPA LIRFGDIPEI GAAGPVGSAL GGTGIAVSAF SKAPEQAIDF AYWVASGDVQ RGIYAACGGQ PGHGAAWQDE TVNAATHDFY RATRATLEAA WLRPRHDGYM AFQQAGSDRL NEGLKRGERP SLVAEELNRL FCESFR
|
| |