Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4956 |
Symbol | |
ID | 5318167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1468388 |
End bp | 1469596 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776738 |
Product | putative glucosyltransferase protein |
Protein accession | YP_001313670 |
Protein GI | 150377074 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0869476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAGA TACTCTATCT GGCACAGGAT CTTGCCGATC CCGCAGTGCG CCGGAGAACC CTCACGCTCG TCACGGGCGG CGCACGCGTG ACGCTCGCCG GCTTTCGCCG CGGCGCCAAT CCGCTCGCAG CCATCGACGG CGTCGAGCCC ATCGAACTCG GGACCACCGC CGACGGCCGT TTCGGCCAGC GGATCGGTGC CGTCGTCCGC GCTTGCATGT CTCTGAAAAG CAGGCTCGCA CACGTTCCCA AGCCCGACTT GATCATCGCG CGCAACCTGG AGATGCTTGC ACTAGCCAGG AGGGCCGTCG CCGTTTTCGG CGGCACGGTT CCGATCGTTT ACGAATGCCT GGACATTCAT CGCCTAATGC TCCGCCAGGA CATGGTCGGG CGCACTTTGC GAGCGGCCGA AGGCCAGCTC GGCAAGGACG TTCGCTTGCT GATCACCAGC TCCCCGGCCT TCGTCGAGCA CTATTTTCGC CCCATTTCCA GCATCGGCGC TCCGCCGATG CTGCTCGAAA ACAAGGTTCT TGAACTCGAT GGCATCTCCG AACCTAAAGC CGCACCGGCG GTATGCCCAG CGCCCGGTGC GCCTTGGAAG ATCGGCTGGT TCGGGGCGTT GCGCTGCCGG AGGTCACTCG CACTTCTGGC CGAGTTCTCG CGCAGGATGA ACGGCCGCTT CGAGGTCGTT CTGCGCGGCC GGCCAGCCTA CTCCGAGTTC GAGGATTTCG ACGGCTTCGT TACGAACGAG CCCTTCATGC GTTTCGCAGG CGCCTATCGC AACCCAGAAG ACCTCGCAGA TATCTATGGC GAGGTCCATT TCACCTGGGC GATCGACTTC TTCGAGGAAG GCCAGAATTC TGCCTGGCTG CTGCCGAACC GTCTCTATGA AGGCTGCCGC CACGGCCGAG TTCCCATCTC GATGAAAGGG ACGGAGACTG CTCGTTTCCT ATCCGTGCGG GGCATCGGTC TCGTTCTTGA AGAAGCCGAC GCCGACAGCC TCGACACAGT CCTCGGCTCA CTGTCTCCGC AGAGCTATGC CGACGCCGCA GAACGCATCA GCCGCTGCAA TCCCGGATCC TGGATATTCA GCCGCGCCGA TTGCGAAGCC CTGGTGCGGC AACTGGCGGC GCTTACCGTG GAGACGCGGC AGCACGTTCC CATAGTCGCG ACGGCGGGGG CCCGCCACAA TGAAGGTGGT TTCTTATGA
|
Protein sequence | MLQILYLAQD LADPAVRRRT LTLVTGGARV TLAGFRRGAN PLAAIDGVEP IELGTTADGR FGQRIGAVVR ACMSLKSRLA HVPKPDLIIA RNLEMLALAR RAVAVFGGTV PIVYECLDIH RLMLRQDMVG RTLRAAEGQL GKDVRLLITS SPAFVEHYFR PISSIGAPPM LLENKVLELD GISEPKAAPA VCPAPGAPWK IGWFGALRCR RSLALLAEFS RRMNGRFEVV LRGRPAYSEF EDFDGFVTNE PFMRFAGAYR NPEDLADIYG EVHFTWAIDF FEEGQNSAWL LPNRLYEGCR HGRVPISMKG TETARFLSVR GIGLVLEEAD ADSLDTVLGS LSPQSYADAA ERISRCNPGS WIFSRADCEA LVRQLAALTV ETRQHVPIVA TAGARHNEGG FL
|
| |