Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4144 |
Symbol | |
ID | 5319140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 616757 |
End bp | 618130 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775949 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001312882 |
Protein GI | 150376286 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.663959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGAC AACCCAGGAT CACTTTCATC GGCGCCGGTT CCACCGTGTT CATGAAGAAC ATTATCGGCG ATATCTTGCA GCGCCCGGCG CTTTCGGCCG CGACCATCGC CTTGATGGAC GTCAACCCGG AGCGCCTGGC GGAAAGCGAG ATCGTCGCGG GCAAGCTGGC GCGCACGCTG GGCGCCGGCG CCAGGATCGA GACGCACTCC GACCAGCGCA AGGCGCTCAC GGGAGCGGAC TTCGTCGTGG TTGCCTTCCA GATCGGCGGC TACGAGCCAT GCACCGTGAC AGATTTCGAG GTGCCGAAAA AATACGGACT GCGCCAGACG ATCGCCGACA CGCTCGGCGT CGGCGGCATC ATGCGGGGGT TGCGCACCGT CCCGCATCTC TGGAAGATCT GCGAGGACAT GCTCGAGGTC TGCCCCGAGG CGATCCTCCT GCAATATGTA AACCCGATGG CGATCAACAC CTGGGCGATC GCCGAGAGGT ATCCGGCCAT CAAGCAGGTG GGCCTCTGCC ACTCCGTGCA GGGCACGGCC TATGAACTCG CCCGCGATCT CGAGATACCG CTCGAGGAGA TCCGCTATCG CGCCGCCGGC ATCAACCACA TGGCCTTCTA TCTGAAATTC GAGCACCGTC AGAAGGACGG CAGCTATCGT GATCTCTATC CGGACCTTAT CCGCGGCTAC CGCGAGGGGC GCTTTCCGAA GCCGAGCCAT TGGAACCCGC GCTGCCCAAA CAAGGTACGT TACGAAATGC TGACGCGGCT CGGCTATTTC GTCACCGAAA GCTCGGAGCA TTTCGCCGAG TACACGCCCT ATTTCATCAA GGAGGGGCGT CCCGATCTGA TCGAAAAATT CGGGATTCCG CTCGACGAGT ATCCGAAGCG TTGCATCGAG CAGATCGAGC GCTGGAAGGG CCAGGCGGCC GCCTTCAAGG AGGCGGAGAC GATGGAAGTC GCAGAGAGCC GCGAATATGC CTCCTCGATC ATGAACTCGG TCTGGACCGG CGAGCCCTCG GTGATTTACG GCAACCTCAG AAACAATGGC TGCATCACCT CGCTGCCGGA AAACTGCGCG GCGGAGATGC CGTGTCTCGT CGATCAGTCG GGTATTCAGC CGACCCATAT CGGCGCGCTG CCGCCGCAAC TCACGGCGTT GATCCGCACC AACATCAACG TACAGGAGTT GACGGTTCAG GCGCTCGTCA CTGAAAACCG GGAGCATCTC TACCATGCGG CAATGATGGA TCCGCATACC GCCGCCGAGC TCGACCTCGA CCAGATCTGG TCCCTTGTCG ACGATCTGCT CACAGCGCAC CGCGACTGGA TCCCGGAATG GGCCCGCGTC GCGCAGAAGG TAGCGGCCGC CTGA
|
Protein sequence | MTRQPRITFI GAGSTVFMKN IIGDILQRPA LSAATIALMD VNPERLAESE IVAGKLARTL GAGARIETHS DQRKALTGAD FVVVAFQIGG YEPCTVTDFE VPKKYGLRQT IADTLGVGGI MRGLRTVPHL WKICEDMLEV CPEAILLQYV NPMAINTWAI AERYPAIKQV GLCHSVQGTA YELARDLEIP LEEIRYRAAG INHMAFYLKF EHRQKDGSYR DLYPDLIRGY REGRFPKPSH WNPRCPNKVR YEMLTRLGYF VTESSEHFAE YTPYFIKEGR PDLIEKFGIP LDEYPKRCIE QIERWKGQAA AFKEAETMEV AESREYASSI MNSVWTGEPS VIYGNLRNNG CITSLPENCA AEMPCLVDQS GIQPTHIGAL PPQLTALIRT NINVQELTVQ ALVTENREHL YHAAMMDPHT AAELDLDQIW SLVDDLLTAH RDWIPEWARV AQKVAAA
|
| |