Gene Information |
Locus tag | Smed_3598 |
Symbol | |
ID | 5318432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 25022 |
End bp | 26170 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775412 |
Product | galactonate dehydratase |
Protein accession | YP_001312345 |
Protein GI | 150375749 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
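The coordinate and length fields in this table are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 25022
end_bp = 26170

# Assumption: coordinates are 1-based and inclusive,
# so the span covers both the start and the end base.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1149 bp) and Protein Length (382 aa) rows.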
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.729698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CCAAGCTGAC GACCTATATC GTTCCGCCGC GCTGGCTCTT TCTCAAGATC GAAACCGACG AGGGCGTCGT CGGGTGGGGA GAGCCCGTGG TAGAGGGGCG CGCCCTGACG GTCGAGGCCG CCGTTCACGA GCTGTCTGAC TATCTCGTAG GCAGGGACCC CTTCCTCATC GAGGATCATT GGAACGTCCT TTATCGCGGC GGATTCTACC GCGGCGGTGC CATACATATG AGTGCGCTGG CCGGCATTGA CCAGGCGCTA TGGGACATCA AGGGCAAGGC CCTTGGGCAG CCGGTCCATT CCTTGCTCGG CGGGCAATGC CGCGACAAGA TCAAAGTCTA TTCCTGGATC GGCGGCGACC GACCGAGCGA CGTCGCGAAC AATGCGCGCG ACGTTGTTGC CCGCGGTTTC AAGGCGATAA AGCTCAACGG CTGCGAGGAG ATGCAGATCG TCGACACCAA CGAGAAGATC GATAGGGCGG TCGAGACGAT CGGAACGATC CGCGACGCGA TCGGCCCCAA TATCGGCATC GGCGTCGACT TCCACGGTCG TGTGCATAGG CCCATGGCGA AGGTTCTCGC CAAGGAACTG GAACAGTTCA AGCTGATGTT CATCGAGGAG CCGGTCCTGT CGGAGAACCG CGAGGCCCTG AGGGAGATCG CCAACCATTG CTCGACCCCG ATCGCGCTCG GCGAAAGGCT CTATTCGCGC TGGGACTTCA AATCTGTTCT CTCCGACGGC TTCGTCGACA TCATTCAGCC GGATCTTTCG CACGCCGGCG GGATCACCGA ATGCCGCAAG ATCGCCGCGA TGGCCGAGGC CTACGATGTG GCGCTGGCGC CCCACTGCCC GCTCGGGCCG ATTGCACTTG CCGCCTGCCT GCAGGTCGAC GCGGTCAGCT ATAACGCCTT CATCCAGGAG CAGAGCCTCG GCATCCACTA CAACGAGGCG AACGACATCC TCGATTACAT CTCCAACAAG GACGTCTTCG CCTATGAGGA CGGCTTCGTT TCCATTCCTC AGGGACCCGG TCTCGGCATC GAGGTGGACG AGGCCTATGT GATGGAACGC GCGAAGGAGG GGCATCGCTG GCGCAACCCG GTCTGGCGCC ATTCGGATGG CAGCGTGGCC GAATGGTGA
Protein sequence | MKITKLTTYI VPPRWLFLKI ETDEGVVGWG EPVVEGRALT VEAAVHELSD YLVGRDPFLI EDHWNVLYRG GFYRGGAIHM SALAGIDQAL WDIKGKALGQ PVHSLLGGQC RDKIKVYSWI GGDRPSDVAN NARDVVARGF KAIKLNGCEE MQIVDTNEKI DRAVETIGTI RDAIGPNIGI GVDFHGRVHR PMAKVLAKEL EQFKLMFIEE PVLSENREAL REIANHCSTP IALGERLYSR WDFKSVLSDG FVDIIQPDLS HAGGITECRK IAAMAEAYDV ALAPHCPLGP IALAACLQVD AVSYNAFIQE QSLGIHYNEA NDILDYISNK DVFAYEDGFV SIPQGPGLGI EVDEAYVMER AKEGHRWRNP VWRHSDGSVA EW
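The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. The sketch below is an illustration only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so a standard codon table is used here.

```python
# Standard genetic code in NCBI "TCAG" order. Translation table 11 uses the
# same codon-to-amino-acid assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and the last nine bases of the gene sequence in this record.
print(translate("ATGAAGATCA CCAAGCTGAC"))  # MKITKL
print(translate("GAATGGTGA"))              # EW
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content rows; only the short fragments above were checked here.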