Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0408 |
Symbol | |
ID | 6979123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 420213 |
End bp | 421193 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643395121 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_002279933 |
Protein GI | 209548016 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.253997 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAG CGCTTGTCAC CGGGGCAGAT GGGTTCATCG GTTCTCATCT CGTCGAAACG CTGGTCAGGT CGGGTGTCGA GGTTCGCGCC CTCTGCCAGT ATAACTCGTT TTCCAGTTGG GGCTGGCTCG ACCAATCGGA ATATCGCGGC AAGTTCGAGG TGATCCTCGG AGACGTCCGT GATCCCGCCC AGATGCGCTC CGTCGCCAGA GATGTCGACA CGATCTTCCA TCTTGCCGCT CTGATTGCGA TTCCCTATTC CTATCAGGCT CCCTCGAGCT ACATCGATAC CAATGTGCAT GGGACGCTGA ACGTTCTTCA AGGCGCTCTC GACGCCGGCG TCGGAAGAGT GATCCAGACC TCGACGAGCG AAGTCTATGG GACGGCACGT TTTGTGCCCA TCAGCGAAAG CCATCCTTTG CAGGCGCAGT CGCCCTATTC GGCGTCCAAA ATCGGTGCCG ATGCGATCGC CTACAGCTAT CACTCGAGCT TCGATCTGCC GGTGACGATC GCACGGCCGT TCAACACCTA CGGCCCGAGA CAATCTGCAA GGGCGGTTAT TCCAACCGTG ATTTCGCAGC TTCTGAGTGG ACGAAGGACG CTCAAGCTTG GTGCGCTCTC GCCCACCCGG GATTTCAATT TCGTGCAGGA TACATGCGAC GGCTTTCTGG CGCTCGCGGC CTGCGACAAA GCCATCGGAC AGACGGTCAA TATCGGCTCG GGCGGCGAGA TATCGATCGG CGATACCGTT CGGCTGATCG CCGATATCAT CGGCGTCAAC ATCGAGATCG AATGCGACGA ACAGCGTTTG CGTCCGGCAA ACAGCGAAGT GGAACGCCTG TGCTGTGATA ACAGCCTGAT CAAGTCTCTG ACGGGATTCT CGCCTCGCTA CAGCTTGAAA GACGGGCTCC AGGCGACGAT TGAATGGCTG CGTCAGCCCG AGAATCTGGC GCGGTATAAG GCGGATATTT TCAATGTCTA G
|
Protein sequence | MKKALVTGAD GFIGSHLVET LVRSGVEVRA LCQYNSFSSW GWLDQSEYRG KFEVILGDVR DPAQMRSVAR DVDTIFHLAA LIAIPYSYQA PSSYIDTNVH GTLNVLQGAL DAGVGRVIQT STSEVYGTAR FVPISESHPL QAQSPYSASK IGADAIAYSY HSSFDLPVTI ARPFNTYGPR QSARAVIPTV ISQLLSGRRT LKLGALSPTR DFNFVQDTCD GFLALAACDK AIGQTVNIGS GGEISIGDTV RLIADIIGVN IEIECDEQRL RPANSEVERL CCDNSLIKSL TGFSPRYSLK DGLQATIEWL RQPENLARYK ADIFNV
|
| |