Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2288 |
Symbol | |
ID | 6981027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2345908 |
End bp | 2346792 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643397001 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_002281789 |
Protein GI | 209549872 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATAT TCGTAACCGG AGCAACCGGC TGGGTTGGCT CGGCCGTCGT CAGTGAACTG ATCGCTGGCG GACACCAGGT GCTCGGCCTC ACCCGCTCCG AAAAAGGCGC TGAGGAATTG GCGGCCGCCG GCGCTGCGGT CCATCGCGGC ACGCTCGAGG ATGTGGAGAG CCTGAAGCGC GGCGCCGCCG AGGCCGATGG CGTCATCCAC ACGGGCTTCA ACCACGATTT CTCGAAGTTC GCCGAAAACT GCACCCTGGA CCGGCGCGCC ATCGAGGCGC TCGGCGAAGC TCTCCAAGGC TCCAGCCGCC CCCTGCTGGT CACGGCAGGC CTCGGCCATG CGCCCGGCCG CGTCGGCACC GAGAAAGATC CGCCCATGCC CACCACAGAG ACCTATCCCC GCGCCTCCGA AATCACCGCG GTATCACTTG CGGCACGCGG GATGCGCGCC TCCACCGTCC GGCTTCCGCC TTCGGTGCAT GGCCACGGCG ATCACGGCTT CGTGCCGATC CTGATTGATT TCGCCCGGCG GACGGGTGTT TCGGCCTATA TCGGCGAAGG GCAGAACCGC TGGCCGGCGG TGCACAGGCT CGATGCCGCC CGCCTCTATC GGCTGGCGCT GGAGCGCGGC GCGGTCGGCG GCCCCTTCCT CGCGGTCGCC GAGGAGGGCG TGCCGTTCCG GAAGATCGCC GAGGTGATCG GGCGGCGGCT CAATCTTCCG GTGGTCTCGA AGTCGCGTGA GGAAGCGGTT GAGCATTTCG GCTGGTTCGT GATGTTTGCC GGCTTCGACG TGCCGACGTC GAGCGAGCGC ACCCGCACGC TCCTCAATTG GCAGCCCACC CAGCCGGACC TGCTCGCCGA TATCGACCAC CCGGCTTATT TTTAA
|
Protein sequence | MRIFVTGATG WVGSAVVSEL IAGGHQVLGL TRSEKGAEEL AAAGAAVHRG TLEDVESLKR GAAEADGVIH TGFNHDFSKF AENCTLDRRA IEALGEALQG SSRPLLVTAG LGHAPGRVGT EKDPPMPTTE TYPRASEITA VSLAARGMRA STVRLPPSVH GHGDHGFVPI LIDFARRTGV SAYIGEGQNR WPAVHRLDAA RLYRLALERG AVGGPFLAVA EEGVPFRKIA EVIGRRLNLP VVSKSREEAV EHFGWFVMFA GFDVPTSSER TRTLLNWQPT QPDLLADIDH PAYF
|
| |