Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5599 |
Symbol | |
ID | 6978693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1245491 |
End bp | 1246432 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643394697 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_002279515 |
Protein GI | 209547597 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGCCT TGGTAACGGG AGGGGCCGGA TTTATCGGCA GCCATCTCTG CGATCGCCTT TTGGCCTTGG GCTATCGGGT AACCGCGATC GACAATCTTC ACCTCGGCCG GATGCGGAAT ATAGACCATC TGATGGCTCG ACCGGATTTT CACTTCCAGC AACTGGATAT GCTTGATCGG GAGGGCATGG ACCAGCTCGT TGCGGCGGAT CGTCCCGACG CTGTTTTTCA TCTCGCCGCC AATTCCGATA TTGCCGCGGG CAATGCCAAT GCGGAACTGG ACCTGCAACT GAACCAGCTG ACGACGACGA CACTGCTTGC CATCATGCGC AAGTATGAGA TCGGCAGGCT CTTTTTCGCC AGCACCTCCG CCGTCTTCGG TGAAGCCGAA GGAAATATTC ACGAGAACCA CGGCCCCCTG CGGCCGATCT CGCTCTATGG CGCCAGCAAG CTTGCCGCTG AAGCCTATTT GTCGGTCTAC GCGCTGTCAT TCGGCATTAA GACGCTCGTG CTGCGCTTTC CCAACGTCGT TGGGGAACGG TCCACGCACG GCGCAATTTA CGACTTCATC AACAGGCTGA AGGCTGATCC GACGAGGCTT CAGGTGCTCG GCAACGGACG GCAGACCAAG CCCTATCTCT ATGTCGGCGA CCTCGTAGAC GCCATTCTGC TCGCATGGGA CAAGGCGCCC GGCGCCTATG AGGTGTTTCA CGCCAGCGGC ATCGGCGAAA CATCCGTCCG GGATATCGCC GAAATCGTGG TTTCCAAAGT CGCGGCCGGC GCTGCCATCG AATATGGCAG CGAGGACCGC GGTTGGCTGG GCGACGTGCC GCGCTTCAGC TACGATATAA GCCGGCTCGT AACCTTGGGC TGGTCTCCGA AGCGGAAATC GACAGAAGCC GTTGAACTTG CCGTCGAACG GATCCTGGCG AATGGGTTCT GA
|
Protein sequence | MHALVTGGAG FIGSHLCDRL LALGYRVTAI DNLHLGRMRN IDHLMARPDF HFQQLDMLDR EGMDQLVAAD RPDAVFHLAA NSDIAAGNAN AELDLQLNQL TTTTLLAIMR KYEIGRLFFA STSAVFGEAE GNIHENHGPL RPISLYGASK LAAEAYLSVY ALSFGIKTLV LRFPNVVGER STHGAIYDFI NRLKADPTRL QVLGNGRQTK PYLYVGDLVD AILLAWDKAP GAYEVFHASG IGETSVRDIA EIVVSKVAAG AAIEYGSEDR GWLGDVPRFS YDISRLVTLG WSPKRKSTEA VELAVERILA NGF
|
| |