Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4041 |
Symbol | |
ID | 8014846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4120258 |
End bp | 4121109 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826610 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_002977821 |
Protein GI | 241206725 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGAATA TTCAAGACAT TTTCAAGAAG CCCAATGTTG CCGTCATCAC CGGCGGCGCC TCCGGCATCG GTCTTGCCGC AGCGAGATAT TTCGCCGGAC GCGGCATGAG CGTCGCCATC GCCGATCTCG GCGGCGATCG GCTGGCCGAC GCGGCCAATG AACTCAAGGC CATTGCCGGC GAGGAAAATG TGATGGCCGT CGAGACCGAC GTCGCCAGCA GAGACTCGCT GGAATCGCTC GAGCGCGCCG TGCTCCAGCG TTTCGGCCGT GTGCACGTAC TGATGAACAA TGCCGGCATC GGCCCGGAAA CCTCGATCTT CAGCCCGCAG GCGAACTGGG ACAATATCTT TGCCGTCAAC CTGATGGGGG TGATCAACGG CACGCGCACC TTCGGCCCGA AGATGCTGTC GCACGGCGAG CCGGGCCTGA TCATCAACAC CGGCTCCAAG CAGGGCATCA CCACGCCGCC CGGCAACCCC GCCTACAACA TCTCCAAATC AGGCGTGAAA GTCTTTACCG AAGCGCTGCA GCACGAGCTT CGCAATACCG AAGGCGGAAA GATCTCCGCC CATCTTCTGA TCCCAGGCTT CGTCTTCACC GGCCTCACCA AGGGCGACCG CGCCGAAAAA CCGGCCGCTG CCTGGACCGC CGAGCAGACG GTCGATTTCA TGGTCGAGAG CCTCGAACGC GGCGATTTCT ATATTCTCTG CCCCGACAAT GACGTCGCGC GTCCTCTCGA CGAGCGCCGC ATGCTCTGGG CAGCCGGCGA TATCGTCGAG AACCGGCCGC CGCTGTCACG CTGGCACAAG GATTATGCCG ACAAGTTCAA GGCCTTCCTC GAACAGAAGT AA
|
Protein sequence | MTNIQDIFKK PNVAVITGGA SGIGLAAARY FAGRGMSVAI ADLGGDRLAD AANELKAIAG EENVMAVETD VASRDSLESL ERAVLQRFGR VHVLMNNAGI GPETSIFSPQ ANWDNIFAVN LMGVINGTRT FGPKMLSHGE PGLIINTGSK QGITTPPGNP AYNISKSGVK VFTEALQHEL RNTEGGKISA HLLIPGFVFT GLTKGDRAEK PAAAWTAEQT VDFMVESLER GDFYILCPDN DVARPLDERR MLWAAGDIVE NRPPLSRWHK DYADKFKAFL EQK
|
| |