Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3773 |
Symbol | |
ID | 8014603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3828044 |
End bp | 3828865 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826336 |
Product | short chain dehydrogenase |
Protein accession | YP_002977555 |
Protein GI | 241206459 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.302075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAG ACCACGGCCG CCTTGACGGC AAGATCGCCA TCGTCACCGG CGGCACGCAA GGATTGGGCG CGACGATCGC CCGCCTCTTC GCCGAACGCG GTGCAAAAGG CATCGTCATC TGCGGCCGCA ACGAAGCCAA GGGCAAGGCG AAGGCGGCGG AGATTTCGGC TGCGACCGGC ACCAGGATCG TCTACGTCAA GTCCGACCTC GGCAAGGTCG AGGATGCACA GAATGTCGTG CGCGCCTGCG ACGAGACCTT CGGTCGTGTC GATGCGCTGG TCAATGCCGC CGCCATCACC GATCGCGGCA CCATCCTCGA CACCAGCCCG GAACTCTTCG ACGCAATGTT CGCCGTCAAT GTTCGCGCGC CGTTTTTCCT GATGCAGGAA GCGGTGAAGG TCATGCGCCG CGAAAAGATC GAGGGTACGA TCGTCAACAT CGGCTCGATG TCGGCCAAAG CCGGCCAGCC CTTCATCGCC GCCTATTGCG CCTCCAAGGG CGCGCTGGAA ACGCTGACGA AGAACACCGC CTATGCGCTC CTGCGCAACC GCATCCGCGT CAATGGTCTG AACATCGGTT GGATGGCCTC TGAAGGCGAG GACCGCATTC AGCGCGAATA TCACGGCGCA CCGGCCGACT GGCTGGAGAA GGCGGCGGCA AGCCAGCCCT TCGGCCGTCT CGTCGATCCG CACGAGGTGG CGCGCGCCTG CGCTTACCTG TCGTCTTCCG AATCCGGCCT GATGACCGGC TCGGTCATCT GCTTCGACCA GTCGATCTGG GGCGCTTACG ACGGCTCGCC GCATCCGGTC GCCGCCCTCT AG
|
Protein sequence | MSTDHGRLDG KIAIVTGGTQ GLGATIARLF AERGAKGIVI CGRNEAKGKA KAAEISAATG TRIVYVKSDL GKVEDAQNVV RACDETFGRV DALVNAAAIT DRGTILDTSP ELFDAMFAVN VRAPFFLMQE AVKVMRREKI EGTIVNIGSM SAKAGQPFIA AYCASKGALE TLTKNTAYAL LRNRIRVNGL NIGWMASEGE DRIQREYHGA PADWLEKAAA SQPFGRLVDP HEVARACAYL SSSESGLMTG SVICFDQSIW GAYDGSPHPV AAL
|
| |