Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5019 |
Symbol | |
ID | 8007610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 403478 |
End bp | 404422 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644821934 |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_002973194 |
Protein GI | 241113359 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.177814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.505176 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAATT ATCGGTCCAA ACGATTATCG GCGGCTGCCA GCGTATCCGT CGACTTCCCG AACCTTGTCA CCCGCAGCCT GGGGCGGGGC GGCCGGGGGG CTCTGTTCAC CGGCAATAGT GCCGATTCTG TCGAGGAACA GAGCCCCCAT GTTACCAGGC CTGCATTCGT CGTCTCGTTG GAGGCTCCAG ATATGCCGGC GAAGAACGAA TATGATCCCG CCGCAGTCGA TCATCACGAC CCCTTCCTGG CGCCTCTTTG CTATCGGCAA CGCGACATCG CACTGTTTGG CGACATCGAC TTCGTCATCA TCGGATGCGG CGGCTTGGGC TCGCAGATCG CCATCCAGCT CGCGGCCCTC GGCGCGCGTC GTTTCCTTCT CGTGGATGCG GATCGTATCG ATGAGAACGA CTTGAACCAT CTCCCATGGG CATGCGAGGC TGATCTCGGC CGGCTGAAGA CGGACAGGCT GGCGACCCAT CTGGCCGCGG GTTTCTCGGC CACTGTCTTC GCGCTGCCGG AATTTGCGGA AGGCGCCTCG GCGCTGCGGT TAATCGCAAA CTACGCCAAT AACCCGTTCC TCATCCTCGC CGGCGGCGAT TCTCGTCCAA CCCAAGATCT CCTGTCAGCC TGCCTGGCAT TGGAAGCCGG CCTGCCGCCT CATCTGCATC TGGGCCGCTC TGCAAACTAT TGCATGGCAG GGCCTTTGGC CTTGGTGCAT GAGGACGCAT GTTCCGTCTG CCATTGCGCT ACCCAAGTCA CGGCCGACGA CGGCTTACGC GCGCCGCAGG CTACCGTCGA CAGCCCATTG GTCGCCGGCC TTGCCGTGTC GCAGATTGTC CAAAAATGCC TTTCGAGACA CTCGCTCGCC CGGGGACGCC AATGGATATT GGACCTCAAA GGCGACCAGG CCAAGCTGCG CTCTCTCCAA AGAACCCGAA TGTAA
|
Protein sequence | MVNYRSKRLS AAASVSVDFP NLVTRSLGRG GRGALFTGNS ADSVEEQSPH VTRPAFVVSL EAPDMPAKNE YDPAAVDHHD PFLAPLCYRQ RDIALFGDID FVIIGCGGLG SQIAIQLAAL GARRFLLVDA DRIDENDLNH LPWACEADLG RLKTDRLATH LAAGFSATVF ALPEFAEGAS ALRLIANYAN NPFLILAGGD SRPTQDLLSA CLALEAGLPP HLHLGRSANY CMAGPLALVH EDACSVCHCA TQVTADDGLR APQATVDSPL VAGLAVSQIV QKCLSRHSLA RGRQWILDLK GDQAKLRSLQ RTRM
|
| |