Gene Rleg_5019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5019 
Symbol 
ID8007610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp403478 
End bp404422 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content61% 
IMG OID644821934 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_002973194 
Protein GI241113359 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.177814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.505176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAATT ATCGGTCCAA ACGATTATCG GCGGCTGCCA GCGTATCCGT CGACTTCCCG 
AACCTTGTCA CCCGCAGCCT GGGGCGGGGC GGCCGGGGGG CTCTGTTCAC CGGCAATAGT
GCCGATTCTG TCGAGGAACA GAGCCCCCAT GTTACCAGGC CTGCATTCGT CGTCTCGTTG
GAGGCTCCAG ATATGCCGGC GAAGAACGAA TATGATCCCG CCGCAGTCGA TCATCACGAC
CCCTTCCTGG CGCCTCTTTG CTATCGGCAA CGCGACATCG CACTGTTTGG CGACATCGAC
TTCGTCATCA TCGGATGCGG CGGCTTGGGC TCGCAGATCG CCATCCAGCT CGCGGCCCTC
GGCGCGCGTC GTTTCCTTCT CGTGGATGCG GATCGTATCG ATGAGAACGA CTTGAACCAT
CTCCCATGGG CATGCGAGGC TGATCTCGGC CGGCTGAAGA CGGACAGGCT GGCGACCCAT
CTGGCCGCGG GTTTCTCGGC CACTGTCTTC GCGCTGCCGG AATTTGCGGA AGGCGCCTCG
GCGCTGCGGT TAATCGCAAA CTACGCCAAT AACCCGTTCC TCATCCTCGC CGGCGGCGAT
TCTCGTCCAA CCCAAGATCT CCTGTCAGCC TGCCTGGCAT TGGAAGCCGG CCTGCCGCCT
CATCTGCATC TGGGCCGCTC TGCAAACTAT TGCATGGCAG GGCCTTTGGC CTTGGTGCAT
GAGGACGCAT GTTCCGTCTG CCATTGCGCT ACCCAAGTCA CGGCCGACGA CGGCTTACGC
GCGCCGCAGG CTACCGTCGA CAGCCCATTG GTCGCCGGCC TTGCCGTGTC GCAGATTGTC
CAAAAATGCC TTTCGAGACA CTCGCTCGCC CGGGGACGCC AATGGATATT GGACCTCAAA
GGCGACCAGG CCAAGCTGCG CTCTCTCCAA AGAACCCGAA TGTAA
 
Protein sequence
MVNYRSKRLS AAASVSVDFP NLVTRSLGRG GRGALFTGNS ADSVEEQSPH VTRPAFVVSL 
EAPDMPAKNE YDPAAVDHHD PFLAPLCYRQ RDIALFGDID FVIIGCGGLG SQIAIQLAAL
GARRFLLVDA DRIDENDLNH LPWACEADLG RLKTDRLATH LAAGFSATVF ALPEFAEGAS
ALRLIANYAN NPFLILAGGD SRPTQDLLSA CLALEAGLPP HLHLGRSANY CMAGPLALVH
EDACSVCHCA TQVTADDGLR APQATVDSPL VAGLAVSQIV QKCLSRHSLA RGRQWILDLK
GDQAKLRSLQ RTRM