Gene Rleg_3994 details


Gene Information

Locus tag: Rleg_3994
Symbol:
ID: 8014804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4071145
End bp: 4072587
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 61%
IMG OID: 644826563
Product: UbiD family decarboxylase
Protein accession: YP_002977774
Protein GI: 241206678
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
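
The coordinate and length fields in the record above can be cross-checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4071145
end_bp = 4072587
gene_length_bp = 1443
protein_length_aa = 480

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one 3 bp stop codon,
# which is not represented in the protein length.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the 1443 bp span corresponds to 480 codons plus a stop codon, matching the reported protein length.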


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.247967
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0822542
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGATC ACCCTGATCC GCTTGAAAAC CCGGCCATGA CATCCGGGAC GGCCGGCGCT 
GCAATGCCAC GCAGCGAAAG CGATCAGTCG TGTCGGGGCA TTCTTCAGGA GCTCGACCGT
CGCGGTCAGC TGCTGGTGAT TTCGGAGGAG GTCGACCCGA TCCACGACGT ATCGGCCATT
CTTTCCATCG TCGATGAAAA GGCGGCGGTG CGGCTCGACT GCATCAAAGG TCACGACATG
CCGATCTTCG GCAACATCCT GTCGGATCTC GATCGTGTCG CCCTGGCGCT TGACGTCGCG
AAAAGCGATA TCCAGGAAAA GCTGCTGTCC TCGATCGCCT CTCCGGTGCC GCCAGTCTTC
GTGGACGATG CGCCGGTACA GCAACAGCTC TTCCAGGACG ACATCCTGAC CCGGCTTCCA
GTCCCGACCT TCTTTTCCAA GGAGACCGGT CCTTACATTA CCGCGGGTCT GATCGTGGCG
CGCGACCCTG AAACGGGCTT GGGCAACGCG TCCTATGCCC GCATCAAGGT GCTTGGTCCG
AACGAGGCGA TGATCGGAAT TGCACCGAAC CATCATCTCG CCATCATGGC CCGCAAGGCG
GGCGCCAAAG GCGAGCCGCT CCCCTTCGCC GTCGTCCTCG GCGCGCATCC AGCCATTCAG
CTTGCCGCCT GCTTCTATCT GGGGCTTGGT GACGATGAAA TGCACTGCGC CGGATCGCTG
CTCGGCGAGC CGGTGCGGCT TGTCCGCTGC AAGTCGATCG ATCTTGCCGT GCCTGCAGAG
GCGGAGATCG TGCTCGAAGG TCATATTCAT ATCGACGAGC CGATCCTTGA AGGGTTGGTC
TCAGAATATC ACGGCATGTA CGAGGATTAC GGTTCCGGGG TGCGCGTCAG ATTTGAATGC
ATGACTTGCC GCTCCGACGC GATGTTGCAG GTGATCGAGC CCGGTTACCA CATGGAGCAC
CTCTATCTCG GCGCGGTGCC GATCGCCGCC AGCCTCAAGG CCGTCATCCG GCGGTCCGTT
CTCAATGTCG GCGAGGTTGC CGTCACCGCA TCAGGCAGCG GACGGAACAA CGTGGTCGTG
CAGATCGATG CGCCCCGCCC GGGACAGGCA CGCCGCATCA TGTCGATCTG CTGGGGTGCG
GTCAGCATCG TCAAGAACAT CACGATCGTC GACAGCGATG TCGACCCTTG GGATCTCGAT
GCCGTCGAAT TGGCAAAGAT GACGCGCATG CGCGCGGAGC GCGACATTCT CATCGTCACG
GACCTTCCGG CAGACCGGTC CGAGCCTCAG GAAGACGGCG GCGTGATCGC CAAGGTCGGC
TACGACGCCA CGATGAAGCC GGGCGATCGA AGGGAAGGGT TCGACAAGGC GCTTCCGCCG
CCGGATTCCT ACGAACGGAT GCGCAAGCTG CTGTTGCGCG TCAAGCCGGA GATGCTGATT
TGA
 
Protein sequence
MNDHPDPLEN PAMTSGTAGA AMPRSESDQS CRGILQELDR RGQLLVISEE VDPIHDVSAI 
LSIVDEKAAV RLDCIKGHDM PIFGNILSDL DRVALALDVA KSDIQEKLLS SIASPVPPVF
VDDAPVQQQL FQDDILTRLP VPTFFSKETG PYITAGLIVA RDPETGLGNA SYARIKVLGP
NEAMIGIAPN HHLAIMARKA GAKGEPLPFA VVLGAHPAIQ LAACFYLGLG DDEMHCAGSL
LGEPVRLVRC KSIDLAVPAE AEIVLEGHIH IDEPILEGLV SEYHGMYEDY GSGVRVRFEC
MTCRSDAMLQ VIEPGYHMEH LYLGAVPIAA SLKAVIRRSV LNVGEVAVTA SGSGRNNVVV
QIDAPRPGQA RRIMSICWGA VSIVKNITIV DSDVDPWDLD AVELAKMTRM RAERDILIVT
DLPADRSEPQ EDGGVIAKVG YDATMKPGDR REGFDKALPP PDSYERMRKL LLRVKPEMLI
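
The relationship between the gene and protein sequences above can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Codon assignments in TCAG order (first base varies slowest),
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAATGATC ACCCTGATCC GCTTGAAAAC"
print(translate(first_block))  # MNDHPDPLEN
```

The result matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` should, under the same assumptions, yield the full protein listing.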