Gene Rleg2_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1408 
Symbol 
ID6980136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1428762 
End bp1429772 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID643396129 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_002280928 
Protein GI209549011 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.548805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTG CGACCTTGAA GGACTCCACC CGGGACGGCC GCCTCGTCGT CGTTTCCCGC 
GACCTGACCC GCTGCTCGGA AGTCGGCCAT ATCACCCGCA CCCTGCAGGC AGCGCTCGAT
GACTGGGAGC ATGTGGCGCC GAGACTTCAG CTGATTGCCG AAGGCATCGA GACCGGAGCC
CAGCCGACGC TTCGCTTTCA CGAGCATGAC GCTGCGTCAC CTTTGCCGCG GGCTTATCAA
TGGGCCGACG GTTCCGCTTA CGTCAACCAT GTCGAACTGG TGCGCAAGGC CCGCGGCGCC
GAAATGCCGG CGAGCTTCTG GACCGATCCA CTGATGTATC AGGGTGGCTC GGATGCTTTC
CTCGCGCCAC GCGATCCGAT CCTGGTGGCC GACGAGGCCT ATGGGATCGA CATGGAGGGC
GAGGTCGCCG TCATCACCGG CGACGTTGCC ATGGGGGCGG ACCCGGAAGC GGCGAGCGGC
GCCATCCGGC TGCTGATGCT GGTCAACGAC GTATCGCTGC GCGGCCTGAT CCCTGACGAG
CTGGCCAAAG GGTTCGGTTT CTTCCAGTCC AAGCCGGCCT CGGCATTTTC GCCCGTGGCG
GTGACGCCGG ACGAGCTGGG GGAGGCGTGG GATGGCCGCA AACTGCATCT GCCGCTGCTT
GTCAGCCTGA ACGGCAGGCC GTTCGGCAAG GCCAATGCCG GCATCGATAT GACGTTCGAT
TTCGGCCAGT TGATCGCCCA TGCCGCCAAA ACCCGCAGTC TCGCGGCCGG AACCATCATC
GGCTCGGGAA CGGTTTCCAA CAAGCTGGAC GGCGGAGCGG GCAAGCCGGT CGAAACGGGA
GGGGACGGCT ACTCCTGCAT CGCCGAAATC CGGATGATCG AGACGATCGA AACCGGCGCG
CCGAAAACGC CGTTCATGCA GTTCGGCGAT CAGGTCCGCA TCGAGATGAA GGACCGTGCC
GGCCATTCGA TCTTTGGGGC GATCGAGCAG ACCGTCGAAC GTTATGGATG A
 
Protein sequence
MKLATLKDST RDGRLVVVSR DLTRCSEVGH ITRTLQAALD DWEHVAPRLQ LIAEGIETGA 
QPTLRFHEHD AASPLPRAYQ WADGSAYVNH VELVRKARGA EMPASFWTDP LMYQGGSDAF
LAPRDPILVA DEAYGIDMEG EVAVITGDVA MGADPEAASG AIRLLMLVND VSLRGLIPDE
LAKGFGFFQS KPASAFSPVA VTPDELGEAW DGRKLHLPLL VSLNGRPFGK ANAGIDMTFD
FGQLIAHAAK TRSLAAGTII GSGTVSNKLD GGAGKPVETG GDGYSCIAEI RMIETIETGA
PKTPFMQFGD QVRIEMKDRA GHSIFGAIEQ TVERYG