Gene Rleg_1512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1512 
Symbol 
ID8012596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1494230 
End bp1495255 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID644824100 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_002975342 
Protein GI241204246 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0329401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTG CGACGTTAAA AGACTCCACG CGGGACGGCC GGCTCGTCGT CGTTTCCCGC 
GACCTGACCC GCTGTTCCGA GGTCGGCCAT ATCGCCCGCA CATTGCAGGC AGCACTCGAT
GACTGGGAGC ATGTGGCTCC GAGGCTCGCG CTGATTGCCG AAGGGGTCGA GACCGGAGCC
CAGCCGACGA TCCGGTTTCA CGAACATGAC GCTACGTCAC CTCTGCCGCG CGCCTACCAA
TGGGCCGACG GCTCCGCTTA CGTCAACCAC GTCGAACTGG TGCGCAAGGC ACGCGGCGCC
GAGATGCCGG CGAGCTTCTG GACCGATCCG CTGATCTATC AGGGCGGCTC GGACGCTTTC
CTTGCGCCGC GTGACCCGAT CGTGGCGGCC GACGAGGCTT ACGGGATCGA CATGGAGGGC
GAGGTCGCCG TCCTCACCGG CGACGTTGAC ATGGGTTGCA GCCCGGAAGC GGCGCGCGGC
GCCATCCGTC TCCTGATGCT CGTCAACGAT GTGTCGCTGC GCAGTCTGAT CCCTGGCGAG
TTGGCGAAGG GATTTGGTTT CTTCCAGTCG AAGCCGGCAT CGGCATTCTC TCCGGTCGCC
GTGACGCCGG ACGAGCTCGG AGATGCCTGG GACGGCGGCA AGCTGCATCT CCCGCTGCTG
GTGAGCTTGA ACGGCAAGCC ATTCGGCAAG GCCAACGCCG GCATCGACAT GACCTTCGAT
TTTGCCCAGT TGATCGCCCA TGCCGCCAAA ACCCGCAATC TCGCGGCCGG CACAATCATC
GGCTCGGGAA CGGTCTCCAA CAAGCTCAAC GGCGGCCCGG GCAAGCCGGT GGAAGAGGGA
GGAGACGGCT ACTCCTGCAT TGCCGAAATC AGGATGATCG AGACGATCGA GACCGGATCG
CCGAAGACCC CTTTCATGAA GTTCGGCGAT CAGGTCCGCA TCGAGATGAA GGATAAGGCC
GGCCATTCGA TCTTCGGGGC AATCGAGCAG ACGGTCGAGA AATACCAAGG GGCCGGACCG
CAATGA
 
Protein sequence
MKLATLKDST RDGRLVVVSR DLTRCSEVGH IARTLQAALD DWEHVAPRLA LIAEGVETGA 
QPTIRFHEHD ATSPLPRAYQ WADGSAYVNH VELVRKARGA EMPASFWTDP LIYQGGSDAF
LAPRDPIVAA DEAYGIDMEG EVAVLTGDVD MGCSPEAARG AIRLLMLVND VSLRSLIPGE
LAKGFGFFQS KPASAFSPVA VTPDELGDAW DGGKLHLPLL VSLNGKPFGK ANAGIDMTFD
FAQLIAHAAK TRNLAAGTII GSGTVSNKLN GGPGKPVEEG GDGYSCIAEI RMIETIETGS
PKTPFMKFGD QVRIEMKDKA GHSIFGAIEQ TVEKYQGAGP Q