Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1408 |
Symbol | |
ID | 6980136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1428762 |
End bp | 1429772 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396129 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_002280928 |
Protein GI | 209549011 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.548805 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTG CGACCTTGAA GGACTCCACC CGGGACGGCC GCCTCGTCGT CGTTTCCCGC GACCTGACCC GCTGCTCGGA AGTCGGCCAT ATCACCCGCA CCCTGCAGGC AGCGCTCGAT GACTGGGAGC ATGTGGCGCC GAGACTTCAG CTGATTGCCG AAGGCATCGA GACCGGAGCC CAGCCGACGC TTCGCTTTCA CGAGCATGAC GCTGCGTCAC CTTTGCCGCG GGCTTATCAA TGGGCCGACG GTTCCGCTTA CGTCAACCAT GTCGAACTGG TGCGCAAGGC CCGCGGCGCC GAAATGCCGG CGAGCTTCTG GACCGATCCA CTGATGTATC AGGGTGGCTC GGATGCTTTC CTCGCGCCAC GCGATCCGAT CCTGGTGGCC GACGAGGCCT ATGGGATCGA CATGGAGGGC GAGGTCGCCG TCATCACCGG CGACGTTGCC ATGGGGGCGG ACCCGGAAGC GGCGAGCGGC GCCATCCGGC TGCTGATGCT GGTCAACGAC GTATCGCTGC GCGGCCTGAT CCCTGACGAG CTGGCCAAAG GGTTCGGTTT CTTCCAGTCC AAGCCGGCCT CGGCATTTTC GCCCGTGGCG GTGACGCCGG ACGAGCTGGG GGAGGCGTGG GATGGCCGCA AACTGCATCT GCCGCTGCTT GTCAGCCTGA ACGGCAGGCC GTTCGGCAAG GCCAATGCCG GCATCGATAT GACGTTCGAT TTCGGCCAGT TGATCGCCCA TGCCGCCAAA ACCCGCAGTC TCGCGGCCGG AACCATCATC GGCTCGGGAA CGGTTTCCAA CAAGCTGGAC GGCGGAGCGG GCAAGCCGGT CGAAACGGGA GGGGACGGCT ACTCCTGCAT CGCCGAAATC CGGATGATCG AGACGATCGA AACCGGCGCG CCGAAAACGC CGTTCATGCA GTTCGGCGAT CAGGTCCGCA TCGAGATGAA GGACCGTGCC GGCCATTCGA TCTTTGGGGC GATCGAGCAG ACCGTCGAAC GTTATGGATG A
|
Protein sequence | MKLATLKDST RDGRLVVVSR DLTRCSEVGH ITRTLQAALD DWEHVAPRLQ LIAEGIETGA QPTLRFHEHD AASPLPRAYQ WADGSAYVNH VELVRKARGA EMPASFWTDP LMYQGGSDAF LAPRDPILVA DEAYGIDMEG EVAVITGDVA MGADPEAASG AIRLLMLVND VSLRGLIPDE LAKGFGFFQS KPASAFSPVA VTPDELGEAW DGRKLHLPLL VSLNGRPFGK ANAGIDMTFD FGQLIAHAAK TRSLAAGTII GSGTVSNKLD GGAGKPVETG GDGYSCIAEI RMIETIETGA PKTPFMQFGD QVRIEMKDRA GHSIFGAIEQ TVERYG
|
| |