Gene Information |
Locus tag | Rleg2_2007 |
Symbol | |
ID | 6980746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2066616 |
End bp | 2067989 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396729 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002281517 |
Protein GI | 209549600 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
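The length fields above follow from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are hypothetical, chosen only for illustration):

```python
# Hypothetical helpers; the names are illustrative and not part of any database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2066616, 2067989)
print(length_bp, protein_length(length_bp))  # prints: 1374 457
```

Both values agree with the Gene Length and Protein Length rows of this record.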
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGA ATTGGACCCC GAGCAGCTGG CGGCAAAAGC CCATCCTGCA GGTTCCCGAA TATCCGGATG CGGCCGCATT GGCAGCAACG GAGGCCACGC TCGCCAGCTA TCCGCCGCTC GTTTTTGCCG GCGAGGCGCG CCGGCTGAAG AAGCATCTCG CCAACGTCGC CGAAGGCAAC GGTTTCCTGC TGCAGGGCGG CGACTGCGCC GAGAGCTTCG CCGAACATGG CGCCGATAAT ATTCGCGACT TCTTCCGCGC CTTCCTGCAG ATGGCCGTCG TGCTGACCTT CGGCGCACAG CTGCCGGTCG TCAAGGTCGG CCGCATTGCC GGCCAGTTCG CCAAGCCGCG TTCATCGAAT GTCGAAAAGC AGGGCGACGT GACACTGCCG GCCTATCGCG GCGACATCAT CAACGGCATC GAGTTCACCG AGGAGTCGCG CATTCCGAAC CCGGAACGCC AGGCGATGGC CTATCGCCAG TCGGCCGCGA CGCTGAACCT TCTGCGCGCC TTCGCGATGG GCGGCTACGC CAACCTCGAA AACGTGCATC AGTGGATGCT CGGCTTCGTC AAGGACAGCC CGCAGGGCGA GCGTTACCGC AAGCTTGCCG ACCGCATCAG CGAAACCATG GATTTCATGA AGGCGATCGG CATCACCTCG GAAAACCACC CGAGCCTGCG CGAGACCGAT TTCTTCACCA GCCATGAGGC GCTTCTGCTC GGCTACGAGG AGGCGCTGAC CCGCGTCGAT TCCACGTCGG GCGACTGGTA TGCCACATCG GGCCATATGA TCTGGATCGG CGACCGTACG CGCCAGGCCG ACCATGCGCA TATTGAATAT TGTCGCGGCA TCAAGAACCC GATCGGCCTC AAGTGCGGCC CATCGCTGCA GGCCGACGAT CTGCTGCAGC TGATCGACAT CCTGAACCCG GCGAACGAAG CCGGGCGCCT GACGCTGATC TGCCGTTTCG GCCATGAGAA GGTCGCCGAA AACCTGCCGC GCCTCATCCG CGCCGTCGAG CGCGAGGGTC GCAAGGTCGT CTGGTCCTGC GACCCGATGC ACGGCAACAC CATCACGCTC AACAACTACA AGACCCGGCC TTTCGAGCGG ATCCTGTCGG AAGTCGAAAG CTTCTTCCAG ATCCACCGCG CCGAAGGCAC GCATCCCGGC GGCATCCATG TCGAAATGAC CGGCAAGGAT GTGACGGAAT GCACCGGCGG CGCCCGTGCC GTCACCGCCG ACGATCTGCA GGACCGCTAC CACACTCATT GCGATCCGCG CCTCAACTCC GACCAGGCGC TCGAGCTTGC CTTCCTGCTT GCCGAGCGCA TGAAGGGCGG ACGCGACGAG AAGCGCATGG TCGCCCACGG CTGA
|
Protein sequence | MAENWTPSSW RQKPILQVPE YPDAAALAAT EATLASYPPL VFAGEARRLK KHLANVAEGN GFLLQGGDCA ESFAEHGADN IRDFFRAFLQ MAVVLTFGAQ LPVVKVGRIA GQFAKPRSSN VEKQGDVTLP AYRGDIINGI EFTEESRIPN PERQAMAYRQ SAATLNLLRA FAMGGYANLE NVHQWMLGFV KDSPQGERYR KLADRISETM DFMKAIGITS ENHPSLRETD FFTSHEALLL GYEEALTRVD STSGDWYATS GHMIWIGDRT RQADHAHIEY CRGIKNPIGL KCGPSLQADD LLQLIDILNP ANEAGRLTLI CRFGHEKVAE NLPRLIRAVE REGRKVVWSC DPMHGNTITL NNYKTRPFER ILSEVESFFQ IHRAEGTHPG GIHVEMTGKD VTECTGGARA VTADDLQDRY HTHCDPRLNS DQALELAFLL AERMKGGRDE KRMVAHG
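The relationship between the two sequences above can be checked with a short script. This is a sketch under stated assumptions: the gene sequence is given 5' to 3' on the coding strand, starts with ATG, and the codon assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which does not matter for an ATG start). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table in the usual T, C, A, G ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten; uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 24 bases of the gene sequence in this record.
print(translate("ATGGCAGAGA ATTGGACCCC GAGC"))  # prints: MAENWTPS
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_content` on the same input can be compared with the GC content field reported in the record.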