Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2226 |
Symbol | |
ID | 8013233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2230530 |
End bp | 2231903 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824812 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002976042 |
Protein GI | 241204946 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.435379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.755541 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGACA ATTGGACCCC GAGCAGCTGG CGGCAAAAAC CGATCCTGCA GGTTCCCGAA TATCCCGACG CAGCCGCTTT GGCGGCAACG GAAGCCACGC TCGCCAGCTA TCCGCCGCTT GTCTTTGCAG GTGAGGCGCG CCGCCTGAAG AAACATCTCG CCAATGTCGC CGAAGGCAAC GGCTTCCTGC TGCAGGGCGG CGACTGCGCC GAGAGCTTCG CCGAACACGG TGCCGACAAT ATCCGCGACT TCTTCCGCGC CTTCCTGCAG ATGGCCGTGG TGCTGACCTT CGGGGCTCAG CTGCCGGTCG TCAAGGTCGG CCGCATCGCC GGCCAGTTCG CCAAGCCGCG GTCGTCGAAT GTCGAGAAGC AGGGCGATGT GACGCTGCCG GCCTATCGCG GCGACATCAT CAACGGTATC GAGTTCACCG AGGAATCGCG CATTCCGAAC CCGGAACGCC AGGCCATGGC CTATCGCCAG TCGGCTGCGA CGCTGAACCT ATTGCGTGCC TTTGCGATGG GCGGTTATGC CAATCTCGAA AACGTGCATC AGTGGATGCT CGGCTTCGTC AAGGACAGCC CGCAGGGCGA GCGCTACCGC AAGCTTGCCG ACCGCATCAG CGAAACCATG GATTTCATGA AGGCGATCGG TATCACCTCG GAAAACCAGC CGGCGCTGCG CGAGACCGAT TTCTTCACCA GTCACGAGGC GCTGCTGCTC GGCTACGAGG AAGCGCTGAC CCGCGTCGAT TCCACCTCGG GCGACTGGTA TGCAACCTCT GGCCACATGA TCTGGATCGG CGACCGCACG CGCCAGGCCG ACCATGCGCA TGTCGAATAT TGCCGCGGCA TCAAGAACCC GATCGGCCTG AAATGCGGCC CTTCGCTGCA GGCCGACGAC CTGCTGCAAC TGATTGACAT CCTGAATCCT GCCAACGAGG CCGGGCGCCT GACGCTGATC TGCCGCTTCG GCCATGAGAA GGTCGCCGAC AGCCTGCCGA AACTCATTCG CGCCGTTGAG CGCGAGGGCC GCAAGGTCGT CTGGTCCTGC GATCCGATGC ACGGCAACAC GATCACGCTC AATAACTACA AGACCCGTCC CTTCGAGCGG ATCTTGTCGG AAGTCGAAAG CTTCTTCCAG ATCCACCGCG CCGAAGGCTC GCATCCGGGT GGCATCCATA TCGAGATGAC TGGCAAGGAC GTGACCGAGT GCACCGGCGG CGCCCGCGCG GTCTCCGCCG AAGACCTGCA GGACCGCTAT CACACCCATT GCGATCCGCG CCTCAACGCC GACCAGGCGC TCGAGCTGGC CTTCCTGCTT GCCGAGCGCA TGAAGGGTGG TCGCGACGAA AAGCGCATGG TCGCCAACGG CTGA
|
Protein sequence | MADNWTPSSW RQKPILQVPE YPDAAALAAT EATLASYPPL VFAGEARRLK KHLANVAEGN GFLLQGGDCA ESFAEHGADN IRDFFRAFLQ MAVVLTFGAQ LPVVKVGRIA GQFAKPRSSN VEKQGDVTLP AYRGDIINGI EFTEESRIPN PERQAMAYRQ SAATLNLLRA FAMGGYANLE NVHQWMLGFV KDSPQGERYR KLADRISETM DFMKAIGITS ENQPALRETD FFTSHEALLL GYEEALTRVD STSGDWYATS GHMIWIGDRT RQADHAHVEY CRGIKNPIGL KCGPSLQADD LLQLIDILNP ANEAGRLTLI CRFGHEKVAD SLPKLIRAVE REGRKVVWSC DPMHGNTITL NNYKTRPFER ILSEVESFFQ IHRAEGSHPG GIHIEMTGKD VTECTGGARA VSAEDLQDRY HTHCDPRLNA DQALELAFLL AERMKGGRDE KRMVANG
|
| |