Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2219 |
Symbol | |
ID | 8013228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2222798 |
End bp | 2223859 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824805 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002976035 |
Protein GI | 241204939 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0289807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.425963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGATA CCATTGACGA TCTGCGTATC GTCGAAATTA CCCCCCTGAC CAAGCCTGCC GATATCATCG CGGAAATTTC CCGCAATGCG GATGTCAGCA AGACCGTGAC CATCAATCGT GAGGCGATCC GTAAAATTCT CCAGGGCGAA GACGATCGGC TGATCGTCGT CATCGGCCCC TGCTCGATCC ATGATCCCGT TGCCGCCAGG GACTATGCCG CTCGGCTCAC GGAACAGCGG CAGCGCTTCG CCGGCGATCT TGAGATCGTC ATGCGCGTCT ATTTCGAAAA GCCCCGCACC ACCGTCGGCT GGAAGGGTCT GATGAACGAC CCGCATCTCG ACGGCAGCTA CCGCATCGAG GAGGGGCTGA GGATTGCCAG ACGCCTGCTG CTCGATATCA ATGCGATGGG TCTTCCGGCC GGCGTCGAAT TCCTAGACAC GATCACGCCG CAATACATTG CCGATCTCGT CAGTTGGGGA GCGATCGGCG CACGCACCAC CGAAAGCCAG GTCCATCGTC AGCTTGCCTC CGGCCTTTCC TGCCCGATCG GCTTCAAGAA CGGCACCGAC GGCGGCGTGC GTGTGGCGCT GGACGCCATC CTCGCCGCCT CGCAGCCGCA TCACTTCCCC GCCGTGACCA AGGACGGACA GGCGGCCATC GCCTCGACGA CGGGTAATGA GGACTGCCAC ATCATCCTAC GCGGCGGCAA GCGGCCGAAC TATGAAGCGG CCGACGTCGA AGCGGTCACC GGCGAAGCCC TCAAGCTCGG CGTCGCCCGG CGCATCCTCA TCGATGCCAG CCACGCCAAC AGCGGCAAGG ATCCGATGAA CCAGCCGCTC GTCGTCAAAT CGGTGGCCGC ACAGATTGCC GCCGGGAACA ACGACATCAA GGGCATGATG ATCGAAAGCA ACCTCGTCGC CGGCCGCCAG GATCTCGTTC CCGGCAAGCC GCTGGTCTAT GGCCAGTCTA TCACCGACGG CTGCATCGAC TGGGCGATGT CGGTCGCGGT GCTGGAAGAC CTGGCAAAAT CCGCCCGCGA GCGCCGCCAG ACCCGCGCTT AA
|
Protein sequence | MSDTIDDLRI VEITPLTKPA DIIAEISRNA DVSKTVTINR EAIRKILQGE DDRLIVVIGP CSIHDPVAAR DYAARLTEQR QRFAGDLEIV MRVYFEKPRT TVGWKGLMND PHLDGSYRIE EGLRIARRLL LDINAMGLPA GVEFLDTITP QYIADLVSWG AIGARTTESQ VHRQLASGLS CPIGFKNGTD GGVRVALDAI LAASQPHHFP AVTKDGQAAI ASTTGNEDCH IILRGGKRPN YEAADVEAVT GEALKLGVAR RILIDASHAN SGKDPMNQPL VVKSVAAQIA AGNNDIKGMM IESNLVAGRQ DLVPGKPLVY GQSITDGCID WAMSVAVLED LAKSARERRQ TRA
|
| |