Gene Information |
Locus tag | Rleg2_2000 |
Symbol | |
ID | 6980739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2058669 |
End bp | 2059736 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396722 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002281510 |
Protein GI | 209549593 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
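The coordinate and length fields in this table can be cross-checked against one another. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; both assumptions are consistent with the values listed, but the record itself does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2058669
END_BP = 2059736


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```

Both results match the Gene Length (1068 bp) and Protein Length (355 aa) fields.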
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.331476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCGATA CCATTGACGA TCTCCGTATC CTCGAGATCA CCCCGCTGAC CAAACCCGCC GATATCATTG CGGAAATCCA CCGCGACGCC GTGGTCACCG AGACGGTGAC CAGCAATCGT GATGCGATCC ATAAGATTCT CCAGGGCGAA GACGATCGCC TGGTTGTCGT CATCGGCCCC TGCTCCATCC ATGATCCGAT TGCCGCCCGG GAATATGCCG CACGGCTCAA GGAGCAGCGG CGGCGCTTCT CCGACGATCT CGAAATCGTC ATGCGCGTCT ATTTCGAAAA GCCCCGCACC ACGGTCGGCT GGAAGGGCCT GATGAACGAT CCGCATCTCG ATGGCAGCTA CCGCATCGAG GAAGGGCTGC GCATCGCCCG ACGCCTGCTG CTCGATATCA ATGCCATGGA GCTTCCCGCC GGCGTCGAGT TCCTCGACAC GATCACGCCG CAATACATCG CCGATCTCGT CAGCTGGGGC GCGATCGGCG CGCGCACCAC CGAAAGCCAG ATCCATCGCC AGCTCGCCTC CGGCCTCTCC TGCCCGATCG GCTTCAAGAA CGGCACTGAC GGCGGCGTCC GTGTCGCGCT CGACGCCATC CTGGCCGCCT CGCAGCCGCA TCATTTCCCC GCCGTCACCA AGGACGGACA GGCGGCCATC GCTTCGACGA GGGGCAATGA GGACTGCCAC ATCATCCTGC GCGGCGGCAA ACAGCCGAAC TATGAAGCGA CCGACGTCGA AGCTGTGGTC GGCGAAGCCG TCAAGCTTGG CGTAACCCCG CGCATCCTGA TCGATGCCAG CCATGCCAAC AGCAGCAAGG ATCCGATGAA CCAGCCGCGC GTCGTCAAAT CCGTGGCCGC GCAGATCGCC GCCGGAAATC GCCATATCAA GGGCATGATG ATCGAGAGCA ATCTCGTCGC CGGCCGCCAG GATCTCGTGC CCGGCAAGCC GCTGGTTTAC GGCCAGTCCA TCACCGACGG CTGCATCGAC TGGGACATGT CGGTGGCGAC CCTGGAAGAC CTGGCGCAAT CCGCCCGCGC CCGGCGCAAG GCCGCCATCG CAGCCTGA
|
Protein sequence | MSDTIDDLRI LEITPLTKPA DIIAEIHRDA VVTETVTSNR DAIHKILQGE DDRLVVVIGP CSIHDPIAAR EYAARLKEQR RRFSDDLEIV MRVYFEKPRT TVGWKGLMND PHLDGSYRIE EGLRIARRLL LDINAMELPA GVEFLDTITP QYIADLVSWG AIGARTTESQ IHRQLASGLS CPIGFKNGTD GGVRVALDAI LAASQPHHFP AVTKDGQAAI ASTRGNEDCH IILRGGKQPN YEATDVEAVV GEAVKLGVTP RILIDASHAN SSKDPMNQPR VVKSVAAQIA AGNRHIKGMM IESNLVAGRQ DLVPGKPLVY GQSITDGCID WDMSVATLED LAQSARARRK AAIAA
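The gene sequence begins with TTG while the protein sequence begins with M, which is what translation table 11 predicts for an alternative initiation codon. The following is a minimal sketch of how the two sequence fields and the GC content field relate. It assumes the sequences are pasted in as strings with the 10-base spacing shown above, uses the standard amino acid assignments (which table 11 shares), and treats the listed codons as table 11 initiators. The helper names are hypothetical and it is not a substitute for an established library such as Biopython.

```python
from itertools import product

# Codon order follows the usual NCBI layout: bases T, C, A, G,
# with the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons of translation table 11; at the first position
# they are read as methionine.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate_cds(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the gene sequence above.
print(translate_cds("TTGTCCGATA CCATTGACGA TCTC"))  # MSDTIDDL
```

Passing the full Gene sequence field to `translate_cds` should reproduce the Protein sequence field, and `gc_fraction` on the same string can be compared with the 64% reported under GC content.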
|
| |