Gene Information |
Locus tag | Rleg2_5443 |
Symbol | |
ID | 6978537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1086356 |
End bp | 1087309 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643394544 |
Product | dihydrodipicolinate synthetase |
Protein accession | YP_002279362 |
Protein GI | 209547444 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.344282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00661581 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCATCG CGATCATGCG CAAGGCGCTC ACGGGCGTTT CGGGTGTTCC CGTCACGGCC TATGACGGCA AAGGCGAAGT CGAACCGCGG ATTACGGCCA AGGTCTATGC GCGGGTGGCG GCCGCCCGCA TTCACAACAT TGTCGCTGCC GGCAATACCG GGGAATTCTA CGCGCTGACG CCGCAGGAAA TCCGGATCGT CCACGAAGCC GCGGTATCAG GCGTCGACGG CCGCGCGCCG GTGACGGCGG CGATCGGCCG GTCGCTACGC GAGGCGATTG GCATGGCGCG GGATGCGGCC GCGATCGGCG CAACCGCCGT CATGTCGCAT CAGCCCGTCG ATCCTTTCGC AGCACCTTCG GCCCAGATCG GCTATTTCTG CAATCTTGCC GATGCGTCCA CCCTGCCGCT CGTCGCCTAT GTCAGGGCCG AGGGTTTCGG TGTCGACGAT ATCGTCCGCC TCGCCAACCA CGGCAACATC GCCGGCATCA AGTTTGCCAC GACTGATCTG ATGCTCTTGT CGCGCGCGAT CCCGGCGGCC GATCCCGATG GTGCGCTGTT CGTCTGCGGC CTGGCGGAGA GCTGGGCGCC GACATTCACC GCAGCCGGGG CGCGCGGCTT CACGTCGGGC CTCGTCAACG TTGCGCCGCA GCTTTCGCTT GCCGTCCACG CCGCGCTCGA AAAAGGCGAC TTTGCCGCTG CACGGGCGAT CGTCAACACG CTCGAGCCGT TCGAGCGGAT GCGAACCAAA TTCCGCAACG GCGCCAACGT GACGGTCGTG AAAGAGGCCG TCACCTATTC CGGCCTCGAT GTCGGCCCCG TGCGCGTGCC GGGGTTGCCG CTGCTCGACC AGCATGATCG CGAGGAACTT CATCGGCTGC TTCGAGGCTG GGAGGCCGAG GGCAGCATTC AAACTGATCC GGACCGGCAG CAGTCCGCCA AGGCGACCGG CTGA
|
Protein sequence | MSIAIMRKAL TGVSGVPVTA YDGKGEVEPR ITAKVYARVA AARIHNIVAA GNTGEFYALT PQEIRIVHEA AVSGVDGRAP VTAAIGRSLR EAIGMARDAA AIGATAVMSH QPVDPFAAPS AQIGYFCNLA DASTLPLVAY VRAEGFGVDD IVRLANHGNI AGIKFATTDL MLLSRAIPAA DPDGALFVCG LAESWAPTFT AAGARGFTSG LVNVAPQLSL AVHAALEKGD FAAARAIVNT LEPFERMRTK FRNGANVTVV KEAVTYSGLD VGPVRVPGLP LLDQHDREEL HRLLRGWEAE GSIQTDPDRQ QSAKATG
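The coordinate and length fields in this record can be cross-checked against each other with a few lines of arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (gene_length, protein_length, join_blocks, gc_fraction) are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1086356
END_BP = 1087309


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def join_blocks(blocked: str) -> str:
    """Remove the display spacing from a sequence shown in 10-character blocks."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
```

Under these assumptions the coordinates reproduce the listed Gene Length (954 bp) and Protein Length (317 aa). Passing the Gene sequence field through join_blocks and then gc_fraction would give a value to compare with the listed GC content.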