Gene Information |
Locus tag | Rleg2_4841 |
Symbol | |
ID | 6977935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 483601 |
End bp | 484479 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643394002 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_002278820 |
Protein GI | 209546902 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
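
The length fields above follow from the coordinates. The sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(483601, 484479)
print(length_bp)                  # 879
print(protein_length(length_bp))  # 292
```

Both results agree with the Gene Length and Protein Length rows of the record.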
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00571792 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACTTT CCGGCGTCAT GCCCGCTCTC ATCACCCCCT TCGATGCGAA CAACAGGATC GACTTCAAGG CGTTCGAAAA GCTTCTGACG CATCTGCGTG AGGCAGGCGT CACCGGCTGG GTTCCAAACG GCTCGACCGG CGAGTATTTC AGCCAATCCA GGGAAGAGCG CCGTGACGTG CTGCAGTTCG TCAAGGAGTT TGCCAGGCCG GGCGAAATTC TGATCGCCGG CACCAATGCG CCGGCCACGC GCGAGGTGAT CGAGCAAACG GCGCTGGCGA GAGATATCGG CTATGACACA GTCCTGCTGG CGCCGCCATT CTACACCCGT CCGACCCAGG CGGAACTGAT CAAGCATTAT GAAGCCGTGC TCAGCACCGT CGATGTGAGC CTCGTGCTCT ATTCCTATCC GGCAAAGGAT GGTTCGGACA TCAGCTTCGA GCTGATGGAT CATTTTGCCG ATAATCCGCG GGTGATCGGC ATCAAGGAAA GCTCGGGCGT GTTGCAGCGC GCGATCGACA TCGCCAGCCG CTATGAGGGC AAGATCCAGC TCGTCAGCGG CTCCGACGAT ATCGCGCTCG ATTTCATGTT CTGGGGCGCG GAAAGCTGGA TCTGCGGTCC GTCCAACTGC ATGGCCAAGG CCTGCTGCGA TCTCGACCGA ACCTACAAGT CGGGAGACCT CGGCAAGGCC CGCGAGATGA TGAAAACCCT CTACCGGGCG ATGAACATCC TCGAGAGCGG CAAGTTCGTG CAGAAGATCA AGTATGGCTG CGAGCTTCAG GGCTTGCCTG TGGGGACGTG CCGCGCGCCG CTCGGCGAGC TGACCTCCGA AGAAAAGGCC GAGTTCAGGG CGGCGATGGA GCCGATCCTC AACTGGTAG
|
Protein sequence | MKLSGVMPAL ITPFDANNRI DFKAFEKLLT HLREAGVTGW VPNGSTGEYF SQSREERRDV LQFVKEFARP GEILIAGTNA PATREVIEQT ALARDIGYDT VLLAPPFYTR PTQAELIKHY EAVLSTVDVS LVLYSYPAKD GSDISFELMD HFADNPRVIG IKESSGVLQR AIDIASRYEG KIQLVSGSDD IALDFMFWGA ESWICGPSNC MAKACCDLDR TYKSGDLGKA REMMKTLYRA MNILESGKFV QKIKYGCELQ GLPVGTCRAP LGELTSEEKA EFRAAMEPIL NW
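
Both sequences are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, using only the first few blocks of each sequence as sample input. The helper names are hypothetical, and the GC value of a 30-base sample is not expected to match the 60% reported for the whole gene.

```python
def strip_grouping(seq: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an ungapped DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence (a sample, not the full gene).
sample_dna = strip_grouping("ATGAAACTTT CCGGCGTCAT GCCCGCTCTC")
# First two 10-residue blocks of the protein sequence.
sample_protein = strip_grouping("MKLSGVMPAL ITPFDANNRI")

print(len(sample_dna), len(sample_protein))  # 30 20
print(sample_dna.startswith("ATG"))          # True: the sample begins with a start codon
print(round(gc_fraction(sample_dna), 3))     # 0.567
```

Applied to the full gene sequence, `strip_grouping` should yield a string of the length given in the Gene Length row, which is a quick check that nothing was lost when copying the record.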
|
| |