Gene Information |
Locus tag | Rleg2_1545 |
Symbol | |
ID | 6980276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1569994 |
End bp | 1571319 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396265 |
Product | homoserine dehydrogenase |
Protein accession | YP_002281061 |
Protein GI | 209549144 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
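The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1569994, 1571319)
print(length, protein_length(length))  # prints: 1326 441
```

Both results agree with the Gene Length (1326 bp) and Protein Length (441 aa) fields.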
Plasmid Coverage Information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.927693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGATG CCCTCAAAAT CGGCATTGCG GGCTTGGGCA CCGTTGGCGC CTCGCTAGTC CGCATCATTC AGCAGAAGAG CAACGAGCTT GCCGTCACCT GCGGGCGTCC GATCACCATC ACGGCGGTCT CCGCGCGTGA CAAAACGAGA GACCGCGGTA TCGATCTTTC CGCTGTCACC TGGTTCGATC GGCCGGAAGA TCTTGCCGAA AAGGGCGATA TCGACGTCTT CGTCGAGCTG ATGGGCGGCG CCGAAGGAGC TGCCAACACC TCCGTACGCG CCGCACTTAA GCGTGGTCTC CACGTGGTGA CAGCCAACAA GGCGCTGCTT GCCTATCACG GCGTCGAGCT TGCGACGATT GCCGAGGAGA AGGGCGCGCT TCTAAACTTC GAGGCCGCGG TGGCCGGCGG CATCCCGGTC ATCAAGGCGC TGCGCGAATC GCTGACCGGC AATGCCGTCT CGCGCATCTA TGGCATCATG AACGGCACCT GCAATTACAT CCTGACCAAG ATGGAAAAGG AGGGGCTTTC CTTCGCCGAG TGCCTGAAGG AAGCCCAGCG GCTGGGTTAT GCCGAGGCCG ATCCGGCCTT CGACATCGAG GGCAACGACA CCGCCCATAA GCTTTCCATC CTGACGACGC TCGCCTTCGG CAATCGCATC GCGGCCGACG ATATCTATCT CGAAGGCATC ACCAACATCT CGATCGAGGA TATCCACGCC GCCGCCGAGC TCGGTTATCG TATCAAGCTC CTGGGCGTTG CCCAGCGCAC CGACACCGGC ATCGAGCAGC GCGTGCATCC GACCATGGTG CCGGTCGATT CGGTCATTGC CCAGGTCGAC GGCGTTACCA ATGCGGTGGC GATCGAATCC GACGTGCTCG GCGAACTGCT GATGGTCGGT CCCGGCGCCG GCGGCAATTC GACGGCCTCG TCCGTACTGG GCGATATCGC CGATATCGCC AAAAGCCAGC CGGGCGCACA ACGCGTGCCG GTGCTCGGCC ATCCCGCAAA AGCGCTGGAA CCCTACCGCA AGGCGCAGAT GCAGAGCCAC GAGGGCGGCT ACTTTATCCG CCTGACCGTG CTCGACCGCA CGGGCGTCTT TGCCAGCGTT GCAACCCGCA TGGCGGAAAA CAACATCTCG TTGGAATCGA TCGTCCAGCG CTCCAAGCAA CATCTGGCGC CGTCGCACCA CCAGACGATC ATTCTCGTCA CCCATGCGAC GATGGAAGAG TCGGTGCGCA AGGCGGTCGC CTCGATCAAG TCGGAAGGCT ATCTCTTCGG CGAACCGCAG GTGATTCGTA TCGAGCGGCC GAAAGAAGAC GCTTAA
|
Protein sequence | MADALKIGIA GLGTVGASLV RIIQQKSNEL AVTCGRPITI TAVSARDKTR DRGIDLSAVT WFDRPEDLAE KGDIDVFVEL MGGAEGAANT SVRAALKRGL HVVTANKALL AYHGVELATI AEEKGALLNF EAAVAGGIPV IKALRESLTG NAVSRIYGIM NGTCNYILTK MEKEGLSFAE CLKEAQRLGY AEADPAFDIE GNDTAHKLSI LTTLAFGNRI AADDIYLEGI TNISIEDIHA AAELGYRIKL LGVAQRTDTG IEQRVHPTMV PVDSVIAQVD GVTNAVAIES DVLGELLMVG PGAGGNSTAS SVLGDIADIA KSQPGAQRVP VLGHPAKALE PYRKAQMQSH EGGYFIRLTV LDRTGVFASV ATRMAENNIS LESIVQRSKQ HLAPSHHQTI ILVTHATMEE SVRKAVASIK SEGYLFGEPQ VIRIERPKED A
|
| |
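The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one hedged way to do that in plain Python: it assumes the sequences are pasted as shown (blocks of ten separated by spaces), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between ten-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence in this record.
print(translate("ATGGCAGATG CCCTCAAAAT CGGCATTGCG"))  # prints: MADALKIGIA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.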