Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3479 |
Symbol | |
ID | 6982233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3594742 |
End bp | 3595890 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643398197 |
Product | galactonate dehydratase |
Protein accession | YP_002282972 |
Protein GI | 209551055 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.412363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.814852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CCAAACTCAC CACCTATATC GTTCCGCCGC GCTGGCTGTT TTTGAAGATC GAGACCGACG AGGGCATCAT CGGCTGGGGC GAACCGGTCG TCGAAGGCCG CGCGCTGACC GTCCAAGCCG CCGTCCACGA ACTGGAAGAC TATCTGATCG GCAAGGATCC CTTCCTGATC GAGGACCACT GGACGGTCAT GTATCGCGGC GGCTTCTATC GCGGCGGCGC TGTACACATG AGCGCCATCT CCGGCATCGA CCAGGCGCTG TGGGACATCA AGGGCAAGGC CCTCGGCCAG CCGATCCATT CCCTGCTGGG CGGCCAGCTG CGCGACCGCA TCAAGGTCTA TTCCTGGATC GGCGGCGATC GTCCGGCGGA TGTCGCCAAC AATGCCAGAG AGGTGGTCGC CCGCGGCTTC AAGGCGATCA AGCTCAACGG CTGCGAGGAA ATGCAGATCG TCGACACCAA CGAGAAGGTG GAAAAGGCGG TCGAAACCAT CGCCGTCATC CGCGAGGCGA TCGGCCCGCA TATCGGCATC GGCGTCGATT TCCACGGCCG CGTGCACAAG CCGATGGCCA AGGTTCTCGC CAAAGAACTC GAACCCTACA AGCTGATGTT CATCGAGGAG CCGGTTCTTT CGGAAAACAA GGAGGCGCTG CGCGACATCG TCAACCATAC CTCGACGCCG ATCGCGCTTG GCGAACGGCT CTTTTCGCGC TGGGACTTCA AGCAGGTCCT GTCGGACGGT TATGTCGACA TCATCCAGCC GGATCTGTCG CATGCCGGCG GCATCACCGA GTGCCGCAAG ATCGCGGCGA TGGCCGAAGC CTATGATGTG GCGCTGGCGC CGCATTGCCC GCTAGGCCCG ATCGCACTCG CCGCCTGCCT GCAGGTCGAT GCCGTCAGCT ATAATGCCTT CATCCAGGAA CAGAGCCTCG GCATCCATTA CAACAAGGGC AACGACATCC TCGACTACAT CTCCAACAAG GAGGTGTTCC AATATGCCGA TGGTTTCGTC TCGATCCCCC AGGGGCCCGG TCTCGGCATC GAAGTCGACG AGGCCTATGT CATCGAGCGC GCGAGGGAAG GCCACCGCTG GCGCAACCCG ATCTGGCGCC ACGCCGACGG CAGTTTCGCC GAATGGTAA
|
Protein sequence | MKITKLTTYI VPPRWLFLKI ETDEGIIGWG EPVVEGRALT VQAAVHELED YLIGKDPFLI EDHWTVMYRG GFYRGGAVHM SAISGIDQAL WDIKGKALGQ PIHSLLGGQL RDRIKVYSWI GGDRPADVAN NAREVVARGF KAIKLNGCEE MQIVDTNEKV EKAVETIAVI REAIGPHIGI GVDFHGRVHK PMAKVLAKEL EPYKLMFIEE PVLSENKEAL RDIVNHTSTP IALGERLFSR WDFKQVLSDG YVDIIQPDLS HAGGITECRK IAAMAEAYDV ALAPHCPLGP IALAACLQVD AVSYNAFIQE QSLGIHYNKG NDILDYISNK EVFQYADGFV SIPQGPGLGI EVDEAYVIER AREGHRWRNP IWRHADGSFA EW
|
| |