Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3767 |
Symbol | |
ID | 8014597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3821770 |
End bp | 3822918 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826330 |
Product | galactonate dehydratase |
Protein accession | YP_002977549 |
Protein GI | 241206453 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.854343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.799766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CCAAACTCAC CACCTATATC GTTCCCCCGC GCTGGCTGTT TTTGAAGGTC GAGACCGATG AAGGCATCGT CGGCTGGGGC GAGCCGGTCG TCGAAGGCCG TGCGCTCACC GTTCAGGCCG CCGTCCATGA GCTGGAAGAC TACCTGATCG GCAAGGATCC TTTCCTGATC GAAGACCACT GGACCGTGAT GTATCGCGGC GGCTTCTATC GCGGCGGCGC CGTCCACATG AGCGCAATCT CGGGCATCGA CCAGGCGCTG TGGGACATCA AGGGCAAGGC GCTCGGCCAG CCGATCCATT CCCTGCTCGG CGGCCAGCTC CGTGATCGCA TCAAAGTCTA TTCCTGGATC GGCGGCGACC GTCCCTCGGA TGTCGCCAAC AATGCCAAGG AAGTGGTGGC CCGCGGTTTC AAAGCGATCA AGCTCAATGG CTGCGAGGAA ATGCAGATCG TCGACACCAA CGAAAAGGTG GAGAAGGCGG TCGAGACCAT CGCCGCTATC CGCGAGGCGA TCGGCCCGCA TATCGGCATC GGCGTCGATT TCCACGGCCG CGTCCACAAG CCGATGGCGA AGGTTCTCGC CAAGGAGCTC GATCCCTACA AGCTGATGTT CATCGAAGAG CCGGTGCTTT CCGAAAACAA GGAAGCGCTG CGCGATATCG TCAACCACAC CTCGACGCCG ATTGCGCTGG GTGAACGCCT CTTTTCGCGT TGGGACTTCA AGCAGGTTCT CTCCGACGGT TATGTCGACA TCATCCAGCC GGATCTCTCC CATGCCGGCG GCATCACCGA ATGCCGCAAG ATCGCGGCGA TGGCCGAAGC CTATGACGTG GCGCTGGCGC CGCATTGCCC ACTGGGTCCG ATCGCGCTTG CCGCCTGCCT GCAGGTCGAT GCCGTCAGCT ACAATGCCTT CATCCAGGAA CAGAGCCTCG GCATCCACTA CAACAAGGGC AACGACATCC TCGACTACAT CTCCAACAAG GAGGTGTTCC AGTATGCCGA TGGTTTCGTC TCGATCCCGC AGGGTCCGGG TCTCGGCATC GAGGTCGACG AGGCCTATGT CATCGAACGC GCCAAGGAGG GCCACCGCTG GCGCAACCCG ATCTGGCGGC ATGCCGACGG CAGCTTCGCC GAGTGGTGA
|
Protein sequence | MKITKLTTYI VPPRWLFLKV ETDEGIVGWG EPVVEGRALT VQAAVHELED YLIGKDPFLI EDHWTVMYRG GFYRGGAVHM SAISGIDQAL WDIKGKALGQ PIHSLLGGQL RDRIKVYSWI GGDRPSDVAN NAKEVVARGF KAIKLNGCEE MQIVDTNEKV EKAVETIAAI REAIGPHIGI GVDFHGRVHK PMAKVLAKEL DPYKLMFIEE PVLSENKEAL RDIVNHTSTP IALGERLFSR WDFKQVLSDG YVDIIQPDLS HAGGITECRK IAAMAEAYDV ALAPHCPLGP IALAACLQVD AVSYNAFIQE QSLGIHYNKG NDILDYISNK EVFQYADGFV SIPQGPGLGI EVDEAYVIER AKEGHRWRNP IWRHADGSFA EW
|
| |