Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5368 |
Symbol | |
ID | 8007326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 777760 |
End bp | 779076 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644822272 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002973532 |
Protein GI | 241113697 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.265798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGA TTTGCCTGGT AGGCGCTGGT AGCACCGTTT TTGCACAGAA CATTCTGGGA GACGTCCTGT CCTCGCAGAG AGGCGGCGAC TACGTCATCA GCCTCTTCGA TATCGATCCC GAGCGACTGA AGACGTCAGA GATCGTCGCT CGCCGGATCT GCGAGTCGCT TAAGCTTTCA AGTGTCAGGA TCGACGCCAC GCTCGACCGG CGGGAAGCGC TGCGGGGGTC GGATTTCGTC ATCCTCATGA TGCAGGTCGG TGGCTACAAG CCGGCTACCG TCACGGACTT CGACATTCCG AAAAAATATG GCTTGCGTCA GACGATCGCC GACACCCTTG GCATCGGCGG CATTTTCCGC GGTTTGCGGA CGATACCAGT CCTTGAGGCG ATCTGCCGAG ACATGCAGGA GGTCTGCCCG CAGGCTCTTC TGATGCAGTA CGTCAACCCG ATGGCCATCA ACTGCTGGGC GATCAAGGAG CTGGCCCCGG AAATCCGCAC CGTCGGCTTG TGTCATAGCG TGCAGCACAC AGCCGGGCAT CTCGCGCAAT GCCTCGGCGA GGACATTGCC GACGTGAACT ATATTTCGGC CGGGATCAAT CACGTCGCCT TCTTCCTGAA ATATGAGAAA GTTCACAGCG ACGGACGGCG GGAAGATCTT TATCCGAGAC TGAACGCTCT CGCCACCGAT GGCCGCGTCC CGTCGGACGA TCGGGTGCGG TTCGATGTGC TCAAACGCCT CGGCCACTTC GTCACCGAAT CCAGCGAGCA TTTCTCCGAG TATACCTCCT GGTACATCAA GGAGGGACGA GGGGATCTGA TCGATCAGCT CAATATCCCG TTGGACGAAT ACATCCGGCG CTGCGAGGTG CAGATCAAAG AATGGCATGC CCTGCGCAAG GAACTGGAAG GGGACAAGCC GATCGAGGTG TGCCGCAGCA ATGAATATGC GGCCGGCATC ATTCATGCCG CGGTCACTGG CAGCCCGGCG CTGATATACG GTAATGTCCC AAACAACGGC CTGATCGAGA ATCTGCCCGA TGAATGCATC GTGGAAGTGC CTTGCCATGT CGACAGAAAC GGAATTCAGC CGGTCCGGGT CGGTCGGATC CCCTCTCAGC TTGCCGCCGT CATGAACTTG AGCGTTTCCG TTCAGCAGTT GACGGTCGAG GCGGCTCTTA CAAAAAACCG CGAGCGCATC TACCAGGCCG CTCTGCTCGA TCCGCATACG TCTGCGGAAT TGTCGCCAGA CCAAATCTGG AACCTTGTCG ACGACCTGAT CGTCGCACAC GGCGATTTGT TGCCGAGATA TCAGTGA
|
Protein sequence | MPKICLVGAG STVFAQNILG DVLSSQRGGD YVISLFDIDP ERLKTSEIVA RRICESLKLS SVRIDATLDR REALRGSDFV ILMMQVGGYK PATVTDFDIP KKYGLRQTIA DTLGIGGIFR GLRTIPVLEA ICRDMQEVCP QALLMQYVNP MAINCWAIKE LAPEIRTVGL CHSVQHTAGH LAQCLGEDIA DVNYISAGIN HVAFFLKYEK VHSDGRREDL YPRLNALATD GRVPSDDRVR FDVLKRLGHF VTESSEHFSE YTSWYIKEGR GDLIDQLNIP LDEYIRRCEV QIKEWHALRK ELEGDKPIEV CRSNEYAAGI IHAAVTGSPA LIYGNVPNNG LIENLPDECI VEVPCHVDRN GIQPVRVGRI PSQLAAVMNL SVSVQQLTVE AALTKNRERI YQAALLDPHT SAELSPDQIW NLVDDLIVAH GDLLPRYQ
|
| |