Gene Information |
Locus tag | Rleg2_4755 |
Symbol | |
ID | 6977849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 386954 |
End bp | 388270 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643393922 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002278740 |
Protein GI | 209546822 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
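The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 386954
end_bp = 388270
reported_gene_length = 1317      # bp
reported_protein_length = 438    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported values: 388270 - 386954 + 1 = 1317 bp, and 1317 / 3 - 1 = 438 aa.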
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAAA TTTGCCTTAT CGGGGCTGGG AGTACCGTCT TCGCCCAGAA CATTCTGGGA GATGTGCTGT CCACGCCCTC AGACCATGAC TACATCATCG GCCTTTTTGA CATCGATCCA GAGCGCCTCA AAACATCCGA GATCGTCGCG CGGCGCATAT GCGCGTCGCT GAAGCTCGAC ACGGTTCGGA TCGAGGCAAC GCTCAACCGC CGGGAGGCGC TGAAAGGTGC CGATTTCGTG ATCCTGATGA TGCAGGTTGG CGGCTATAAA CCGGCAACGG TCACGGATTT CAACGTGGCG AAGAACTACG GCTTGCGCCA GACCATTGCG GATACGCTTG GCATCGGCGG CATCTTCCGC GGTCTCAGGA CGATCCCAGT CCTCGAAAGC ATCTGCGGCG ACATGGAAGA GGTGTGCCCG AATGCATTGC TAATGCAGTA TGTAAACCCG ATGGCGATTA ATTGCTGGGC GATCAAGGAA ATCGCCCCGA GCATTCGCAC CGTCGGTCTC TGCCACAGTG TTCAACATAC GGCAGATCAC CTTGCCAGGT GCCTCGGCGA GAAAATCGAC AATATCAGTT ACATCTCAGC CGGCATCAAC CATATCGCTT TCTTCCTTAA ATATGAAAAG CTCCATGGCG ATGGCAGCCG CGAAGACCTT TATCCCAAGC TGAAGGCGCT GGCCGCGGAG GGCAAGGTTC CCGCAGATGA CCGTGTTCGC TTTGATGCCC TGAAAAGGCT CGGTCATTTC GTGACCGAAT CCAGCGAGCA TTTTGCGGAA TATACGTCAT GGTACATCAA GAACCACCAA CCGGAATTGG TAGACCAGCT CAACATTCCA CTCGACGAAT ATATTCGCCG TTGCGAGCTG CAGATCTCAC AATGGCATGT CCTGCGGCAG GACCTCGAAG GGGGAAGACC GATCGAAGTA TGCCGCAGCA ATGAATATGC TTCAGGCATT ATTCATGCTG CGGTGACCGG GAAGCCGGCG CTGATTTATG GAAATGTGCC GAACAACGGC CTGATTGAAA ATCTTCCGCC AGAATGCATT GTCGAAGTTC CATGCCATGT CGATCGCAAT GGCGTCCAAC CGACGCGGAT CGGTAGGATC CCTTCTCAAT TGGCCGCCGT CATGCGGCTG AGCATTTCCG TGCAGGAGCT CACTGTCGAA GCGGCACTGA CAGGCAAGCG TGACCGCATC TATCAGGCCG CGCTGCTCGA TCCGCACACC TCGGCGGAAC TTTCGCCTGA TAAAATCTGG CATATGGTCG ATGACCTCAT CGAGGCACAT GGCGATCTGC TGCCGAACTA CCACTGA
|
Protein sequence | MPKICLIGAG STVFAQNILG DVLSTPSDHD YIIGLFDIDP ERLKTSEIVA RRICASLKLD TVRIEATLNR REALKGADFV ILMMQVGGYK PATVTDFNVA KNYGLRQTIA DTLGIGGIFR GLRTIPVLES ICGDMEEVCP NALLMQYVNP MAINCWAIKE IAPSIRTVGL CHSVQHTADH LARCLGEKID NISYISAGIN HIAFFLKYEK LHGDGSREDL YPKLKALAAE GKVPADDRVR FDALKRLGHF VTESSEHFAE YTSWYIKNHQ PELVDQLNIP LDEYIRRCEL QISQWHVLRQ DLEGGRPIEV CRSNEYASGI IHAAVTGKPA LIYGNVPNNG LIENLPPECI VEVPCHVDRN GVQPTRIGRI PSQLAAVMRL SISVQELTVE AALTGKRDRI YQAALLDPHT SAELSPDKIW HMVDDLIEAH GDLLPNYH