Gene Information |
Locus tag | Rleg_6878 |
Symbol | |
ID | 8022461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 331728 |
End bp | 333194 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644833740 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002984874 |
Protein GI | 241666790 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
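The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 331728, End bp 333194.
length = gene_length_bp(331728, 333194)
print(length)                     # 1467, matching "Gene Length"
print(protein_length_aa(length))  # 488, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields, which suggests the record is internally consistent.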
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.036749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.397906 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTTTCA AAATCGCTAT CATCGGTGCG GGCAGCGTCG GTTTCACCAA GAAGCTGTTT ACCGATATAT TGTGCGTTCC CGAGTTTCGC GACGTCGAAT TCGCGCTGAC GGATCTCAGC GAACACAATC TCGAAATGAT CAAGGCGATC CTCGACCGCG TCGTCGAGGC GAACGGGCTG CCGACGAAGG TGACGGCGAC CACCAACCGG CGCCAGGCGT TGGAAGGCGC GCGGTATATT ATCAGCTGCG TTCGGGTCGG TGGGCTCGAG GCCTATGCCG ATGATATCGG AATTCCCCTG AAATACGGCA TCGACCAGTG CGTCGGCGAT ACGATCTGTG CCGGCGGCAT CCTCTATGGC CAGCGCAACA TCCCGGTCAT CCTCGACTTC TGCAAGGATA TCCGCGAGGT CGCCGAGCCC GGCGCGAAAT TCCTGAACTA TGCCAACCCG ATGGCGATGA ACACCTGGGC GGCGATCGAA TATGGCAAGG TCGATACCGT CGGCCTCTGC CACGGCGTCC AGCATGGCGC CGAACAGATC GCCGAGGTGC TCGGCGCGAA GTCGCTGAGC GAGCTCGACT ATATTTGCTC CGGCATCAAC CATCAAACCT GGTTCATCGA CCTGCGCCTC AACGGCCGCA GGATCGGCAA GGACGAGCTC ATCGCCGCCT TCGAGGCGCA TCCGGTCTAT TCGCAGCAGG AGAAATTGCG CATCGACGTG CTGAAGCGTT TCGGCGTCTA TTCCACGGAA AGCAACGGCC ATCTCTCGGA ATACCTGCCC TGGTATCGCA AGCGGCCGGA CGAAATCACC CGCTGGATCG ACATGTCCGA CTGGATCCAC GGCGAGACCG GCGGCTATCT CCGCCATTCC ACTGAAACGC GCAACTGGTT CGAGACGGAA TATCCGCAAT TTTTGGAATC CGCCGCAAAG CCGATCGATC CTGCCAAGCG CTCGAACGAA CATGCCAGCC ATATCCTCGA GGCGCTGGAG ACGAACCGGG TCTATCGCGG CCATTTCAAC CTCAAGAACA ATGGCGTCAT CACCAACCTG CCGTCCGATG CGATCATCGA ATCGCCGGGC TTCGTCGATC GCTTCGGCAT CAACATGGTC TCCGGCGTCA CCCTGCCGGA AGCCTGCGCG GCCACCTGCA TCGCCTCGAT CAACGTCCAG CGCATGTCGG TGCACGCGGC GATATCAGGC GACATCGACC TCTTGAAGCT TGCCGTGCTG CACGACCCGC TGGTCGGCGC CGTCTCGACG CCGGAAGAGG TCTGGCAGAT GGTCGACGAG ATGGTCGTTG CCCAGGCGCG CTGGCTGCCG CAATATGCGC ATGCCGTGCC GGCCGCCAAG GAGCGGCTGT CGAAATCGAA GGTGCAGACC CGCGACTGGG CGGGGGCCGC ACGCCGCAAC GTTCGCTCGA TCGAGGAGCT GCGCGCGGAA AAGGCGGCAC TGAAACAGGC CGTCTGA
Protein sequence | MSFKIAIIGA GSVGFTKKLF TDILCVPEFR DVEFALTDLS EHNLEMIKAI LDRVVEANGL PTKVTATTNR RQALEGARYI ISCVRVGGLE AYADDIGIPL KYGIDQCVGD TICAGGILYG QRNIPVILDF CKDIREVAEP GAKFLNYANP MAMNTWAAIE YGKVDTVGLC HGVQHGAEQI AEVLGAKSLS ELDYICSGIN HQTWFIDLRL NGRRIGKDEL IAAFEAHPVY SQQEKLRIDV LKRFGVYSTE SNGHLSEYLP WYRKRPDEIT RWIDMSDWIH GETGGYLRHS TETRNWFETE YPQFLESAAK PIDPAKRSNE HASHILEALE TNRVYRGHFN LKNNGVITNL PSDAIIESPG FVDRFGINMV SGVTLPEACA ATCIASINVQ RMSVHAAISG DIDLLKLAVL HDPLVGAVST PEEVWQMVDE MVVAQARWLP QYAHAVPAAK ERLSKSKVQT RDWAGAARRN VRSIEELRAE KAALKQAV