Gene Information |
Locus tag | Rleg2_5903 |
Symbol | |
ID | 6977459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 318913 |
End bp | 320283 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393356 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002278174 |
Protein GI | 209546284 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
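The length fields above are consistent with the coordinates. A minimal Python sketch of that check follows. It assumes the start and end positions are 1-based and inclusive (this is the convention that reproduces the listed 1371 bp), and that the coding sequence ends in one stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(318913, 320283)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch gives 1371 bp and 456 aa, matching the Gene Length and Protein Length fields.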
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.670067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAGCAA ATCCCAAGAT CACATTCATC GGAGCTGGCT CCACCGTTTT CATGAAGAAC ATCATCGGCG ACGTTTTGCA GCGTCCCGCC CTTTCGGGCG CAACGATCGC GCTGATGGAT CTCAACCCGC AGCGGCTTGA GGAAAGCGCC ATCGTCGTCA ACAAGCTGAT CTCGACGCTC GGCGTCAAGG CCAAGGCCGA GACCTATTCC GACCAGCGCA AGGCGCTAGC CGGCGCCGAT TTCGTCGTCG TCGCCTTCCA GATCGGCGGC TATGAGCCCT GCACCGTCAC CGATTTCGAA GTGCCGAAGA AATACGGCCT GCGCCAGACG ATCGCCGATA CGCTCGGCGT CGGCGGCATC ATGCGGGGCT TGCGCACCGT GCCGCATCTC TGGAAGGTCT GCGAGGATAT GCTCGCCGTC TGCCCCGAGG CGATCATGCT GCAATATGTC AACCCGATGG CGATCAACAC CTGGGCGATC TCCGAGAAAT ACCCGACCAT CAGCCAGGTC GGCCTCTGCC ATTCGGTGCA GGGCACGGCG ATGGAGCTGG CCCATGACCT CGACATTCCC TACGAGGAAA TCCGCTACCG CGCGGCCGGC ATCAACCACA TGGCCTTCTA TCTCAAATTC GAGCATCGCC AGGCCGACGG CTCCTACCGC AATCTCTATC CCGATCTCGT GCGCGGCTAC CGCGAGGGCA GAGCGCCGAA GCCCGGCTGG AACCCGCGCT GCCCGAACAA GGTGCGCTAC GAGATGCTGA CGCGGCTCGG CTATTTCGTC ACCGAAAGCT CGGAGCATTT CGCCGAATAC ACGCCCTATT TCATCAAGGA AGGCCGCGAC GACCTGATCG AGAAATTCGG CATTCCGCTC GATGAATATC CGAAACGCTG CATCGAGCAG ATCGAGCGCT GGAAGGGCCA GGCGGAAGCC TATCGCAGCG CCGACAAGAT CGAGGTGACG CCCTCGAAGG AATACGCTTC CTCGATCATC AACTCGGTCT GGACCGGCGA ACCCTCTGTC ATTTACGGCA ATGTCCGCAA CAATGGCTGC ATCACCTCGC TGCCCGCCAA TTGCGCCGCC GAAGTGCCCT GCCTCGTCGA CGCCTCCGGC ATCCAGCCGA CCTTCATCGG CGACCTGCCG CCGCAGCTGA CCGCGCTGAT CCGCACCAAT ATCAACGTCC AGGAACTGAC GGTGCAGGCG CTGATGACCG AAAATCGCGA GCACATCTAC CACGCCGCGA TGATGGACCC GCACACGGCC GCCGAACTCG ACCTCGACCA GATTTGGTCG CTGGTCGACG ACCTGCTCGC CACCCACGGC AACTGGCTGC CCGAATGGGC CCGCACATCT AGAAAAGTTC AAGCCGCCTG A
Protein sequence | MAANPKITFI GAGSTVFMKN IIGDVLQRPA LSGATIALMD LNPQRLEESA IVVNKLISTL GVKAKAETYS DQRKALAGAD FVVVAFQIGG YEPCTVTDFE VPKKYGLRQT IADTLGVGGI MRGLRTVPHL WKVCEDMLAV CPEAIMLQYV NPMAINTWAI SEKYPTISQV GLCHSVQGTA MELAHDLDIP YEEIRYRAAG INHMAFYLKF EHRQADGSYR NLYPDLVRGY REGRAPKPGW NPRCPNKVRY EMLTRLGYFV TESSEHFAEY TPYFIKEGRD DLIEKFGIPL DEYPKRCIEQ IERWKGQAEA YRSADKIEVT PSKEYASSII NSVWTGEPSV IYGNVRNNGC ITSLPANCAA EVPCLVDASG IQPTFIGDLP PQLTALIRTN INVQELTVQA LMTENREHIY HAAMMDPHTA AELDLDQIWS LVDDLLATHG NWLPEWARTS RKVQAA
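The protein sequence can be rederived from the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline. It strips the spaces that group the bases in tens, translates codon by codon, and stops at the first stop codon. It assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on translation table 11 sharing its amino acid assignments with the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon table built in TCAG order; table 11 uses the same
# amino acid assignments (assumption stated in the text above).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the whitespace used to group bases and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCAGCAA ATCCCAAGAT CACATTCATC"))  # MAANPKITFI
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.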