Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6873 |
Symbol | |
ID | 8022456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 324184 |
End bp | 325554 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644833735 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_002984869 |
Protein GI | 241666785 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGAA ATCCCAAAAT CACTTTCATC GGAGCTGGCT CCACCGTCTT CATGAAGAAC ATCGTCGGTG ACGTGCTGCA GCGTCCGGCC CTGTCGGGTG CGACGATCGC CTTGATGGAT CTCAACCCGC AGCGGCTGGA AGAAAGCGCC ATCGTCGTCA ACAAGCTGAT CTCGACGCTC GGCGTCAAGG CGAAGGCCGA GACCTATTCT GACCAGCGCA AGGCGCTTTC GGGCGCAGAT TTCGTCGTCG TCGCCTTCCA GATCGGCGGC TATGAACCCT GCACGGTCAC CGATTTCGAA GTGCCGAAGA AATATGGCCT GCGCCAGACG ATCGCCGATA CGCTCGGCGT CGGCGGCATC ATGCGCGGGC TTCGCACCGT GCCGCATCTC TGGAAGGTCT GCGAGGACAT GCTCGCCGTC TGCCCCGAGG CGATCATGTT GCAATATGTC AACCCGATGG CGATCAACAC CTGGGCGATA TCGGAGAAGT ATCCGACCAT TCGCCAAGTC GGCCTTTGCC ATTCGGTGCA GGGCACGGCG ATGGAACTGG CCCACGACCT CGAAATTCCC TACGAGGAAA TCCGTTACCG GGCGGCCGGC ATCAACCACA TGGCCTTCTA TCTCAAATTC GAGCATCGCC AGGCCGACGG TTCTTACCGC GACCTCTATC CCGATCTCGT GCGCGCCTAT CGCGAGGGCA GGGCGCCGAA GCCTGGCTGG AACCCGCGCT GCCCGAACAA GGTGCGCTAC GAGATGCTGA CGCGGCTCGG CTATTTCGTC ACCGAGAGCT CGGAGCATTT CGCCGAATAC ACGCCCTATT TCATCAAGGA GGGCCGCGAC GACCTGATCG AGAAATTCGG CATCCCGCTC GATGAATATC CGAAGCGCTG CATCGAGCAG ATCGAGCGCT GGAAAGGCCA GGCGGAGGCC TATCGTTCGG CCGACAAGAT CGAGGTCAAG CCGTCGAAGG AATATGCCTC CTCGATCATC AACTCGGTCT GGACCGGCGA GCCCTCGGTG ATCTACGGCA ATGTCCGCAA CAATGGCTGC ATCACCTCGC TGCCCGCCAA TTGCGCCGCC GAGGTGCCCT GCCTCGTCGA TGCCTCCGGC ATCCAGCCGA CCTTCATCGG CGACCTGCCG CCGCAGCTGA CGGCTCTGAT CCGCACCAAT ATCAACGTGC AGGAACTGAC GGTGCAGGCG CTGATGACGG AAAACCGCGA GCATATCTAT CACGCCGCGA TGATGGACCC GCATACGGCA GCCGAACTCG ACCTCGACCA GATCTGGTCG CTGGTCGACG ATCTGCTCGC CACCCACGGC GACTGGCTGC CCGAATGGGC ACGCACCGCC CGCAAGGTAC AGGCCGCCTG A
|
Protein sequence | MARNPKITFI GAGSTVFMKN IVGDVLQRPA LSGATIALMD LNPQRLEESA IVVNKLISTL GVKAKAETYS DQRKALSGAD FVVVAFQIGG YEPCTVTDFE VPKKYGLRQT IADTLGVGGI MRGLRTVPHL WKVCEDMLAV CPEAIMLQYV NPMAINTWAI SEKYPTIRQV GLCHSVQGTA MELAHDLEIP YEEIRYRAAG INHMAFYLKF EHRQADGSYR DLYPDLVRAY REGRAPKPGW NPRCPNKVRY EMLTRLGYFV TESSEHFAEY TPYFIKEGRD DLIEKFGIPL DEYPKRCIEQ IERWKGQAEA YRSADKIEVK PSKEYASSII NSVWTGEPSV IYGNVRNNGC ITSLPANCAA EVPCLVDASG IQPTFIGDLP PQLTALIRTN INVQELTVQA LMTENREHIY HAAMMDPHTA AELDLDQIWS LVDDLLATHG DWLPEWARTA RKVQAA
|
| |