Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4874 |
Symbol | |
ID | 8007261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 255087 |
End bp | 256787 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644821803 |
Product | Glycosyl hydrolase family 32 domain protein |
Protein accession | YP_002973063 |
Protein GI | 241113228 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.963389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.66875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTGC AGGCAAAATC CGAAACGTCA GCACCTGAAA CCATTGAAGC GGAACTGCCC GAAGGCACGG TGCTGCATCT CTGGCTGAAG GCGCGCCATG CCGGCGGCGA GGCCAAGCTC TTCGTCGCCG TTGAGGGTAA CGACATCGGC GAGCCCTCGA CCCACCGCGC AGGGGAATTT GAGTTCTTCG CTGTGACGCT CGCGAAAGGC GGTCGTGCCA CGCTGTCTTA TGATGCGGCA GCCACAGCGC TTTCCGTCGC TTACGCCTTC CGGCCGGAAA CCGTGATGAA GGAGGGCATC CGCGTCCTGC ACAGCGATGC CCGCACCGCC GCCCCCGACG TGCCCGACAG CTACCACTTC CGCCCACCCT TCGGCTGGAT GAACGATCCG AACGGTTTTG GACGGTTCGG CGGAAACGCT CACCTCTTCT ACCAGCATTA TCCTCATGAG CCGCGCTGGA ACACCATGCA CTGGGGCCAT GCCGTCTCGA GGGATTTCGT CCGCTGGACG CATCTGCCCA TGTTCCTCTT CCCGGCGGCG CATCTATCGG AAAAGGACGA TGGCCGCGGC GGCGCCTTCT CCGGCTCGGC GATCCCCGGC TCCGGCCCGG AAGGCGAGGA AATCCGCGTC TTCTACACCG AGCATGTGCG TGACCGTCTA CCGGAAGAGC AAATCCAGCT TTCCGCCGTC AGCCGCGACG GCATCGTCGC CGGCCCGTCG GAAATCGTGA TGCCGCTGCG CCCGGAAGGC TTGAACGTCA CCACCGATTT CCGCGACCCC TATGTCTTCA AAGGCCCGGA CGGGCGCTGG AAGATGCTGC TCGGCAGCCG CGACAGGCAG GGCGGCGTCG TACTGCTCTA TGAAACCGCA GACGCGCAAG GCGTCGATGG CTGGACCTTC CTCGGCATCC TCCATCGCGA GGACGGTTTC GGCATGACGG CGGCGGAATG CCCCTGCATG GTGCCGCTTT CCGGCAAAAA CGCAGAAACC CGCTGGGCGC TGATCTTCGG TCTGCTCACC AGCCGCGACC CGGCCACCGG CCGCCGCAAC CTCACTTCCG TCACCGTCGG TGGTTTCGAT GGCCGCACCT TCGTCGCGGA ATTCGTGCAG GAGTTGGATT TCGGTTCGGA TGCCTATGCC TTCCAGGCCT TCGTTGATGG CGACGAGCCG GTCGGCATCG CCTGGCTCGC CAACTGGACG GATTTTTCCA AGAAGGACGA TTTCCCGACG GCCATGACCC TGCCGCGCCG CATGCTTCTC GACGGCGACA CCGTGCTGAC CCCGCCGGTC GCAGCCGTCG AAAGCCTACG CCATCGGTTG CTGGACGGCA CCGCGCTTGC CGCCGGCAAG ACCGTGCCGC TCGGCACCGG CGCCGTCGAG ATCGTGCTTG ATCTCACCGC GCCGGGCGCC GCCTTCGATC TCACCTTCGA TCATCCGGAT GTCGATCTAG GCGTTAAACT CGACGCCGAT GGTCTGGCGA TTGTCTTCGA CGCCCGCACC GGTATGAGGC CGCCGCGTTA CGTCGCCGCC GGCGCGAATC CGTCGAGCCT GCGCATCTTC CTCGATGCCG GCTCCATCGA GGTCTTCGCT GACAACGGCC GCTGGACGGG GTCCAAACGC ATTCCGAGCT TTGCCGCCGC ACGTTCGGCG ACGCTCGCCG GCGTCGTCGC CGGGGCCGGC GTCTGGCAAT TGAAACTGTG A
|
Protein sequence | MSLQAKSETS APETIEAELP EGTVLHLWLK ARHAGGEAKL FVAVEGNDIG EPSTHRAGEF EFFAVTLAKG GRATLSYDAA ATALSVAYAF RPETVMKEGI RVLHSDARTA APDVPDSYHF RPPFGWMNDP NGFGRFGGNA HLFYQHYPHE PRWNTMHWGH AVSRDFVRWT HLPMFLFPAA HLSEKDDGRG GAFSGSAIPG SGPEGEEIRV FYTEHVRDRL PEEQIQLSAV SRDGIVAGPS EIVMPLRPEG LNVTTDFRDP YVFKGPDGRW KMLLGSRDRQ GGVVLLYETA DAQGVDGWTF LGILHREDGF GMTAAECPCM VPLSGKNAET RWALIFGLLT SRDPATGRRN LTSVTVGGFD GRTFVAEFVQ ELDFGSDAYA FQAFVDGDEP VGIAWLANWT DFSKKDDFPT AMTLPRRMLL DGDTVLTPPV AAVESLRHRL LDGTALAAGK TVPLGTGAVE IVLDLTAPGA AFDLTFDHPD VDLGVKLDAD GLAIVFDART GMRPPRYVAA GANPSSLRIF LDAGSIEVFA DNGRWTGSKR IPSFAAARSA TLAGVVAGAG VWQLKL
|
| |