Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5748 |
Symbol | |
ID | 6977138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 150515 |
End bp | 151876 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643393204 |
Product | Inulin fructotransferase (DFA-I-forming) |
Protein accession | YP_002278022 |
Protein GI | 209546132 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.702747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.432156 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGGCG AAAATTGCTA CGACGTTACC AAGTATCCAG CCGGCAATCC TCGTGAGGAT ATTGGCGCGG TCATCAATAG CATCATTGCC GATATCAAAA ACAGGCAAGC GGTTGCCGAT GTAAATGACG GCGGAAAACC TGGATCGGTT ATCTATATAC CGCCAGGCGA TTATCGTCTT GTCACTCAAG TCGTTATAGA CGTGAGCTAT CTGAAAATCG TCGGCTCTGG GCATGGTTTT ACGTCGTCCA GCATCCGTTT CAACACACCC GCAAGCGAGT TGGCCCACTG GCACGAAGTG TGGCCGGGCG GAAGTCGCAT CCTTGTGGAC ACGTCCCCAG AGGCCGCAGA CGGTGAGGCT GCTGGTGCCG CCTTTTATGT CAAGCGCGGC GGAAATCCTC GGATAAGCTC TGTGGAGTTT GCTGACTTTT GCATCGATGG CTTGCACTTC ATCGACGATG GTTCGGGGCA AAACGACGCA GAAAATACAT ACAGAAATGG CAAAACGGGA ATCTACGTAG GCAGCGCCAA TGACTCATTC CGAATAACCG GGATGGGCCT TATCTACCTC GAGCATGGCG TTACTGTTCA TGATGCAGAT GCGCTCGCGA TAGATAACAA TTTCATTGCG GAGTGCGGCA ACTGTATCGA ACTGAAAGGT ATGGGGCAGG CCTCAAGAAT AGCAAATAAT TTTGTCGGCG CCGGATATAG GGGGCACTCC ATTTACGCCG AGAATTATGC GGGCATTCTG GTATCCTCAA ACAACGTATT TCCTCGCGGA GCGAGCAGTG TCCATTTCTC CGGCGTGGTG CGTTCCTCGG TTACAGGAAA CAGGTTCCAT TCCTTTTATC CCGGGATGTT GGTTTTTGCC GCCAACTGCT GCGAGAATTT GGTCTCCTCA AATCACTTTC TGCGAGATCG CGAGCCATGG GCGCCGATGC AGAAGTACGA CAACGGCCTG GATGATCTGT TTGGGCTTTT GCAGATTGAC GGCAGCAACA ATTCGCTGAT CGCGAACCAC ATTTCGGAAA CAATAGATAC CAAATACATC AAGCCTCCAG AAGTAAAGCC TGTGATAATT AATGTAGTTT CCGGTAGTGG CAACTACATA GCCAGCAACC ACATTGTAGC CACCGCCGAA ATATCTCAAA AGGACAAGAG CGATGCGCCA AACAGCGCCT GTTTTTCAAC ACAGGTGAGC GCGTTGCTTT CAACCGGGAA TTCGACGTTG CTCGACGTAA CGACAGTGCT GGTGCAAAAG GAATCCGTGC GGAATACGGT CCTGGACTCC GGAAATGACG AACAAGTTGT GATGGACAGA ACGGTAAATG CATTCAGGGG CACTCCGGTT CCTGGGCAAT AG
|
Protein sequence | MVGENCYDVT KYPAGNPRED IGAVINSIIA DIKNRQAVAD VNDGGKPGSV IYIPPGDYRL VTQVVIDVSY LKIVGSGHGF TSSSIRFNTP ASELAHWHEV WPGGSRILVD TSPEAADGEA AGAAFYVKRG GNPRISSVEF ADFCIDGLHF IDDGSGQNDA ENTYRNGKTG IYVGSANDSF RITGMGLIYL EHGVTVHDAD ALAIDNNFIA ECGNCIELKG MGQASRIANN FVGAGYRGHS IYAENYAGIL VSSNNVFPRG ASSVHFSGVV RSSVTGNRFH SFYPGMLVFA ANCCENLVSS NHFLRDREPW APMQKYDNGL DDLFGLLQID GSNNSLIANH ISETIDTKYI KPPEVKPVII NVVSGSGNYI ASNHIVATAE ISQKDKSDAP NSACFSTQVS ALLSTGNSTL LDVTTVLVQK ESVRNTVLDS GNDEQVVMDR TVNAFRGTPV PGQ
|
| |