Gene Rleg2_5748 details

Gene Information

Locus tag: Rleg2_5748
Symbol:
ID: 6977138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 150515
End bp: 151876
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 51%
IMG OID: 643393204
Product: Inulin fructotransferase (DFA-I-forming)
Protein accession: YP_002278022
Protein GI: 209546132
COG category:
COG ID:
TIGRFAM ID:
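
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 150515
end_bp = 151876
reported_gene_length = 1362      # bp
reported_protein_length = 453    # aa

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)          # 1362 True
print(protein_length, protein_length == reported_protein_length)  # 453 True
```

Under these assumptions the reported gene length and protein length are consistent with the coordinates.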


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.702747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.432156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGGCG AAAATTGCTA CGACGTTACC AAGTATCCAG CCGGCAATCC TCGTGAGGAT 
ATTGGCGCGG TCATCAATAG CATCATTGCC GATATCAAAA ACAGGCAAGC GGTTGCCGAT
GTAAATGACG GCGGAAAACC TGGATCGGTT ATCTATATAC CGCCAGGCGA TTATCGTCTT
GTCACTCAAG TCGTTATAGA CGTGAGCTAT CTGAAAATCG TCGGCTCTGG GCATGGTTTT
ACGTCGTCCA GCATCCGTTT CAACACACCC GCAAGCGAGT TGGCCCACTG GCACGAAGTG
TGGCCGGGCG GAAGTCGCAT CCTTGTGGAC ACGTCCCCAG AGGCCGCAGA CGGTGAGGCT
GCTGGTGCCG CCTTTTATGT CAAGCGCGGC GGAAATCCTC GGATAAGCTC TGTGGAGTTT
GCTGACTTTT GCATCGATGG CTTGCACTTC ATCGACGATG GTTCGGGGCA AAACGACGCA
GAAAATACAT ACAGAAATGG CAAAACGGGA ATCTACGTAG GCAGCGCCAA TGACTCATTC
CGAATAACCG GGATGGGCCT TATCTACCTC GAGCATGGCG TTACTGTTCA TGATGCAGAT
GCGCTCGCGA TAGATAACAA TTTCATTGCG GAGTGCGGCA ACTGTATCGA ACTGAAAGGT
ATGGGGCAGG CCTCAAGAAT AGCAAATAAT TTTGTCGGCG CCGGATATAG GGGGCACTCC
ATTTACGCCG AGAATTATGC GGGCATTCTG GTATCCTCAA ACAACGTATT TCCTCGCGGA
GCGAGCAGTG TCCATTTCTC CGGCGTGGTG CGTTCCTCGG TTACAGGAAA CAGGTTCCAT
TCCTTTTATC CCGGGATGTT GGTTTTTGCC GCCAACTGCT GCGAGAATTT GGTCTCCTCA
AATCACTTTC TGCGAGATCG CGAGCCATGG GCGCCGATGC AGAAGTACGA CAACGGCCTG
GATGATCTGT TTGGGCTTTT GCAGATTGAC GGCAGCAACA ATTCGCTGAT CGCGAACCAC
ATTTCGGAAA CAATAGATAC CAAATACATC AAGCCTCCAG AAGTAAAGCC TGTGATAATT
AATGTAGTTT CCGGTAGTGG CAACTACATA GCCAGCAACC ACATTGTAGC CACCGCCGAA
ATATCTCAAA AGGACAAGAG CGATGCGCCA AACAGCGCCT GTTTTTCAAC ACAGGTGAGC
GCGTTGCTTT CAACCGGGAA TTCGACGTTG CTCGACGTAA CGACAGTGCT GGTGCAAAAG
GAATCCGTGC GGAATACGGT CCTGGACTCC GGAAATGACG AACAAGTTGT GATGGACAGA
ACGGTAAATG CATTCAGGGG CACTCCGGTT CCTGGGCAAT AG
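
The GC content field can be recomputed from a gene sequence printed in this layout. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases, and that the spaces and line breaks are only display formatting. The function name `gc_content` is illustrative. Only the first row is used here, so the printed value will generally differ from the whole-gene figure in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row of the gene sequence, copied with its 10-base grouping.
first_row = "ATGGTTGGCG AAAATTGCTA CGACGTTACC AAGTATCCAG CCGGCAATCC TCGTGAGGAT"

# Passing the full 1362 bp sequence instead would allow a comparison
# with the GC content reported in the Gene Information section.
print(f"{gc_content(first_row):.1%}")
```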
 
Protein sequence
MVGENCYDVT KYPAGNPRED IGAVINSIIA DIKNRQAVAD VNDGGKPGSV IYIPPGDYRL 
VTQVVIDVSY LKIVGSGHGF TSSSIRFNTP ASELAHWHEV WPGGSRILVD TSPEAADGEA
AGAAFYVKRG GNPRISSVEF ADFCIDGLHF IDDGSGQNDA ENTYRNGKTG IYVGSANDSF
RITGMGLIYL EHGVTVHDAD ALAIDNNFIA ECGNCIELKG MGQASRIANN FVGAGYRGHS
IYAENYAGIL VSSNNVFPRG ASSVHFSGVV RSSVTGNRFH SFYPGMLVFA ANCCENLVSS
NHFLRDREPW APMQKYDNGL DDLFGLLQID GSNNSLIANH ISETIDTKYI KPPEVKPVII
NVVSGSGNYI ASNHIVATAE ISQKDKSDAP NSACFSTQVS ALLSTGNSTL LDVTTVLVQK
ESVRNTVLDS GNDEQVVMDR TVNAFRGTPV PGQ
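
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, assuming the sense-codon assignments of NCBI translation table 11, which match the standard code, and assuming translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in the conventional TCAG ordering. Table 11 differs from the
# standard code only in its permitted start codons, not in these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    # Any incomplete trailing codon is ignored.
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence; both happen to start in frame.
first_row = "ATGGTTGGCG AAAATTGCTA CGACGTTACC AAGTATCCAG CCGGCAATCC TCGTGAGGAT"
last_row = "ACGGTAAATG CATTCAGGGG CACTCCGGTT CCTGGGCAAT AG"

print(translate(first_row))  # MVGENCYDVTKYPAGNPRED
print(translate(last_row))   # TVNAFRGTPVPGQ
```

Both results match the start and end of the protein sequence listed above.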