Gene Rleg_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1681 
Symbol 
ID8012750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1676029 
End bp1679268 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content67% 
IMG OID644824268 
ProductSporulation domain protein 
Protein accessionYP_002975507 
Protein GI241204411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.393299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATA AACAACTTGC GTATGATACG CGCGGAAAAA ACGACCTGTT TTCCGACGAT 
GATCCGTTGG CTGAACTTGC CCGGATCGTC GGCTTCGAGC CGCGTGTTGC GGCGAATACG
GTGACTGAGA CCGCACGCCG CGAGCCCGCC CTCGATCTCG AAGACGAGCT TGGGCGCGAG
TTCGACCGCT ACGATTCGCC ACGTCCGCTC GCAGAGCTCG ATCGGCCGGC CGAGCCGATC
TCTGATGATC TCACGCCTGA AGATTACGTC GAGCCCGTTC TCGATGCTTC TCCCACTGCC
GAACATGCGG AGGCTCCTGA GCCGGTTTCG GCTTCCGTCG CCGCCTTTGA GGCTGAAGAA
GCAGCTTTGC CCGCCGCAGG CAATGGCGAC GCATCTGTCG CCGACTGGGC GGAGCAGCTT
TCTCCCGAGC CCGATGCGTC GGTGCAATCG GCCTTCGGCG GTGCGCGCGA CCTGATCGAG
GAGCTTGAGC TGTCGATCGG CGCAGCACCC GTCTCTTCGC TGGCGCAGCC CACCAAGGCG
CCGCAATGGT CGGCTGCCAG CATCAGGCTG CCGCTTGCCA ATTTCCATGC TCCGAAGCGC
GAGGAACCGG TTGTTTTGCC GGAACCTGTG GCCGAAACGG TCGCGGCACC GGTTGCTGAA
GCACCGTCCG CCGATCTTCC TGTAGTCGAG CCCCAATCTG TCATCGAGCC GCCGGCGCTT
GTCGCCGCTG TCGAGCCTTC CGAGGAATTC GAATCCGCTT CGCCGTCACT TGGCTTTCCC
GCCGAGCTCG ATCGCCATGA TGAGGTGATC GCGCCGGAAG AGACCGCCGA GGCCGAAGAA
TTCGTCGAAG TCGAGGAAGA GCTGGAGGAT TTCGGGTCCG ACGCCGGTTT CGATCTCATC
GCCGCCGCCG TCGAAGGCGA GATCCAGGCC GATGCCGCGC TGACCGAGGT CGTGCCTGAT
GTTCCGCACA CCGCCGGCAC GTTCGATCTC GACGATCTGC TCGCCGACGT CTCGCGTTAT
CCGGTACCGC AACGTGCCAA TCCGGCGCCC GTCTCGCCGC AGCCCGCATC GATCGAGGCA
GCGCCCGTTC CGGCCGCCCC TGTGGCCGCC GAACCTGTTC AGTCCGAGGT GATCGCGCCC
CCGCCGCTCG CCGCGGCACC CGTCCGGCCT GCTCCGGTCG AGCCGGCAAC GGTCTATGCC
GAAGCCGCAA GACCGGTCGC GCCGCAGCTT GCCGAGGTCG TTACGCCCCA GCCTGCCGCA
ACGGCATATT CGCCGGCGCC GCAGCCGGTA CCGGAAGCCG ACGACCCCTT CGCCGGCCAT
GATTTCGAAC TGGATCTTGC CGGCATCGAG CTGGAACTCA CCGATCTCGA TTTCTCCGAG
CCGTCCGAGC CGGCACCGCA GCCTGAGCCG CCAGCCCCGG CTCCCCAGCA GGCCGCGGCC
GTCGCTCCTC GGTCCGCCGC TCCAGTGTTT GCTCCCGAGC CACCGGCTCC TGCTTTTGAG
CAGGCCGCTG CCGCTCCTGC ACGGTCCGCT CCGGCCTTCG TTCCCGAACC GCAGGCTCCA
GCTCCGGCTT TCAACTGGGC GCCTGTCTCG GACTCGACCG AAGACCTGCC ATTCGATCCG
GCGATGATCT CGGATCCGGA GGATCGTCCC GAGGCCGTCG ACGACATGCA CGTGCCGGCG
CTGCCGCCGG TCGAGCAGCC CGCGCCGGTC GCGAAATCTG CGGATTTCGA TTTCGATCTC
GACGCCGAGA TCGCCAGCTT CTTTGAACCG GCCAAACCGC GGCAAACGCC GGCGCCGGTC
AGGGATACCG CCGCCGCTGC CGCAAAGCCG GTCAAGCCCA CCATCGCCGA TGGCCTCGAT
GATTTCGAAC GGGCGCTGGA GGAGGATTTC CGCCGCAGCG TGCGCGAGCC GGTCGAGCGC
CGCGAGACCT CCGAGGTCCG TATCGAATCG GCAAGCCAGG CCGCTGATTT CAGCCGCGCC
CGGTCGATGC GCCAGCTGCT TGCCGGGGCC GTCGTGCTCG TGGTCTTCGC CGGCGTCGGT
TATGGCGTCT ATTCCTCCGT CTGGAACGGC GAGGGCCTCG GCATTGTCGC GTCCGGCGAG
CCGCGCGTGA TCACCGCCGA CAAAGAGCCG GTCAAGGTCG TTCCGGAAAA TCCCGGCGGC
AAGACCGTGC CCAACCAGGA CAAGGCGGTC TACGACCGCG TTGCGGGTTC TGCCGAAGAG
CCGAAGCAGA AGGCGCTCGT TTCTTCCGAT GAGGCGCCCG TCGATGTCGT CCAGCGCACG
CTGACACCGG AAGCGCTGCC CGAGGACGAC GAGAACGCCA ACGCTGACGA TCAGGTCACG
CCGACTGCGG TCGGTGAGAC GGAGGATCCG CGTCTGCTGC CAACCCAGGA CAACGCCGAC
AACGCTCCGG CAACCGACGC CGACAAGACG CCGTCCGTTT CTCCGCGCAA GGTTCGCACA
ATGATCGTCA AGCCTGACGG TACGTTGGTT GCCCGCGAGG AACCAGCGCC CGTCGACCAG
CCGACACCGT CTGCCCAGGC GACGCAGTCT GCCCAGGCGA CCCAGTCTGC CCAGGCGACG
CCGCCGGCTC AGCCGCCGTT CACGGCTCCG TCGACCCCGC CCGTGCCGCC CGTCGGTGGA
ACGGCCGCAA GCTTCCCAGC AAGAGCCGAG GTTGCTTCCG CCGATGCGCG TTCCGCAGCC
CCCGTCGAAA CCGCGCCGGT ACAGCCGCCG CTCGCCGGCA GTGCTGACGC ACAGGCCGCA
AATCCCGCTC AGGTCGCTCC GCCGGTGCGT CCGGTCAAGA CCTCGGCCAC TGCCGATACC
GCTCCAATCC CGACCGCCCG TCCGGTTGAC CAGCCCGTCA ACGTCGTAGG CACAGTGACC
GAGAAGGGCA ATGTCCGCCC GCCTGCCCAG CAGCCGAAGC CCACACAGCA GCCGAAGACG
ACTGAAGTCG CGGCCGCAGC ACCTGTCGCC GCAAAGCCGC AGCAGGCCGC ATCCGCCGGC
GGCTACGGCA TCCAGATCGC CTCGCTGCCT TCGGAAGACG AGGCGACCAA ATCCTATGCC
AACCTGTCGA AGAAATTCGC CAGCGTGCTT GGCGGCCGCA GCCACGAGAT CCGCAGGGCC
GATATCGCCG GCAAGGGCAC TTTCTACCGT GTCCGCATTC CGGCCGGTTC CAAGGACGAG
GCCGCAGCAC TCTGCGAACA GTATCGCGCG GCGGGCGGAA GCTGCTTGAT CTCCAAGTAA
 
Protein sequence
MADKQLAYDT RGKNDLFSDD DPLAELARIV GFEPRVAANT VTETARREPA LDLEDELGRE 
FDRYDSPRPL AELDRPAEPI SDDLTPEDYV EPVLDASPTA EHAEAPEPVS ASVAAFEAEE
AALPAAGNGD ASVADWAEQL SPEPDASVQS AFGGARDLIE ELELSIGAAP VSSLAQPTKA
PQWSAASIRL PLANFHAPKR EEPVVLPEPV AETVAAPVAE APSADLPVVE PQSVIEPPAL
VAAVEPSEEF ESASPSLGFP AELDRHDEVI APEETAEAEE FVEVEEELED FGSDAGFDLI
AAAVEGEIQA DAALTEVVPD VPHTAGTFDL DDLLADVSRY PVPQRANPAP VSPQPASIEA
APVPAAPVAA EPVQSEVIAP PPLAAAPVRP APVEPATVYA EAARPVAPQL AEVVTPQPAA
TAYSPAPQPV PEADDPFAGH DFELDLAGIE LELTDLDFSE PSEPAPQPEP PAPAPQQAAA
VAPRSAAPVF APEPPAPAFE QAAAAPARSA PAFVPEPQAP APAFNWAPVS DSTEDLPFDP
AMISDPEDRP EAVDDMHVPA LPPVEQPAPV AKSADFDFDL DAEIASFFEP AKPRQTPAPV
RDTAAAAAKP VKPTIADGLD DFERALEEDF RRSVREPVER RETSEVRIES ASQAADFSRA
RSMRQLLAGA VVLVVFAGVG YGVYSSVWNG EGLGIVASGE PRVITADKEP VKVVPENPGG
KTVPNQDKAV YDRVAGSAEE PKQKALVSSD EAPVDVVQRT LTPEALPEDD ENANADDQVT
PTAVGETEDP RLLPTQDNAD NAPATDADKT PSVSPRKVRT MIVKPDGTLV AREEPAPVDQ
PTPSAQATQS AQATQSAQAT PPAQPPFTAP STPPVPPVGG TAASFPARAE VASADARSAA
PVETAPVQPP LAGSADAQAA NPAQVAPPVR PVKTSATADT APIPTARPVD QPVNVVGTVT
EKGNVRPPAQ QPKPTQQPKT TEVAAAAPVA AKPQQAASAG GYGIQIASLP SEDEATKSYA
NLSKKFASVL GGRSHEIRRA DIAGKGTFYR VRIPAGSKDE AAALCEQYRA AGGSCLISK