Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1681 |
Symbol | |
ID | 8012750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1676029 |
End bp | 1679268 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644824268 |
Product | Sporulation domain protein |
Protein accession | YP_002975507 |
Protein GI | 241204411 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.393299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATA AACAACTTGC GTATGATACG CGCGGAAAAA ACGACCTGTT TTCCGACGAT GATCCGTTGG CTGAACTTGC CCGGATCGTC GGCTTCGAGC CGCGTGTTGC GGCGAATACG GTGACTGAGA CCGCACGCCG CGAGCCCGCC CTCGATCTCG AAGACGAGCT TGGGCGCGAG TTCGACCGCT ACGATTCGCC ACGTCCGCTC GCAGAGCTCG ATCGGCCGGC CGAGCCGATC TCTGATGATC TCACGCCTGA AGATTACGTC GAGCCCGTTC TCGATGCTTC TCCCACTGCC GAACATGCGG AGGCTCCTGA GCCGGTTTCG GCTTCCGTCG CCGCCTTTGA GGCTGAAGAA GCAGCTTTGC CCGCCGCAGG CAATGGCGAC GCATCTGTCG CCGACTGGGC GGAGCAGCTT TCTCCCGAGC CCGATGCGTC GGTGCAATCG GCCTTCGGCG GTGCGCGCGA CCTGATCGAG GAGCTTGAGC TGTCGATCGG CGCAGCACCC GTCTCTTCGC TGGCGCAGCC CACCAAGGCG CCGCAATGGT CGGCTGCCAG CATCAGGCTG CCGCTTGCCA ATTTCCATGC TCCGAAGCGC GAGGAACCGG TTGTTTTGCC GGAACCTGTG GCCGAAACGG TCGCGGCACC GGTTGCTGAA GCACCGTCCG CCGATCTTCC TGTAGTCGAG CCCCAATCTG TCATCGAGCC GCCGGCGCTT GTCGCCGCTG TCGAGCCTTC CGAGGAATTC GAATCCGCTT CGCCGTCACT TGGCTTTCCC GCCGAGCTCG ATCGCCATGA TGAGGTGATC GCGCCGGAAG AGACCGCCGA GGCCGAAGAA TTCGTCGAAG TCGAGGAAGA GCTGGAGGAT TTCGGGTCCG ACGCCGGTTT CGATCTCATC GCCGCCGCCG TCGAAGGCGA GATCCAGGCC GATGCCGCGC TGACCGAGGT CGTGCCTGAT GTTCCGCACA CCGCCGGCAC GTTCGATCTC GACGATCTGC TCGCCGACGT CTCGCGTTAT CCGGTACCGC AACGTGCCAA TCCGGCGCCC GTCTCGCCGC AGCCCGCATC GATCGAGGCA GCGCCCGTTC CGGCCGCCCC TGTGGCCGCC GAACCTGTTC AGTCCGAGGT GATCGCGCCC CCGCCGCTCG CCGCGGCACC CGTCCGGCCT GCTCCGGTCG AGCCGGCAAC GGTCTATGCC GAAGCCGCAA GACCGGTCGC GCCGCAGCTT GCCGAGGTCG TTACGCCCCA GCCTGCCGCA ACGGCATATT CGCCGGCGCC GCAGCCGGTA CCGGAAGCCG ACGACCCCTT CGCCGGCCAT GATTTCGAAC TGGATCTTGC CGGCATCGAG CTGGAACTCA CCGATCTCGA TTTCTCCGAG CCGTCCGAGC CGGCACCGCA GCCTGAGCCG CCAGCCCCGG CTCCCCAGCA GGCCGCGGCC GTCGCTCCTC GGTCCGCCGC TCCAGTGTTT GCTCCCGAGC CACCGGCTCC TGCTTTTGAG CAGGCCGCTG CCGCTCCTGC ACGGTCCGCT CCGGCCTTCG TTCCCGAACC GCAGGCTCCA GCTCCGGCTT TCAACTGGGC GCCTGTCTCG GACTCGACCG AAGACCTGCC ATTCGATCCG GCGATGATCT CGGATCCGGA GGATCGTCCC GAGGCCGTCG ACGACATGCA CGTGCCGGCG CTGCCGCCGG TCGAGCAGCC CGCGCCGGTC GCGAAATCTG CGGATTTCGA TTTCGATCTC GACGCCGAGA TCGCCAGCTT CTTTGAACCG GCCAAACCGC GGCAAACGCC GGCGCCGGTC AGGGATACCG CCGCCGCTGC CGCAAAGCCG GTCAAGCCCA CCATCGCCGA TGGCCTCGAT GATTTCGAAC GGGCGCTGGA GGAGGATTTC CGCCGCAGCG TGCGCGAGCC GGTCGAGCGC CGCGAGACCT CCGAGGTCCG TATCGAATCG GCAAGCCAGG CCGCTGATTT CAGCCGCGCC CGGTCGATGC GCCAGCTGCT TGCCGGGGCC GTCGTGCTCG TGGTCTTCGC CGGCGTCGGT TATGGCGTCT ATTCCTCCGT CTGGAACGGC GAGGGCCTCG GCATTGTCGC GTCCGGCGAG CCGCGCGTGA TCACCGCCGA CAAAGAGCCG GTCAAGGTCG TTCCGGAAAA TCCCGGCGGC AAGACCGTGC CCAACCAGGA CAAGGCGGTC TACGACCGCG TTGCGGGTTC TGCCGAAGAG CCGAAGCAGA AGGCGCTCGT TTCTTCCGAT GAGGCGCCCG TCGATGTCGT CCAGCGCACG CTGACACCGG AAGCGCTGCC CGAGGACGAC GAGAACGCCA ACGCTGACGA TCAGGTCACG CCGACTGCGG TCGGTGAGAC GGAGGATCCG CGTCTGCTGC CAACCCAGGA CAACGCCGAC AACGCTCCGG CAACCGACGC CGACAAGACG CCGTCCGTTT CTCCGCGCAA GGTTCGCACA ATGATCGTCA AGCCTGACGG TACGTTGGTT GCCCGCGAGG AACCAGCGCC CGTCGACCAG CCGACACCGT CTGCCCAGGC GACGCAGTCT GCCCAGGCGA CCCAGTCTGC CCAGGCGACG CCGCCGGCTC AGCCGCCGTT CACGGCTCCG TCGACCCCGC CCGTGCCGCC CGTCGGTGGA ACGGCCGCAA GCTTCCCAGC AAGAGCCGAG GTTGCTTCCG CCGATGCGCG TTCCGCAGCC CCCGTCGAAA CCGCGCCGGT ACAGCCGCCG CTCGCCGGCA GTGCTGACGC ACAGGCCGCA AATCCCGCTC AGGTCGCTCC GCCGGTGCGT CCGGTCAAGA CCTCGGCCAC TGCCGATACC GCTCCAATCC CGACCGCCCG TCCGGTTGAC CAGCCCGTCA ACGTCGTAGG CACAGTGACC GAGAAGGGCA ATGTCCGCCC GCCTGCCCAG CAGCCGAAGC CCACACAGCA GCCGAAGACG ACTGAAGTCG CGGCCGCAGC ACCTGTCGCC GCAAAGCCGC AGCAGGCCGC ATCCGCCGGC GGCTACGGCA TCCAGATCGC CTCGCTGCCT TCGGAAGACG AGGCGACCAA ATCCTATGCC AACCTGTCGA AGAAATTCGC CAGCGTGCTT GGCGGCCGCA GCCACGAGAT CCGCAGGGCC GATATCGCCG GCAAGGGCAC TTTCTACCGT GTCCGCATTC CGGCCGGTTC CAAGGACGAG GCCGCAGCAC TCTGCGAACA GTATCGCGCG GCGGGCGGAA GCTGCTTGAT CTCCAAGTAA
|
Protein sequence | MADKQLAYDT RGKNDLFSDD DPLAELARIV GFEPRVAANT VTETARREPA LDLEDELGRE FDRYDSPRPL AELDRPAEPI SDDLTPEDYV EPVLDASPTA EHAEAPEPVS ASVAAFEAEE AALPAAGNGD ASVADWAEQL SPEPDASVQS AFGGARDLIE ELELSIGAAP VSSLAQPTKA PQWSAASIRL PLANFHAPKR EEPVVLPEPV AETVAAPVAE APSADLPVVE PQSVIEPPAL VAAVEPSEEF ESASPSLGFP AELDRHDEVI APEETAEAEE FVEVEEELED FGSDAGFDLI AAAVEGEIQA DAALTEVVPD VPHTAGTFDL DDLLADVSRY PVPQRANPAP VSPQPASIEA APVPAAPVAA EPVQSEVIAP PPLAAAPVRP APVEPATVYA EAARPVAPQL AEVVTPQPAA TAYSPAPQPV PEADDPFAGH DFELDLAGIE LELTDLDFSE PSEPAPQPEP PAPAPQQAAA VAPRSAAPVF APEPPAPAFE QAAAAPARSA PAFVPEPQAP APAFNWAPVS DSTEDLPFDP AMISDPEDRP EAVDDMHVPA LPPVEQPAPV AKSADFDFDL DAEIASFFEP AKPRQTPAPV RDTAAAAAKP VKPTIADGLD DFERALEEDF RRSVREPVER RETSEVRIES ASQAADFSRA RSMRQLLAGA VVLVVFAGVG YGVYSSVWNG EGLGIVASGE PRVITADKEP VKVVPENPGG KTVPNQDKAV YDRVAGSAEE PKQKALVSSD EAPVDVVQRT LTPEALPEDD ENANADDQVT PTAVGETEDP RLLPTQDNAD NAPATDADKT PSVSPRKVRT MIVKPDGTLV AREEPAPVDQ PTPSAQATQS AQATQSAQAT PPAQPPFTAP STPPVPPVGG TAASFPARAE VASADARSAA PVETAPVQPP LAGSADAQAA NPAQVAPPVR PVKTSATADT APIPTARPVD QPVNVVGTVT EKGNVRPPAQ QPKPTQQPKT TEVAAAAPVA AKPQQAASAG GYGIQIASLP SEDEATKSYA NLSKKFASVL GGRSHEIRRA DIAGKGTFYR VRIPAGSKDE AAALCEQYRA AGGSCLISK
|
| |