Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5047 |
Symbol | |
ID | 8007640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 429264 |
End bp | 431114 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644821962 |
Product | hypothetical protein |
Protein accession | YP_002973222 |
Protein GI | 241113387 |
COG category | [S] Function unknown |
COG ID | [COG4289] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.193591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATATG ATCCCGCCCG GGCCAATCCG CTTCTCGGCA ATCCCCTGAA GACACGCGAC GACCTCGCCA AGGCAGTCAC CGATCTCTTC GAGCCGCTGC TGCCGTATTT TTCCGAAGGC GGCGCCCGTG TGCGCCTCGG CGCAGCCGGC GCGATTTTCG ATCGGGCGGC GGCGGATCTG GAGGGATTTG CGCGGCCGCT CTGGGGGATC GTTCCGCTCG TTGCCGGCGG CGGCGCGTTT CCGCATTGGG ACCTCTATCG CCGCGGGCTG GCAAACGGCA CCAATCCTGC TCATCCCGAA TATTGGGGCG ATCTTGCCGA CCGCAATCAG CGGCTGGTCG AGCTCGCCGC GGTCGGCTTC GCCCTGGCGC TCGTGCCCGA GCACATCTGG GAACCGCTCA ACGACGGCGA GAAGAAGACG GTCGCTGCCT ATCTTCTTCG AGCGCGCGAG TTGGAATTCA TCGACAATAA CTGGAAATTC TTCCGTGTGC TCATCGATCT CGGGTTGGAA CGCGTGGGCG TGGCGTTCGA TCACCGGAAA ACCCTCGCCT ATCTCGAAGA ACTGGAGGCC TTCGACCTCG GAGAAGGCTG GTATCGCGAC GGGCCGGTTC GGCGGGTCGA TCATTACATT CCCTTTGCCA TGCATTTCTA CGGAATGATC TATGCCGTCC TGGCCAAGGG CGACGAGGCG CGCAAGGATC GCTTCCGCGA TCGCGCCGAG ATCTTCGCCA GCGATATCCG CCACTGGTTC GGCCCGGATG GGGCAGCCCT TGCCTTCGGC CGCAGCCAGA CCTACCGCTT CGCGGCCGGA GGTTTCTGGG GCGCGCTTGC CTTTGCCGGT GTCGAAGCCC TGCCCTGGGC CGAGATCAAG GGCTATTACA TGCGCCATAT CCGCTGGTGG GCGGCGATGC CGATTGCCGA TCGCGACGGC GTTCTTTCGG TCGGCTATGG CTATCCGAAC CTCTTCATGA GCGAGAGCTA CAACTCTCCC GGCTCGCCCT ATTGGGCGCT GAAATTCTTC CTGCCGCTCG CCCTTCCGGG GGATCATCGC TTCTGGGCGG CCGAGGAGGC GTCGCAGCCG GAATTTCCGG AGCCGGTTGC GTTGAAGCCG GCGGGAATGG TCGCCATGCA CACGCCGGGA AACGTGGTCG TGCTCTCCTC AGGGCAGCAG CACGACAAGA TGCTCGGTGC AAACGAGAAA TATTCGAAAT TCGTCTATTC CACCCGCTAC GCCTTCAACG TCGAAGCCGA CGACCGGAAT TTCTCCGCCG CAAGCTTCGA CGGCATGCTC GGCCTCTCCG ACGACGGCGT CCATTTCCGC ATGCGCGAAA CCCTCGAAGA GGCGTTGATC GCAGGCGACC TGCTCTATTC GCGCTGGCGC CCCTGGAGCG ATGTCACTAT CGAAACCTGG CTGCTTCCTG AAAATCCGTG GCACATCCGC ATTCACCGCA TCGCCACGCC ACGCACACTC AGCACCATCG AGGGCGGTTT TGCGATCGAG CGCGCGGATT TCAATGCCGA CCGCTCCGAT GCAAGGGATG GCCGGGCTGT CTGGTACGGG CAGACCGACG TCAGCGCCAT CGTCGATCTA TCGCCCAATC CAAGGGCCGG CCATGCGATG AGCCCGATCC CGAACACCAA TCTCATCCAC GCCAAGACCC TACTGCCGCA GCTGCGCGGC AACATCGGCG CAGGCACCAT CGTGCTGGTG ACCGCCGCGA TGGCCCTGCC CAGCCGTGAG AACTGGGCAA AAGCGCTCGA TAATCCGCCA GCCCGTCCCC GCCTCGACGA GGTAGAGCGG CTCTTCCGCG AGAAAGGCGT ACAGGTGCCG GCATTCGCCC TCGGGATGTA G
|
Protein sequence | MIYDPARANP LLGNPLKTRD DLAKAVTDLF EPLLPYFSEG GARVRLGAAG AIFDRAAADL EGFARPLWGI VPLVAGGGAF PHWDLYRRGL ANGTNPAHPE YWGDLADRNQ RLVELAAVGF ALALVPEHIW EPLNDGEKKT VAAYLLRARE LEFIDNNWKF FRVLIDLGLE RVGVAFDHRK TLAYLEELEA FDLGEGWYRD GPVRRVDHYI PFAMHFYGMI YAVLAKGDEA RKDRFRDRAE IFASDIRHWF GPDGAALAFG RSQTYRFAAG GFWGALAFAG VEALPWAEIK GYYMRHIRWW AAMPIADRDG VLSVGYGYPN LFMSESYNSP GSPYWALKFF LPLALPGDHR FWAAEEASQP EFPEPVALKP AGMVAMHTPG NVVVLSSGQQ HDKMLGANEK YSKFVYSTRY AFNVEADDRN FSAASFDGML GLSDDGVHFR MRETLEEALI AGDLLYSRWR PWSDVTIETW LLPENPWHIR IHRIATPRTL STIEGGFAIE RADFNADRSD ARDGRAVWYG QTDVSAIVDL SPNPRAGHAM SPIPNTNLIH AKTLLPQLRG NIGAGTIVLV TAAMALPSRE NWAKALDNPP ARPRLDEVER LFREKGVQVP AFALGM
|
| |