Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2475 |
Symbol | |
ID | 6981216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2507289 |
End bp | 2508326 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643397189 |
Product | integrase family protein |
Protein accession | YP_002281975 |
Protein GI | 209550058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGGAA AGTTGGTCGG CGTGCATAAG GTCAATGTGA AATTGGCCGA TGGGACCATC GAGACCTATT ACTATGCCTG GCGCGGCAAG GGAGCCCCCA GGATGCACGC GAAGCCAGGG ACGAAGGCTT TCACGCAGGA ATATGTGCGC CTCACTCGCG AGCGCGAAAA AGCTTCCGAA GACGGGACGA TCGGATCTCT GATCGATGAG TTCCGCAAGA CGGCCGGCTA TATGAAGCTG GCGGCGTCCA CCAGGCGCGA CTATGAGCGT CAGTTCGCCA TGATCCGCCT GAAGTTCGAG GGCTTCCCGA TTAAAGCGAT CGAGGCGCGC GGCAGCCGTC GCATCTTCCT CAACTGGCGC GACACCATGA GGGATTCCCC CAGATCGGCA GATATGCATA TCGCGTTGCT ATCGCGGCTC TTCTCATGGG CGAAGGGGAA CGAAGTCATC CTGCGAAACC CGCTTGAGGA AGTCGAGCGC CTGCATTCCG GCACGCGGAA AGATATCATC TGGACTGACG ATCAACTCGC GAAGCTGCTG ACGGAGGGCG TGCCCCACCT CCGCAACGTC GCGCTCGTCG CCCTCTGGAC GATGCAGCGC CAGGCGGACA TCCTCAGCAT GCCGACGTTG GCTTTCGACG GCGAGCGCGT CTCGATCAGG CAGGGGAAAA CGGGAGCCCG TGTGCGTGTG ATGGCGGCTC CGGATATCTT GCCGGTTCTC AAGGACGCCA AAACGACGTC GCGCCAGCGA GTGCTGGTGA ACTCGTTCGG GCAGAACTGG ACATCGAGCG GATTCCGGGC TTCGTGGCGC AAGGAGATGG CGCGTCTCGG CATCAAGGGG GTAACTTTCC ACGATCTGCG CGGGACGGCG ATCACCTTTG CCTATGCCAA TCTCGACAAG GCGCACGATG AGAAGATCAA GCTTATTTCG GAGATTTCAG GCCATTCGCA GGACGACGCC GAATCAATCA TCCGCAAACA TTATCTGGCC GGACAAGACG TGATCGACGC GATCAGCCGG GGAACAAAGA AGGCATAA
|
Protein sequence | MEGKLVGVHK VNVKLADGTI ETYYYAWRGK GAPRMHAKPG TKAFTQEYVR LTREREKASE DGTIGSLIDE FRKTAGYMKL AASTRRDYER QFAMIRLKFE GFPIKAIEAR GSRRIFLNWR DTMRDSPRSA DMHIALLSRL FSWAKGNEVI LRNPLEEVER LHSGTRKDII WTDDQLAKLL TEGVPHLRNV ALVALWTMQR QADILSMPTL AFDGERVSIR QGKTGARVRV MAAPDILPVL KDAKTTSRQR VLVNSFGQNW TSSGFRASWR KEMARLGIKG VTFHDLRGTA ITFAYANLDK AHDEKIKLIS EISGHSQDDA ESIIRKHYLA GQDVIDAISR GTKKA
|
| |