Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5526 |
Symbol | |
ID | 6978620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1175173 |
End bp | 1176204 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643394625 |
Product | hypothetical protein |
Protein accession | YP_002279443 |
Protein GI | 209547525 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00568584 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCCGT CGGCGATCTC GAGAAGACAG TTTCTGCTGG GCGGCATCGC GCTTCAGGTC CTGCGGCCGG AACTTGCCGC CGCGCAGCCG ACAAGCACCA GCGATCACGG AATAGGCGGC TCCGGTCTTG CCGTCCAGGG CGGCGGCGAG AATGAAGATC ACGGGATCGG CGGAACCGGC ATTGTCGGGA CCATCCAGGG TTTCGGCAGC ATCATCGTCA ACAATATCCA CATACCGTTC AGCGCGACGA CGCCGGTCGA GATCGATGGA CGGCGCGTAC CCGCGAGTGC GATGAAGGTC GGCCATGTTG CCCGGGTGCT GCTGACAGGA AAGCGCGCCG CCCGCATTAC GATCGTCAGC GAGGTCCAGG GCCGTATCGA CCGGATTGAT AAGACCGGCC TGACCATATT GGCGCAAACC GTCGACACAT CAGGTTTAGC GACGAAGGGC CTGCGGAAAG GCAAGCGGGT TGCCGTGTTC GGCATCCGCA ACCCAGACGG TACGATTATT GCCCGGCGTA TCGAGCCTCG CTCCGTCTCC GACGGCGCCC ATCTTCGCGG CGTTCCCGTC AAGAGCGGCA ACCGCGTCCT GATCGGCGGC CTTTCGCTTG GAAGCACGCA TGGCTACCTC GCCGGCAAAC AAACGCTCGT GCGTCTCAAG GCGATCGCCG ATCGCTTGGT GATTACCCGC GTTCAGACCG AACCGGTGGT GCCGGGCCTC AAACGCGGCA TCGTCAATAT CGAAACCTTC CGACCCACGG ACAGAGGCGA GGCGGGTTCC GGACCTGGTT CTGCGCCTTC CGGCTTTGTC GATATCGGCG TCCGGGATTC GAGCAGAATG ACCGGCTTTC CCGGACCTGG CCCCGACGGT TTGGGTTCCC GCCCGCCGCA CGGCCCTGGG GATGGGCCGC GGCCCGATCA CTCGCCTTTC GGCAGAGGCG GGCCAGGCGG AGATTTTCCC GATCCGGACA GGCGAGGCCC GCCACCCGGC CCTGGCGGCC CGCCTCCTGG GCCGCCCCCG GGCCCGCACT GA
|
Protein sequence | MSPSAISRRQ FLLGGIALQV LRPELAAAQP TSTSDHGIGG SGLAVQGGGE NEDHGIGGTG IVGTIQGFGS IIVNNIHIPF SATTPVEIDG RRVPASAMKV GHVARVLLTG KRAARITIVS EVQGRIDRID KTGLTILAQT VDTSGLATKG LRKGKRVAVF GIRNPDGTII ARRIEPRSVS DGAHLRGVPV KSGNRVLIGG LSLGSTHGYL AGKQTLVRLK AIADRLVITR VQTEPVVPGL KRGIVNIETF RPTDRGEAGS GPGSAPSGFV DIGVRDSSRM TGFPGPGPDG LGSRPPHGPG DGPRPDHSPF GRGGPGGDFP DPDRRGPPPG PGGPPPGPPP GPH
|
| |