Gene Information |
Locus tag | Rleg_3870 |
Symbol | |
ID | 8014693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3941157 |
End bp | 3942173 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826440 |
Product | hypothetical protein |
Protein accession | YP_002977652 |
Protein GI | 241206556 |
COG category | [S] Function unknown |
COG ID | [COG4093] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
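The coordinate and length fields in this record are mutually consistent, which a short check makes explicit. The sketch below assumes 1-based inclusive coordinates (which the listed Start bp, End bp, and Gene Length values imply) and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3941157, 3942173)
print(length)                     # 1017
print(protein_length_aa(length))  # 338
```

Both results match the Gene Length (1017 bp) and Protein Length (338 aa) fields listed in the record.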
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.13922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAGCGT CAAGCCAATC CGGCAGCAGC CAATCCGGCA GCGGTAAGAA ATTCTGGTTG CTGGGTGGAG GCGTCCTCCT GGTGATTGCG CTTTATACCG GCGGCTGGTT CTATGCGGCC TCGGCGCTGA AGAACACAGT GCTGAAAGCG ATCGCGCCGC GCGACCAGGC AGGCGTCAGC GGCGAATGCT CCGATATCGA ATTTCGCGGC TATCCCTTCC GTATCGGCCT GTTCTGCTCC AAGATCGACG TCGACGACAA TGTCAACGGC GTCTCCGCCA CCTTCGGCGC GCTGCGCTCG GCAGCACAGG TCTACGCGCC CGGCAATATC GTCTGGGAAC TCGATTCTCC GGCAGAGATC CGCACCAGCA ACGGCCTTTC GATCTCGGCC CAATGGACGA ACCTGCAGGC GAGCCTTACG ACGAGGCTGC AGGGCATCGA CCACAGCTCG ACCGTCATCG AGGGTCTGAA GGCGATGGCC TTCTCCTCCT ACACCGGCCA GACCATGAGC TTCGATGCCG CTCGCACCGA AATCCACCTG CGCCAGAATG GTGCTGATCT CGACGGTGCG ATTTCCGTGC AGGACGCCAA CGCGGCGATC AAGGACTGGC CGCAGATCTT CCCGAAATTC TCGGCGAGCA TCGATCTGAC CGTCGCCGGC AAGGCCGGCC TGATCGACGG CAGTGACCGG AACGGCCTCA ATGGCGCCAC CGGCGACCTG CGCCGCATCG TCGCCGACAT CGGTGACGGC AAGGTGATGA CGCTCACCGG CCCCTTCTCC TTCGACGAGC AGGGCTTGCT TTCGGGAAAA TTCAAACTGG AGATCGAACA ACTCGGCCCT TGGGGGGACA GCCTGAAACA GGCCTTTCCG GATATCGCCT CGACCGTCAA CACGGCGACG AAGCTGCTGA AATCGCTTGC CGGCGGCGGC GACAAGGTCT CCGTCGATCT CGTCGTCAAT CGCGGCAATG CCACCGTCAG CGGTTTCATC CCGCTCGGCC GCATTCCACC GATCTGA
Protein sequence | MAASSQSGSS QSGSGKKFWL LGGGVLLVIA LYTGGWFYAA SALKNTVLKA IAPRDQAGVS GECSDIEFRG YPFRIGLFCS KIDVDDNVNG VSATFGALRS AAQVYAPGNI VWELDSPAEI RTSNGLSISA QWTNLQASLT TRLQGIDHSS TVIEGLKAMA FSSYTGQTMS FDAARTEIHL RQNGADLDGA ISVQDANAAI KDWPQIFPKF SASIDLTVAG KAGLIDGSDR NGLNGATGDL RRIVADIGDG KVMTLTGPFS FDEQGLLSGK FKLEIEQLGP WGDSLKQAFP DIASTVNTAT KLLKSLAGGG DKVSVDLVVN RGNATVSGFI PLGRIPPI
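Both sequences are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch using only the Python standard library. The helper names are hypothetical, and the GC calculation is a plain count of G and C over all bases, which may differ in rounding from the value the database reports.

```python
def join_blocks(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = join_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# The first two display blocks of the gene sequence, as an example input.
first_blocks = "ATGGCAGCGT CAAGCCAATC"
print(len(join_blocks(first_blocks)))  # 20
print(gc_percent(first_blocks))        # 55.0
```

Passing the full Gene sequence string through `join_blocks` should give a length equal to the Gene Length field, which is a quick way to confirm the sequence was copied without loss.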