Gene Information |
Locus tag | Rleg2_0828 |
Symbol | |
ID | 6979546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 845532 |
End bp | 846806 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643395539 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002280348 |
Protein GI | 209548431 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
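The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 845532
end_bp = 846806

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last being a stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.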
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTTTCTGG AAATCGGAAT TGTGGCGTTT CTCACCATCG TCAATGGTGT GCTCGCCATG TCGGAGTTGG CTGTTGTGTC TTCTCGAACA GCCCGCCTAA AAGTTCTCTC CGACCATGGG AGCAAGGGGG CAGCTCAAGC TATTAAACTT GCCGAAAACC CCGGTCGTTT TCTCTCTACG GTTCAGATCG GCATTACGCT GGTCGGTGTT CTCTCGGGCG CTTTCTCAGG GGCCACCCTC GGCAGCCGCT TGACCGGATG GCTGGAGACA CAGGGGATGT CATCGACGTT AGCCGACGCG ATTGGCGTCG GTTCGGTCGT TGTGGCGATC ACCTATCTTT CTTTGATTGT GGGCGAACTT GTTCCGAAGC AGATCGCATT GCGAGAACCC GAAGCCGTTG CGGCGAAGGT CGCACCGGCC ATGGCGGTGC TTTCAAAAAT CGCGCTGCCG CTCGTGTGGC TTCTGAACGC CTCCGGAAAT CTTGTGCTGA AGCTCTTGGG CCAAGCCGGA AAAGGCGGCG ACAACGTTTC TGACGAAGAG ATCAAAACCG TTCTGGCCGA GGCGCAGTCG GCCGGCGTTA TCGAAAGCGA AGAGTCCGCG ATGATATCAG GCGTCATGCG CCTGGCGGAC CGCACCGCCC GAGCGCTTAT GACGCCCCGA CGCGATGTCG AAATTATCGA TATCGACGAC AGCCTTGATG AAGTTCGCAC TCAATTGCAC CGGACGAAAC GGTCTCGGCT GCCCGTTCGC AAAGGCAGTT CGGACGAGGT GATCGGCATC CTTCCGGTCA AGGATTTTTA CGACTCGATG TCGGAACACG GCAGTGCCGA TATTAAGGCC CTGACGCAGG ACGTCCCGGT GGTTTCAGAC CTTTCGACCG CGATCAATGT GATTGAAGCC ATCAGGAAAT CGCCAGTCCA CATGGTGCTG GTTTTTGACG AGTACGGCCA TTTCGAGGGG ATTGTTTCGT CAGGCGACAT TCTGGAAGCG ATCATGGGGG CTCTGCAGGA GGGACCTGTG GACGAGCAGG CCATCGCGCG CCGCGACGAC GGTTCCTATC TGGTGTCCGG CTGGACGCCG ATCGACGAAT TCGCCGAGTT CCTGAATCTC AAGCTCGATG ACGATCTCGA ATATCAGACT GTGGCCGGCC TGGTGCTGGA AGAGCTGAAA CATCTGCCCG AGTTGGGTGA GAGTTTCACG CGAGGCGGAT GGCGCTTCGA AGTCATCGAT CTCGACGGCA GGCGCGTCGA CAAGATACTT GTGTCTGCGG AGTGA
Protein sequence | MFLEIGIVAF LTIVNGVLAM SELAVVSSRT ARLKVLSDHG SKGAAQAIKL AENPGRFLST VQIGITLVGV LSGAFSGATL GSRLTGWLET QGMSSTLADA IGVGSVVVAI TYLSLIVGEL VPKQIALREP EAVAAKVAPA MAVLSKIALP LVWLLNASGN LVLKLLGQAG KGGDNVSDEE IKTVLAEAQS AGVIESEESA MISGVMRLAD RTARALMTPR RDVEIIDIDD SLDEVRTQLH RTKRSRLPVR KGSSDEVIGI LPVKDFYDSM SEHGSADIKA LTQDVPVVSD LSTAINVIEA IRKSPVHMVL VFDEYGHFEG IVSSGDILEA IMGALQEGPV DEQAIARRDD GSYLVSGWTP IDEFAEFLNL KLDDDLEYQT VAGLVLEELK HLPELGESFT RGGWRFEVID LDGRRVDKIL VSAE
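The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is an accepted alternative start codon that is read as methionine at the initiation position. The sketch below illustrates the relationship between the two sequence fields. It assumes the gene sequence is listed 5' to 3' on the coding strand (as its GTG start and TGA stop suggest, despite the "-" strand annotation), and the function names `translate_cds` and `gc_fraction` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna):
    """Remove the spaces used for 10-base grouping and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a coding sequence, reading the start codon as M."""
    seq = clean(dna)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    protein = ["M"]  # any accepted start codon initiates with methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTTTCTGG AAATCGGAAT TGTGGCGTTT"))
```

Passing the full Gene sequence field to `translate_cds` should reproduce the Protein sequence field if the assumptions above hold, and `gc_fraction` can be used to check the GC content field in the same way.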