Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1236 |
Symbol | |
ID | 8012342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1210611 |
End bp | 1211480 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644823817 |
Product | protein of unknown function DUF519 |
Protein accession | YP_002975067 |
Protein GI | 241203971 |
COG category | [R] General function prediction only |
COG ID | [COG2961] Protein involved in catabolism of external DNA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.187373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000201181 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACTACC GCCACATTTA TCACGCGGGC AACTTTGCCG ATGTGCTGAA ACATGTCGTG CTGACGCGGC TGATCCGCTA CATGCAGAAG AAGGATGGCG GGTTCCGCGT GCTCGACACG CATGCCGGCA TCGGGCTCTA CGACCTCTCC TTGGAAGAAG CGCAGAAAAC CGGCGAATGG CTCGATGGCA TCGGCAAGCT GATGGAGGCA GACCTCGGCC CTCAGGTTTC CGAACTGCTG GAACCCTACC TCTCCGCAAT CCGCGAACTC AATCCGCAGG GCGGCATCCG TTTCTATCCC GGATCGCCGA AACTTGCGCG CATGCTGTTC CGGCCGCAAG ATCGGCTGTC GGCGATGGAA CTGCACCCCG AGGACTATGT CAGGCTGCAC CGGCTGTTCG AGGGCGATCA CCATGCCCGC ATCACCGAGC TTGACGGCTG GCTGGCGCTC GGCGCGCATC TGCCGCCGAA GGAGAAGCGC GGCATCGTCC TCGTCGATCC GCCCTTCGAG GAAGAGGACG AATATCAGCG GCTGGCCGAG GGACTGGAAA AAGCCTACCG CCGCTTTCCC GGCGGCACCT ATTGCCTGTG GTATCCGCTG AAAAAGGGCG CGCCGATCAA GGAATTCCAC GAGACGCTGC AGGCGCTCGA CATCCCGAAA ATGCTCTGCG CCGAACTCGC CGTTCGCAGC GACCGCGGCA TTACGGGACT GACGGGCTCA GGCCTCGTCA TCGTCAACCC GCCCTTCACG CTGAAGGATG AGTTGCACCA ATTGCTGCCC GCATTGAAGG ATCATCTGGC GCAAGACCGT TTCGCCTCTC ACCGCGCCTT CTGGCTGCGT GGCGAGAACA AGGCGGTCAA GGACGATTGA
|
Protein sequence | MNYRHIYHAG NFADVLKHVV LTRLIRYMQK KDGGFRVLDT HAGIGLYDLS LEEAQKTGEW LDGIGKLMEA DLGPQVSELL EPYLSAIREL NPQGGIRFYP GSPKLARMLF RPQDRLSAME LHPEDYVRLH RLFEGDHHAR ITELDGWLAL GAHLPPKEKR GIVLVDPPFE EEDEYQRLAE GLEKAYRRFP GGTYCLWYPL KKGAPIKEFH ETLQALDIPK MLCAELAVRS DRGITGLTGS GLVIVNPPFT LKDELHQLLP ALKDHLAQDR FASHRAFWLR GENKAVKDD
|
| |