Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3895 |
Symbol | |
ID | 8014715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3962866 |
End bp | 3964041 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826465 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_002977677 |
Protein GI | 241206581 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00307533 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGCGC TCAACGACAG GATTCCGGTC ACCATCCTGA CCGGCTTTCT CGGCGCCGGC AAATCGACGC TCCTCAACCG CATCCTCAAA GATCCTGTCA TGAAGGATGC GGCCGTCATC ATCAACGAAT TCGGCGATGT CGGCATCGAT CATCTTCTGG TCGAAAGTTC CGGCGATTCG ATCATCGAAC TGTCCGATGG ATGCCTCTGC TGCACCGTGC GCGGCGAGCT TGTCGATACG CTGGCCAACC TGATGGACGC GGTGCAGACG GGCCGCGTCA AGCCGGTCAA GCGCGTCGTC ATCGAGACGA CGGGCCTTGC CGATCCTTCG CCGGTCATGC AGGCGATCAT GGGCAACCCC GTCATCGCCA CGAATTTCGA GCTGGATGGC GTCGTCACTG TCGTCGACGC AGTGAATGGG CTGCAGACGC TCGACAATCA TGAGGAAGCG CGCAAGCAGG CGGCGGTCGC CGACCGGCTG ATCGTTTCGA AAAAATCGAT GGCCGAGGTG ACGGGTGGGC TGGAAAAGCG CCTGCGGGCG CTCAATCCGC GTGCCGTGAT GATGGATGCC GACAGCGCCG AGGCCGGCAG TGCCGCAGTG CTGGTCAACG GGCTCTACGA TCCGGCGACG AAGATTGCCG ATGTCGGCCG CTGGCTGCAG GACGAGGATG CACACGAGGC GCATCACAAT CACGACCATG ATCGTGATGG ACATCACGAT CATCATCATG ATGGCGATCA CCATGGTCCC CACCATCATG ATCATGCTGC GGATCAGGAC CCGCACGACG TCAATCGTCA CGATGCCTCG ATCCGCTCTT TCTCGATCAT CGAGGAGAAG CCGATCGATC CGATGGCGCT CGAGATGTTT ATCGATCTTC TGCGTTCGGC TCATGGCGAG AAGCTGCTGC GGATGAAGGC GATCGTGTCC GTTTCCGACC GTCCGGATCG GCCGCTGGTG CTGCATGGGG TGCAGAGCAT CTTCCATCCG CCGGTGCGGC TCCCCGCCTG GCCGGGGGAA GACCGGCGCA CGCGCATGGT GCTGATCACC CGGGATCTGC CGGAAGCCTT CGTGAAGGAT CTCTTCGACG CCTTCCTCGG CAAGCCGCGC ATCGATATGC CCGATCGTGT AGCGCTGTCG GACAATCCGC TCGCTATCCC CGGCCTTAGA ATTTAG
|
Protein sequence | MSALNDRIPV TILTGFLGAG KSTLLNRILK DPVMKDAAVI INEFGDVGID HLLVESSGDS IIELSDGCLC CTVRGELVDT LANLMDAVQT GRVKPVKRVV IETTGLADPS PVMQAIMGNP VIATNFELDG VVTVVDAVNG LQTLDNHEEA RKQAAVADRL IVSKKSMAEV TGGLEKRLRA LNPRAVMMDA DSAEAGSAAV LVNGLYDPAT KIADVGRWLQ DEDAHEAHHN HDHDRDGHHD HHHDGDHHGP HHHDHAADQD PHDVNRHDAS IRSFSIIEEK PIDPMALEMF IDLLRSAHGE KLLRMKAIVS VSDRPDRPLV LHGVQSIFHP PVRLPAWPGE DRRTRMVLIT RDLPEAFVKD LFDAFLGKPR IDMPDRVALS DNPLAIPGLR I
|
| |