Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4754 |
Symbol | |
ID | 8007007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 122779 |
End bp | 124641 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644821684 |
Product | hypothetical protein |
Protein accession | YP_002972944 |
Protein GI | 241113109 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00204432 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.330996 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTCG ATGCGTTCCA GCTCTACGGC ACCCGCCTCG TTGAAACGCC GCCGGTTCGG CTGAGAGCCG GAAAACTGGA AGCCGATCTC GCCAATGGCA ACCTCCGCAC CATCCGCTAC GATGGGACCG AGGTGCTGCG AGCGATCTCC TACCTCGTTC GCGACCCGGA CTGGGGCACC TACAGCCCTG TAATTGTTGA TCTCCGCATC GAGCAGAGTG ACAATCGTTT CGCGGTCGCC TATCGAGCCC GCTGCGAGGG ACCTGATGAC ACGAGGCTTG TCATTGACGT TCGCATCACC GGAAGCGCGG ACCGGCTCGA CTTCGAGGCC GAAGCCATCA CAGAGACCGG CTTCGAGACC AATCGCTGCG GCTTCTGCAT CCTGCATCCG ATCGTCGGCG TGGCGGGTTC ACCGGCGACG GTCGAACATG TCGACGGCCG GAAAGTGGCA ACCCGGTTTC CCGATGTCAT CGAGCCCTGG CAGCCTTTCA AGGACATGCG CGCCATCACT CATGCGATCA TGCCTGACGT TCAGGCGGAA TGCCGGATGG AGGGCGACAC CTTTGAAATG GAAGACCAAA GGAACTGGTC GGACGCATCC TATAAGGCAT ATGTCAGGCC GCTCGCCCTG CCCTGGCCAT ACCAGATTGC CGCCAATCAG CCCGTTCGGC AAAAGACGTC GCTTGTTATC AGGGATATCG GCGGTTCGAC ACGGCATCCT CCAGCTGCGG CGTCAGGCGG CGCCATAAAA CTCGAACTCG GGGCGCGAAC CGGCACCATG CCTGATATCG GCGTGATCGT TACGCCCGAG GAAGCCGATG CGACACTGTC GGCAAAGTCC GTGCTGTCGG AAATCGCTCC CCAGGAACTG CTCTTCCATT TCGACCCCAG TGCAGGACAC GGCGTCGACG CGCTCACGCA GTTCGCCATG CTCGCCGCGG CCCATCTCGG CCGCTCGACG CTGGAGATCG CCCTTCCCTG CACATCGTCG CCGTCAAGCG AGGTGGCCGA AATCGCCCAC CAGATGCGGC TGGCGGAATT CAGGCCGGAT GCGATCATGA TCTCGCCTTC GGTTGACCGG CAGTCGACGC CGCCCGGCAG CACATGGCCG GAATGCCCGC CTTTGGATGA AGTCTATACC GCCGCTCGCG CCGCCTTTCC CGGCATTCGC ATCGGCGGGG GTATGCTGAG CTATTTCACC GAGCTCAACC GGAAGCGCGT CCCGGATGGA CAGCTCGACT TCGTCAGCCA CTGCACCAAT CCGATCGTGC ATGCCGCCGA CGATCTTAGC GTCATGCAGA CATTGGAAGC GCTGCCCTTC ATCACACGGT CGGTGCGTGC GATCTACGGT GACAGACCCT ACCGGATCGG CCCGTCGACG ATCCCGATGC GACAGAATCC CTATGGCAGC CGCACGATGG ATAATCCGTC GGGCGCACGC GTTCCCATGG CCAACCGCGA CCCGCGTCAC AATGGACGCT TCGCGGAGGC CTTCGCGCTC GGCTACGCGA TACGGGTACT GGATGCCGGT CTGGAATGCC TGACGCTCTC GGCCTTGTCA GGCCCGTTCG GTCTGATCGC CGGTCCAGCC GAACCGACCG AGCAAGGCGG GCGGCGCCCG CTGTTCAACA CAGTGCGGAC ATTGTCTCGA TTGGCTGGCG CATCCTGGCA GGCATGCGTC TCCTCCTCGC CCTCCGAGGT GCTGTCTTTC GTTGCACGCG ATGCCGCAGG CGCCAGGCTT CACGTCGTCA ATCTGACGGG CGAAGAACGA AAGGTCGATT GCGACGCCTG CCGGCCGGCA GATTCGGGCA AAGAGTTTCT GCTCGCGCCG TTTGCGACCG TCGTCCTGCC GCTGGCGGAT TGA
|
Protein sequence | MKVDAFQLYG TRLVETPPVR LRAGKLEADL ANGNLRTIRY DGTEVLRAIS YLVRDPDWGT YSPVIVDLRI EQSDNRFAVA YRARCEGPDD TRLVIDVRIT GSADRLDFEA EAITETGFET NRCGFCILHP IVGVAGSPAT VEHVDGRKVA TRFPDVIEPW QPFKDMRAIT HAIMPDVQAE CRMEGDTFEM EDQRNWSDAS YKAYVRPLAL PWPYQIAANQ PVRQKTSLVI RDIGGSTRHP PAAASGGAIK LELGARTGTM PDIGVIVTPE EADATLSAKS VLSEIAPQEL LFHFDPSAGH GVDALTQFAM LAAAHLGRST LEIALPCTSS PSSEVAEIAH QMRLAEFRPD AIMISPSVDR QSTPPGSTWP ECPPLDEVYT AARAAFPGIR IGGGMLSYFT ELNRKRVPDG QLDFVSHCTN PIVHAADDLS VMQTLEALPF ITRSVRAIYG DRPYRIGPST IPMRQNPYGS RTMDNPSGAR VPMANRDPRH NGRFAEAFAL GYAIRVLDAG LECLTLSALS GPFGLIAGPA EPTEQGGRRP LFNTVRTLSR LAGASWQACV SSSPSEVLSF VARDAAGARL HVVNLTGEER KVDCDACRPA DSGKEFLLAP FATVVLPLAD
|
| |