Gene Information |
Locus tag | Rleg_4241 |
Symbol | |
ID | 8015024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4340894 |
End bp | 4342540 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826811 |
Product | Heparinase II/III family protein |
Protein accession | YP_002978020 |
Protein GI | 241206924 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
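The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4340894, 4342540)
print(length)                  # 1647
print(protein_length(length))  # 548
```

Both values match the Gene Length and Protein Length rows of the record.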
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.222931 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGTTC GGGAGGCTTG GCGGCGCGCC TTGCGCCGCG TCGCGTTGCT GCGCCTGAAG CTCTTTCGCC ATTCGATCAA GGTGCCCGAG CGTCTGATCG TGGCGCCGAC CGATCTCCGT AGCATCGATC CGCATGTGGC CGACGAAATC CTCAACGGAC GGTTTCTCCT GGCCGGGCGC ATGCTGGAAA CGAACGGAAA GTCGCCCTTC ACCTTCACCC TGCCCTCACG ACCCTTTGCG ACCCGTCTTC ACAGCTTCGG CTGGCTGCGT CACATGCGGG CGAACAAGAC GGAGCGCAAC TCGGCTGCCG CCCGCGCGAT CGTCGACAGC TGGCTTTCCA TCCATGCCGG TCGCATGGAG GGGATTGCCT GGGAGACCGA CGTCACCGCG CAACGCGTTA TCGCCTGGCT GTCGCATTCG CCGGTGGTGC TGCAGAATGC CGACCGCGGC TTCTATCGTC GCTTCATGAA GTCGCTGGCG TTCCAGGTGA GGTTCCTGCG CCGCATGGCG CCGTTTACCC TCGGCGGCTT GGAGCTGTTT CGGTTGCGTA TCGCGCTCGC CATGGCTTCC GTCGCCATGC CGACCCGCGC CTCTACGCTC AGAAGGGCGG CGCAGGCGCT CGACCGCGAA TTCGATAGCC AGATTCTGCC GGATGGCGGC CATGTGTCGC GCAATCCGCG CGTCGGGCTG GAATTGCTGC TCGATCTGCT GCCGCTCCGG CAGACCTATG TCAATCTCGG CCATGACCTG CCGCAGAAGC TGATCTCCGG CATAGACCGC ATCTATCCGG CATTGCGGTT CTTTCGCCAT CAGGACGGGG ACCTGGCGCT CTTCAACGGG GCGACCTCGA CGCTCGCAAA CGAGCTGATG TCGGTGCTGC GGTATGACGA GACCGCCGGC CAGCCGTTCA AGGCTTTGCC GCATTCGCGC TATCAGCGGC TTTCCGGGGG AAAGACGGTG ATCATTGCCG ATACCGGCAC GCCGCCTTCT GGAGGCGCGC TTCGGACCGT CCATGCCGGC AGCCTCTCCT TCGAGATGTC GTCCGGACGC CATCGATTCA TCGTCAATTC CGGTTCGCCG AAATTTGCCG GGCACCGTTA TGTCCAGATG GCGCGCACGA CGGCGGCGCA TTCGACCGTC ATCCTCAACG ACACCTCGTC CAGCCGTTTT TCGCCCTCGC CCTTCCTTAA CCACGCAATC ACCGAACCGG TGAGGACAAT CACCGTCGAG CGTGCCGAAA CCGAGGACGG ACGCGACGGC ATCAAGCTCA GCCATGACGG TTATCTCAGG GTGTTCGGGG TGCTGCACGA GCGTGAGCTG ACACTCAATG CTGCAGGCTC GATCGTGACC GGCCGCGACC GGCTCGTCGT CCGCGAAGGA TATGAGCATG ACGAACCGTT GAAGGCGGTC GCCCGCTTTC ATATCCATCC TTCGATCGTC CTGCATCAGA GTGATGGCGA GTCCGTGCTG CTGACGGCGC CGGACGGCGA AAGCTGGCTG TTTTCCGCAC CTGGCAATGA AGTGCTGATC ACCGAGGACA TCTTCTTTGC CGACAGCTCC GGCATTTGCG GCTCGGACCA GATCGAGATC GATTTCGATC TTGCCGAGAA GATGGAAATC CGCTGGTTTT TGTCCCGCAA AAGCTAG
|
Protein sequence | MYVREAWRRA LRRVALLRLK LFRHSIKVPE RLIVAPTDLR SIDPHVADEI LNGRFLLAGR MLETNGKSPF TFTLPSRPFA TRLHSFGWLR HMRANKTERN SAAARAIVDS WLSIHAGRME GIAWETDVTA QRVIAWLSHS PVVLQNADRG FYRRFMKSLA FQVRFLRRMA PFTLGGLELF RLRIALAMAS VAMPTRASTL RRAAQALDRE FDSQILPDGG HVSRNPRVGL ELLLDLLPLR QTYVNLGHDL PQKLISGIDR IYPALRFFRH QDGDLALFNG ATSTLANELM SVLRYDETAG QPFKALPHSR YQRLSGGKTV IIADTGTPPS GGALRTVHAG SLSFEMSSGR HRFIVNSGSP KFAGHRYVQM ARTTAAHSTV ILNDTSSSRF SPSPFLNHAI TEPVRTITVE RAETEDGRDG IKLSHDGYLR VFGVLHEREL TLNAAGSIVT GRDRLVVREG YEHDEPLKAV ARFHIHPSIV LHQSDGESVL LTAPDGESWL FSAPGNEVLI TEDIFFADSS GICGSDQIEI DFDLAEKMEI RWFLSRKS
|
| |
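As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 bases of the record. It assumes the sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for amino acids; table 11 differs only in the alternative start codons it permits. `CODON_TABLE` and `translate` are hypothetical names introduced for this example.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTATGTTC GGGAGGCTTG GCGGCGCGCC"))  # MYVREAWRRA
```

The output matches the first ten residues of the protein sequence above.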