Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3916 |
Symbol | |
ID | 6982680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4063041 |
End bp | 4064687 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643398639 |
Product | Heparinase II/III family protein |
Protein accession | YP_002283404 |
Protein GI | 209551487 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.729618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.655355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGTTC GGGAGGCTTG GCGGCGCGCC TCGCGCCGCG TCGCGTTGCT GCGCCTAAAG CTCTTCCGCC ATTCAATCAA CGTGCCCGAG CGTCTGATCG TCGCGCCGAC CGATCTCCGC AGCATCGATT CGCATGTGGC CGACGAGATT CTCAACGGAC GTTTTCTGCT GGCCGGGCGG ATGCTGGAAA CGAGTGGAAA GTCACCCTTC ACCTTTACCT TGCCCTCGCG CCCCTTCGCG ATCCGCCTTC ATAGCTTCGG CTGGCTGCGG CATATCCGGG CGAACAAGAC GGAGCGCAGT TCGGCGGCGG CCCGCGCGAT CGTCGACAGC TGGCTTTCCA TCCATGCCGG GCGCATGGAA GGGATTGCAT GGGAGATCGA CGTCACCGCG CAGCGCGTCA TTGCCTGGCT CTCGCATTCG CCGGTGGTGC TGCAGAATGC CGACCGCGGC TTTTATCGCC GCTTCATGAA GTCGCTGGCC TTCCAAGTGA GGTTCCTGCA CCGCATGGCG CCGTATACGC TTGGCGGCCT GGAGCTGTTT CGGCTGCGTA TCGCGCTCGC CATGGCCTCC GTCGCCATGC CTGCCCGCGC ATCGACGCTC AGACGGGCGG CGCAGGCGCT CGACCGCGAA TTCGATAGCC AGATTCTACC GGATGGCGGC CATATCTCGC GCAATCCGCG CGTCGGTCTG GAATTGCTGC TCGATCTGCT GCCGCTGAGA CAGACCTATG TCAATCTCGG CCATGACCTG CCGCAGAAGC TGATCTCCGG CATCGACCGC ATCTATCCGG CTCTGCGGTT CTTTCGCCAT CAGGACGGGG ATCTGGCGCT GTTTAACGGG GCGACCTCGA CGCTGGCAAA CGAGCTGATT TCCGTGCTGC GCTATGACGA GACCGCCGGT CAGCCGTTCA AGGCTTTGCC GCAGTCGCGC TATCAGCGGC TTTCCGGCGG AAAAACAGTC ATCATTGCCG ATGTCGGCAC GCCGCCTTCG GGCGGCGCGT TGCGGACCGT TCATGCCGGC AGCCTCTCCT TCGAAATGTC GTCGGGCCGC CACCGCTTCA TCGTCAATTC CGGCTCGCCT AAATTTGCCG GGCACCGCTA TGTCCAGATG GCCCGCACGA CGGCCGCGCA TTCGACGGTT ATTTTGAACG ACACTTCGTC CAGCCGCTTT TCGCCTTCGC CCTTCCTCAA TCACGCGATT ACCGAACCGG TGAGAACAGT GACCGTCGAG CGTGCCGAAA CGGAGGATGG ACGTGACGGC ATCAAGCTCA GCCATGACGG TTATCTCAGG GCGTTCGGGG TGCTGCACGA ACGTGAGCTG ACACTCAATG CCGCAGGCTC GATCGTGACC GGGCGCGACC GGCTCGTCGT CCGGGAAGGG TATGAACACG ACGAGCCCTT GAAGGCCGTC GCTCGTTTCC ACATTCATCC CTCCATCGTC CTGCAGCAGA GCGACGGGGA GTCCGTGCTG CTGACAGCGC CGGACGGCGA AAGCTGGCTG TTTTCGGCGC CCGGCAACGA AGTGCTGATC ACCGAGGACA TCTTCTTTGC CGACAGCTCC GGCATTTGCG GTTCGGATCA GATCGAAATC GACTTCGATC TTGCCGAGAA GACGGAAATC CGCTGGTTTT TGTCCCGCAA GGGATAG
|
Protein sequence | MYVREAWRRA SRRVALLRLK LFRHSINVPE RLIVAPTDLR SIDSHVADEI LNGRFLLAGR MLETSGKSPF TFTLPSRPFA IRLHSFGWLR HIRANKTERS SAAARAIVDS WLSIHAGRME GIAWEIDVTA QRVIAWLSHS PVVLQNADRG FYRRFMKSLA FQVRFLHRMA PYTLGGLELF RLRIALAMAS VAMPARASTL RRAAQALDRE FDSQILPDGG HISRNPRVGL ELLLDLLPLR QTYVNLGHDL PQKLISGIDR IYPALRFFRH QDGDLALFNG ATSTLANELI SVLRYDETAG QPFKALPQSR YQRLSGGKTV IIADVGTPPS GGALRTVHAG SLSFEMSSGR HRFIVNSGSP KFAGHRYVQM ARTTAAHSTV ILNDTSSSRF SPSPFLNHAI TEPVRTVTVE RAETEDGRDG IKLSHDGYLR AFGVLHEREL TLNAAGSIVT GRDRLVVREG YEHDEPLKAV ARFHIHPSIV LQQSDGESVL LTAPDGESWL FSAPGNEVLI TEDIFFADSS GICGSDQIEI DFDLAEKTEI RWFLSRKG
|
| |