Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1423 |
Symbol | |
ID | 7091763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 1538567 |
End bp | 1540276 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643464761 |
Product | Heparinase II/III family protein |
Protein accession | YP_002361750 |
Protein GI | 217977603 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.332975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCGGCG CGATTTTGAA GGATCGGCTG CATGTCGCGG GCCTCGTTCT CGCGCGCGGC GCGGCGGCGG CGTGGCGCTT CGCCGCGAGC CCCTGGCGCG CGCTCGCCGG GCTGCGGCGC CGGCCGCCCG AACGGCTGCT GATCGCGCCG CAGGACATCC GAACCAGCGA TCCGACGCTC GCCGCCGATA TTTACGCCGG CTATTTCGCT TTCGGCGGCA AGATCGTCAA CGCGCATGGA CGCTCGCCTT TCGAGCTTGA GCCGCAATCG GACGCCTGGG CGCGCTCGCT CGCCAGTTTC AGCTGGCTGC GCCATCTGCG CGCGGCCGAT ACGGCGATCT CGAAAGCGAA TGCGCGGGCG CTGGTCGAGG ATTTTCTCGT CGATTTCAGC AAGCCCGGCG CGAACGCGGC CTGGGAGCCG CGCGTCGCGG CCAGACGGCT GCTCGCCTTC CTCTCGCAGT CGCCGATCAT CCTGCATGGC GCCGACCGCG CTTTCTATCG CCGCTTCATG AAGGCGATCG GGCGCACGCA GCAATTTCTC GAACGCAAGA TGGCCGAGGG GCTCGCCGGC GAGGACCGGC TGCTTGTCGC GATCGCGCTG GTCGAGCTCA GCCTTTGCGC CGAGGACGCC GGAAAATTGC GCCAGCGCGC CGGCCGGATG CTCGCCGAGG AGCTGCAGCG CCAGATCCTG CCGGATGGCG GCCACATCAG CCGCAATCCG CAGATCCTGA TCGACCTGCT GCTCGATCTT CTGCCGCTGC GTCAGGCCTA TGCCGCGCGC GGCGCGCAGC CGCCGCCGCA GCTTCTCAAC GCGATCGACC GCATGATGCC CATGCTGCGG CTGTTCCGCC ATGGCGACGG CGCGCTGGCG CTGTTCAACG GAATGGGCGT GACGCCGCCG GAGCAGCTTG CGACGGTGCT CGCCTATGAC GACAGCCGGG CGCGGGCGCT GACCAATGCG CCCCATTCCG GCTATCAGCG GCTGGAGGGG CAGGACGCGG TCGTCGTCGT CGACGCCGGC CGGCCGCCGC CGCCGGTGTT TTCGACGCGC GCCCACGCCG GCTGCGCCTC GTTTGAATTT TCGGTCGGCG CGCAGCGTCT CGTGTTGAAT TGCGGGGCGC CGGAGGCAAA CCGAGCCGCC GCGCGCGAGG CGGCGCGCAT GACCGCCGCC CATTCGACGC TTGTCGTCGA CGATCTGTCC TCATCGCGCT TCGCCTTTCA TCTCGGCTTG CGCAAATGGC TCGGGGACGA AATCGTGTCG GGGCCGGAAC AGGTCGAGAT CGAACGCCGC GACGAGGCGG CGGGCTCGAC TCTCGTGGTC CAGCACGACG GTTATGCCTC CCGCTTCGGC CTCATTTGCC AGCGCCGCCT CGTGTTGCAC AAGGACGGCA AATGGCTCGA CGGCGCTGAC CGCATGGTCG CCGCGACGCC CGGCGACGCC ATCGAGCGCC GCCCCTTCGC GGTGCGCTTC CACATCCATC CCAATGTGCG GCTGAAGCGG GTGCGCGAGG GCCATGCGGT GTTGTGCCTG CTTCCGAACG GGCGGCGCTG GCTGTTCGAG ACGCCCTGGA TCGCCGAGAT CGAGGAGAGC ATTTTCTTCG CCGCCCCGGA TGGACCAAGA GCCTGCTCTC AGATCGTGCT CGAGGGCGAG ACGCGCGACG GCCTGGAGCT GACTTGGAGC TTTCGGCAGG CGGAGAAGAA GAAGCGGTAG
|
Protein sequence | MAGAILKDRL HVAGLVLARG AAAAWRFAAS PWRALAGLRR RPPERLLIAP QDIRTSDPTL AADIYAGYFA FGGKIVNAHG RSPFELEPQS DAWARSLASF SWLRHLRAAD TAISKANARA LVEDFLVDFS KPGANAAWEP RVAARRLLAF LSQSPIILHG ADRAFYRRFM KAIGRTQQFL ERKMAEGLAG EDRLLVAIAL VELSLCAEDA GKLRQRAGRM LAEELQRQIL PDGGHISRNP QILIDLLLDL LPLRQAYAAR GAQPPPQLLN AIDRMMPMLR LFRHGDGALA LFNGMGVTPP EQLATVLAYD DSRARALTNA PHSGYQRLEG QDAVVVVDAG RPPPPVFSTR AHAGCASFEF SVGAQRLVLN CGAPEANRAA AREAARMTAA HSTLVVDDLS SSRFAFHLGL RKWLGDEIVS GPEQVEIERR DEAAGSTLVV QHDGYASRFG LICQRRLVLH KDGKWLDGAD RMVAATPGDA IERRPFAVRF HIHPNVRLKR VREGHAVLCL LPNGRRWLFE TPWIAEIEES IFFAAPDGPR ACSQIVLEGE TRDGLELTWS FRQAEKKKR
|
| |