Gene Information |
Locus tag | Phep_0933 |
Symbol | |
ID | 8252027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1085865 |
End bp | 1086980 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644934588 |
Product | restriction endonuclease |
Protein accession | YP_003091217 |
Protein GI | 255530845 |
COG category | [V] Defense mechanisms |
COG ID | [COG3183] Predicted restriction endonuclease |
TIGRFAM ID | |
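
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1085865
end_bp = 1086980

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1116 371"
```

Both values agree with the Gene Length and Protein Length fields.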
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.989614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000000169325 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATACGTT CTCAAAATTT TTGGCTGGTA GCTTTGTTTC TTTCAAAATT CGGAGATCTT AATAAAGCAA ACAAATCTGT TCCTCCTCAG GAAGTGGGCG GAACTTTATG GAAGGATGCC TATCAATATT TTTTTAATGA TTTGGGAGAG GGTAGAACGA CCTCTTCATT TGAACACAGT TTGAAGAATG CCAGAGATGC ATTTGACAGT CACCTGAAAA AATCAACACG GATAGGATGG AAAGATTTAA GGGGCAGGGC GGCTATTTTA CCCAAAGAAG CATTATATGT TTTTAAAAAA TATAAAAATG TAGAGAGAAA TGATTTGTGG AAGGAAATTC AATTATCAGT TCTGAAAACT AAAAATAATA ATTCACTAAA AACTGAGCAA ATAGCTAGTC CGAGTAGTAA AAATCCTAAT TGGGTCAGAC AAGAATTGAT TCTTGCGCTT GATTTGTACT TCGATCTTGA TCAGGGACAA ATGCATAGAT CAAATGAAAA AGTTATTGCG CTGAGCGATT TGCTTAGAAA ATTGTCCGTA CATAAGCATA TTCCAGATAT AAAGAAATTC AGAAATCCGA GTGGAGTTGC CAGAAGATTA GGCAATTTTA AAGCAATGGA CTCAGGTTAT ACGGGTGATG GTTTGTCAAA TTCAGGTAAG CTGGCGAAAA TAATATTTGA TGAATTCCGT ATGCATCGTG GGAGGTTGAA AGAGGAGGCT GAATTAATTA AACAAATTGC AAATAAGGCG GTAGAGGGGA AGTTAGCCGA ACCAGCTGTA TCATACACTT CATCCAAGGA ACAAGAATTT AAATACAATT ACCATAAAAA TCTGGAGTTG AATCCACTAA CTTTCAGAGT AAAAAAGCAA AGCATTAACA ACAGCGAACT AATCACCTGT TTTTTATGTA AAATGAATTC ACAGGATGTA TATGGTACCT TGGGAAGTGA CTTGATGGAA TTACACTATG TCGGCAACAT TGATGAAACA TCGTTAACAA GTGGCTTCAA TCCTGAGGAT TTTATATTAG TCTGCCCTAA CTGCCATAAG CTGCTTGATA CCTATTACGC AATTATAACA TATGATGACT TAAAGAATAT TCTATCAAGT AAATAA
Protein sequence | MIRSQNFWLV ALFLSKFGDL NKANKSVPPQ EVGGTLWKDA YQYFFNDLGE GRTTSSFEHS LKNARDAFDS HLKKSTRIGW KDLRGRAAIL PKEALYVFKK YKNVERNDLW KEIQLSVLKT KNNNSLKTEQ IASPSSKNPN WVRQELILAL DLYFDLDQGQ MHRSNEKVIA LSDLLRKLSV HKHIPDIKKF RNPSGVARRL GNFKAMDSGY TGDGLSNSGK LAKIIFDEFR MHRGRLKEEA ELIKQIANKA VEGKLAEPAV SYTSSKEQEF KYNYHKNLEL NPLTFRVKKQ SINNSELITC FLCKMNSQDV YGTLGSDLME LHYVGNIDET SLTSGFNPED FILVCPNCHK LLDTYYAIIT YDDLKNILSS K
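
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce the record: `translate`, `CODON_TABLE`, and the other names are hypothetical, and the amino acid assignments are those of the standard code, which translation table 11 shares (table 11 differs only in which codons may serve as start codons). Only the first three 10-base blocks of the gene sequence are translated here.

```python
from itertools import product

# Amino acids for the 64 codons, enumerated with bases in TCAG order
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, dropping any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        # Codons containing ambiguous bases are reported as "X".
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
prefix = "ATGATACGTT CTCAAAATTT TTGGCTGGTA"
print(translate(prefix))  # prints "MIRSQNFWLV"
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would translate the whole CDS under the same assumptions.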
| |