Gene Phep_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0933 
Symbol 
ID8252027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1085865 
End bp1086980 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content35% 
IMG OID644934588 
Productrestriction endonuclease 
Protein accessionYP_003091217 
Protein GI255530845 
COG category[V] Defense mechanisms 
COG ID[COG3183] Predicted restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.989614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000169325 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATACGTT CTCAAAATTT TTGGCTGGTA GCTTTGTTTC TTTCAAAATT CGGAGATCTT 
AATAAAGCAA ACAAATCTGT TCCTCCTCAG GAAGTGGGCG GAACTTTATG GAAGGATGCC
TATCAATATT TTTTTAATGA TTTGGGAGAG GGTAGAACGA CCTCTTCATT TGAACACAGT
TTGAAGAATG CCAGAGATGC ATTTGACAGT CACCTGAAAA AATCAACACG GATAGGATGG
AAAGATTTAA GGGGCAGGGC GGCTATTTTA CCCAAAGAAG CATTATATGT TTTTAAAAAA
TATAAAAATG TAGAGAGAAA TGATTTGTGG AAGGAAATTC AATTATCAGT TCTGAAAACT
AAAAATAATA ATTCACTAAA AACTGAGCAA ATAGCTAGTC CGAGTAGTAA AAATCCTAAT
TGGGTCAGAC AAGAATTGAT TCTTGCGCTT GATTTGTACT TCGATCTTGA TCAGGGACAA
ATGCATAGAT CAAATGAAAA AGTTATTGCG CTGAGCGATT TGCTTAGAAA ATTGTCCGTA
CATAAGCATA TTCCAGATAT AAAGAAATTC AGAAATCCGA GTGGAGTTGC CAGAAGATTA
GGCAATTTTA AAGCAATGGA CTCAGGTTAT ACGGGTGATG GTTTGTCAAA TTCAGGTAAG
CTGGCGAAAA TAATATTTGA TGAATTCCGT ATGCATCGTG GGAGGTTGAA AGAGGAGGCT
GAATTAATTA AACAAATTGC AAATAAGGCG GTAGAGGGGA AGTTAGCCGA ACCAGCTGTA
TCATACACTT CATCCAAGGA ACAAGAATTT AAATACAATT ACCATAAAAA TCTGGAGTTG
AATCCACTAA CTTTCAGAGT AAAAAAGCAA AGCATTAACA ACAGCGAACT AATCACCTGT
TTTTTATGTA AAATGAATTC ACAGGATGTA TATGGTACCT TGGGAAGTGA CTTGATGGAA
TTACACTATG TCGGCAACAT TGATGAAACA TCGTTAACAA GTGGCTTCAA TCCTGAGGAT
TTTATATTAG TCTGCCCTAA CTGCCATAAG CTGCTTGATA CCTATTACGC AATTATAACA
TATGATGACT TAAAGAATAT TCTATCAAGT AAATAA
 
Protein sequence
MIRSQNFWLV ALFLSKFGDL NKANKSVPPQ EVGGTLWKDA YQYFFNDLGE GRTTSSFEHS 
LKNARDAFDS HLKKSTRIGW KDLRGRAAIL PKEALYVFKK YKNVERNDLW KEIQLSVLKT
KNNNSLKTEQ IASPSSKNPN WVRQELILAL DLYFDLDQGQ MHRSNEKVIA LSDLLRKLSV
HKHIPDIKKF RNPSGVARRL GNFKAMDSGY TGDGLSNSGK LAKIIFDEFR MHRGRLKEEA
ELIKQIANKA VEGKLAEPAV SYTSSKEQEF KYNYHKNLEL NPLTFRVKKQ SINNSELITC
FLCKMNSQDV YGTLGSDLME LHYVGNIDET SLTSGFNPED FILVCPNCHK LLDTYYAIIT
YDDLKNILSS K