Gene Ppha_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_0354 
Symbol 
ID6461574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp342033 
End bp343055 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content35% 
IMG OID642726644 
ProductHNH nuclease 
Protein accessionYP_002017302 
Protein GI194335508 
COG category[V] Defense mechanisms 
COG ID[COG3440] Predicted restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.783884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATG TTCTGCTTGT AAATATTACA TGGAACCCGA AAAACTGGAC AAACAGATAT 
ATTAATCCAC GAGCAGGGCA TGAATACGCC AGACAAAATG CTGGACACGA ATCACTCAAC
TTCAAATTTG ATAAACGAGA TATTGATACA GAAAAATTTA TACACGGTTT CGTTCAATGG
ACAAATCCCC CAAAAAAGTT TCAGAACGGA GGATTAATTA TTTTTTACAC AAAAAATATT
GACTATAAAA AAGGACAAGT TGTGGGAGTC TACGGCAAAT CCGAAATTAT TGATCCGCAA
CAGCACTTTG TTGTTAAAGG TTTTCAAAAC AACCAATATA GCATTAACAT CAGAGCCGAA
AAGCGGTTTT CGATTTTATT CCCTGTTCCC CTTTCTGCAG ACAATTACAA AGCCAAATCA
AGTGACAGAA TGGTTCCACA AGTTGGATTC ACTTATAAAG ACAACATATT TGCGAAAAAA
ATTCTCTATG ATGAACTTGT AGAATTATCT AACACAGGTA TTTTGCAATC CGATTACAAA
AAGCTCTCTG ATATTTACGA ATACTATGTC GAAGAAAAAT TCGAACTTCC GTATTGCTCG
CAAGACGAAA AAGAGCAGGA AGAACTTGCC CAATTCTATA AACAAGCAAA GATAAAAGAA
GAAATTCTTG ATGAGCTAAA TAATCTACAA CCGTCGGATA GTGAAGAAAT TATTGTGAAC
AAAAAAACAT ATAAACGAGA CAATAAGACA ATTGCTCAAA TAAAAATTCT TCGGGACTTC
AAATGCCAAA TTTGCGGAGT AACAATTACA AAAAAGGATC GCAGTAAATA TATCGAAGCG
GCGCACATCA AAGCGAAGCA TCAAAAAGGC AGAGAGACTA TAGATAATAT AATTTTACTT
TGCCCCAACC ACCACAAAGA ATTTGACTTA GGCGATAGAA TAATAAAATC ACACGATAGC
AATTTTATTG ATTTTGTACT CAACGGGCAA CAGTACAAAA TCAGTTTGAC AACAGGACAA
TAA
 
Protein sequence
MNDVLLVNIT WNPKNWTNRY INPRAGHEYA RQNAGHESLN FKFDKRDIDT EKFIHGFVQW 
TNPPKKFQNG GLIIFYTKNI DYKKGQVVGV YGKSEIIDPQ QHFVVKGFQN NQYSINIRAE
KRFSILFPVP LSADNYKAKS SDRMVPQVGF TYKDNIFAKK ILYDELVELS NTGILQSDYK
KLSDIYEYYV EEKFELPYCS QDEKEQEELA QFYKQAKIKE EILDELNNLQ PSDSEEIIVN
KKTYKRDNKT IAQIKILRDF KCQICGVTIT KKDRSKYIEA AHIKAKHQKG RETIDNIILL
CPNHHKEFDL GDRIIKSHDS NFIDFVLNGQ QYKISLTTGQ