Gene Ppha_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_1456 
Symbol 
ID6463550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp1528136 
End bp1530934 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content50% 
IMG OID642727686 
Producttype III restriction protein res subunit 
Protein accessionYP_002018327 
Protein GI194336533 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACG AGCAGCAAAC CCGCATCGAA TTGATTGACA GGATGTTACT GCAGGCGAGC 
TGGAACGTGA ACGATCCTCT CCAGGTTGTA GCAGAGTTCG ATATTCTTGT CGGTTTGCCC
GAAGGGGTGC AGGAACCCCG CACTCCTTAT GAAGGCCATC AGTTCAGCGA CTATGTTTTG
CTGGGTAAGG ATGGCAAACC TCTTGCCGTC GTTGAAGCAA AAAAAACAAG TAGGGATGCA
GCCATTGGCC GTGAACAGGC AAAACAGTAT TGCTGCAATA TCTGGAAACA GTTGGGCGTC
GAGCTTCCAT TCTGTTTTTA TACCAATGGC CTTGAGACCT TTTTCTGGGA TATCGACAAC
TACCCTCCAC GAAAGGTAAT CGGCTTTCCA ACCCGTGATG ACCTTGAGCG GTTCAGCTAT
ATCCGCAGAA GCCGCAAGCC TCTCACCGAA GAACTGATCA ATACAGCCAT TGCCGGACGG
GATTATCAGA TTCGCGCCAT TCGATCCGTC CTCGAAGCCA TTGAACAGAA AAAGCGGGAC
TTCCTGCTCG TTATGGCCAC CGGCACAGGG AAAACCCGCA CCGCAATCGC CATGGTTGAT
GCCCTAATGC GTGGCGGACA TGCTGAAAAA ATTCTTTTTC TGGTCGATCG CATTGCCTTG
CGTGAGCAGG CGCTCTCCGC CTTCAAGGAG CATTTGCCCC ACGAACCTCG CTGGCCCAAC
AGCGGTGAAA AGGTTTTTGC AAAAGATCGC CGTATCTACA TTGCGACCTA CCCAACGATG
CTCAACCTCA TCAGGGATGA ATCATCATAC CTTTCACCCT CTTTCTTTGA CTTCATTGTC
GTAGACGAGA GCCATCGCTC CATCTACAAC ACCTGGGGGG AGATCCTCGA TTACTTCAAA
ACAATCACCC TGGGGCTGAC GGCAACCCCC ACAGATATTC TTGACCACAA CACCTTCAAC
CTCTTTCACT GCGAGAATGG CCTTCCAACC TTTGCCTATA CCTATGAAGA GGCAGTGAAC
AATATTCCGC CATATCTGTG CAATTTCCAG GTGATGAAAA TCCAGACGAA ATTCCAGATG
GAGGGCATCA GCAAGCGGAC AATCTCCCTT GAAGATCAGA AAAAACTGAT TCTCGAAGGC
AAGGATATCG AAGAGATCAA TTTTGAAGGG ACGCAGCTTG AAAAGCAGGT GATCAATCGG
GGGACGAACA GCCTGATTGT CAAAGAGTTC ATGGAAGAGT GCATCAAAGA CCAGAACGGC
GTTCTTCCCG GAAAAACCAT ATTCTTCTGC GCCACCATAG CCCATGCACG CAGAATTGAA
GAGATATTCG ACAGGCTCTA CCCGGAATAC AAAGGCGAAC TTGCCAAGGT TCTGGTTTCC
GATGACCCCC GCGTCTACGG CAAGGGAGGT TTGCTCGATC AGTTTACGAA CAGCGATATG
CCCCGCATCG CCATCAGTGT TGACATGCTC GATACCGGTA TAGACATCCG GGAACTCGTC
AATCTCGTCT TTGTCAAGCC GGTCTACTCC TACACAAAAT TCTGGCAGAT GATTGGCCGA
GGCACGCGGC TCCTTGAACC CGCAAAAATC AAGCCATGGT GCACCAAAAA AGAGCTTTTC
CTGATTCTTG ACTGCTGGGA CAACTTCGAA TATTTCAAGC TTCAGCCCAA AGGCAAAGAG
CTGACACAGC AACTCCCGCT TCCGGTGAAA CTGTTCGGGC TGCGGCTCGA CAAAATTGAA
TATGCCCTCT CAATTGGTAA CACGGCCATT GCAGAACGGG AAGTGGTAAA ACTGCGTAAA
CAGATTGCCG GGCTTCCGCA TACCTCAGTG GTGATCAAAG AGGCCGCATC GCTTCTTCAC
CCTCTGGAAG AGGAGAACTT CTGGATATCT CTCACACCCC AAAAGCTGGA AAATCTGAGA
AGCGAGATCA AACCACTCTT CAGAACCGTC TCGGATGCCG ATTTTAAAGC CATGCGTTTT
GAGCGGGACG TTCTGGAGAG TTCACTGGCA AAACTTCGCG ACCAGAAAGA GCGCTACGGC
ACGCTCAACG ACGGTATTGC CGAGCAGATC AGCCAGCTTC CCCTGAGCGT CGGCTTTGTG
AAACAAGAAG AGGAATTGAT ACGGGCCGCT CAAACGAAAC ACTTCTGGAA CCAGGCTACG
GAAGAGAGCT TCGACGAGCT GATTGAAAAG CTCTCCCAGT TGATGAAGTT TCGCGAGCCT
GATAGCGGCG CAATCGGTCA AGTTCACCTG AACTTGCAGG ATCTTCTGCA CCATAAAGAG
ATGGTTGAAT TCGGCCCCCG AAATGAGGCC GTCAGCATTA CCCGCTACCG CGAAATGGTT
GAATCGCTCA TTACCGAACT GACAAAACAG AATCCGATTC TTTCCAAAAT CAAGGAGGGT
AAAGAGATTT CCCCTGAAGA GGCCACTGAA CTTGCGGAGC TGCTCCACGA AGAGCATCCG
CACATTACCG AGGAGCTGTT GCGTGCCGTC TATCAAAACC GCAAGGCCCA TTTCATCCAG
TTTATCCGCC ATATTCTCGG CCTCGAAATT CTCAAGAGCT TCCCTGAAAC GGTTGCCGAT
GCGTTCAATC AGTTTATCAA ACAGCATTCC AACTTTTCCA CTCGCCAACT CGACTTTTTA
AACCTCCTCA AAAACGTTCT TGCGGAACGT GAAAAAGTTG AAAAAAGAGA CCTGATCAAC
GCCCCCTTCA CGGTCATCCA CCCGAAAGGC ATTCGCGGAG TCTTCAGTCC GGCTGAAATC
AATGAAATTC TGGCATTGGC CCGGCAACTT GCAGCATAA
 
Protein sequence
MKNEQQTRIE LIDRMLLQAS WNVNDPLQVV AEFDILVGLP EGVQEPRTPY EGHQFSDYVL 
LGKDGKPLAV VEAKKTSRDA AIGREQAKQY CCNIWKQLGV ELPFCFYTNG LETFFWDIDN
YPPRKVIGFP TRDDLERFSY IRRSRKPLTE ELINTAIAGR DYQIRAIRSV LEAIEQKKRD
FLLVMATGTG KTRTAIAMVD ALMRGGHAEK ILFLVDRIAL REQALSAFKE HLPHEPRWPN
SGEKVFAKDR RIYIATYPTM LNLIRDESSY LSPSFFDFIV VDESHRSIYN TWGEILDYFK
TITLGLTATP TDILDHNTFN LFHCENGLPT FAYTYEEAVN NIPPYLCNFQ VMKIQTKFQM
EGISKRTISL EDQKKLILEG KDIEEINFEG TQLEKQVINR GTNSLIVKEF MEECIKDQNG
VLPGKTIFFC ATIAHARRIE EIFDRLYPEY KGELAKVLVS DDPRVYGKGG LLDQFTNSDM
PRIAISVDML DTGIDIRELV NLVFVKPVYS YTKFWQMIGR GTRLLEPAKI KPWCTKKELF
LILDCWDNFE YFKLQPKGKE LTQQLPLPVK LFGLRLDKIE YALSIGNTAI AEREVVKLRK
QIAGLPHTSV VIKEAASLLH PLEEENFWIS LTPQKLENLR SEIKPLFRTV SDADFKAMRF
ERDVLESSLA KLRDQKERYG TLNDGIAEQI SQLPLSVGFV KQEEELIRAA QTKHFWNQAT
EESFDELIEK LSQLMKFREP DSGAIGQVHL NLQDLLHHKE MVEFGPRNEA VSITRYREMV
ESLITELTKQ NPILSKIKEG KEISPEEATE LAELLHEEHP HITEELLRAV YQNRKAHFIQ
FIRHILGLEI LKSFPETVAD AFNQFIKQHS NFSTRQLDFL NLLKNVLAER EKVEKRDLIN
APFTVIHPKG IRGVFSPAEI NEILALARQL AA