Gene Ppha_0743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_0743 
Symbol 
ID6461389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp776460 
End bp779459 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content55% 
IMG OID642726999 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002017654 
Protein GI194335860 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.575824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACTG AAAACCAGAC CGAGCTGAGC CTGATCGATA AACTCCAGGA TCTCAAATAC 
AGCTACCGCC CCGACATTCG CGACCGGGAC GCACTCGAAA AAAACTTCCG CGAAAAGTTC
GAAGCCCTCA ACCAGATTCA TCTCACCGAC GCCGAGTTTG CCCGGCTGCT CGATCAAATC
GTCACCCCGG ACGTTTTCGC CGCCTCTCGT CATCTGCGCG AACGCAACAG CTTCGAGCGC
GACGACGGCA CACCACTCTT CTACACTCTG GTCAACATCC GGGAGTGGTG CAAAAACAGC
TTCGAGGTCG TCAACCAGCT CCGCATCAAC ACCAACAACA GCCATCACCG CTACGATGTG
TTGCTCCTCA TCAACGGTGT GCCGGTGGTT CAGATCGAGC TGAAGACCCT CGCCATCAGC
CCGCGCCGCG CCATGCAGCA GATTGTCGAG TACAAAAACG ACCCCGGCAA CGGCTACAGC
AAAACCCTGC TCTGCTTTTT GCAACTCTTC ATCGTCAGCA ACCGCACCGA CACCTGGTAC
TTCGCCAACA ATAACAGTCG CCACTTCAGC TTTAACGCCG ACGAGCGTTT TCTGCCGTTC
TACCAGTTCG CCGGAGAAGA CAACAAAAAA ATCACCCATC TCGACAGCTT CGCCGAAAAG
TTCCTCGCCA AATGCACCCT CGGCGAAATG ATCAGCCGCT ACATGGTGCT GGTGACGAGC
GAGCAAAAGC TGATGATGAT GCGCCCCTAC CAGATCTATG CCGTCAAGGC TATCGTGGAG
TGCATTCACC AGAACTGCGG TAACGGCTAC ATCTGGCACA CCACCGGCAG CGGCAAAACC
CTCACCTCCT TCAAGGCATC AACCCTGCTC AAGGATAACC CGGATATCGA CAAATGCCTT
TTCGTCGTTG ACCGCAAAGA CCTCGACCGG CAGACGCGGG AGGAGTTCAA CCGCTTTCAG
GAGAAGTGCG TCGAAGAGAA CACCAACACC GAAACCCTGG TGCAGCGGTT GCTCTCCGAT
GACTATGCCA ATAAAGTGAT CGTCACCACC ATCCAGAAGC TCGGCCTTGC CCTCGACGGC
AGCAACAAAC GCAACTACAA GGAGCGGCTC GAACTGCTCC GCAAAAAGCG CATGGTTTTC
ATCTTTGACG AATGCCACCG CTCCCAGTTC GGCGAAAACC ACAAGGCGAT CAAAGAGTTT
TTCCCCAACG CCCAGCTCTT CGGCTTCACC GGCACACCAA TTTTCCCCGA AAACGCCAGC
TACCAGCAGA TTGAAGGGGA GCAGGCCACC TGGAAAACCA CGGAAGAGAT CTTCCAGCAG
CAACTGCATG CCTACACCAT TACCCACGCC ATCGAAGACC GCAACGTCCT GCGCTTCCAC
ATCGACTATT TCAAGCCCGA AGGTAAGAAC ACGCCCAAGC CCGGCGAAAC GCTGGCAAAA
AAAGCCATCG TTGAAGCCAT CCTTCAAAAA CACGATACCG CCACCTACGG GCGCAAGTTT
AACGCCCTTC TGGCCACCGC ATCTATCAAC GACGCCATAG CTTATCACGA GCTGTTCAAG
AGCATTCAGG AGGAAAAGCA GGCCAAAGAT GAAAATTTCC AGCCACTCAA CATCGCCTGC
GTCTTTTCCC CGCCTGCCCA GGCCATTGCT GGTGATGCTG GGAATCGAAA CGAAAAGAAT
GTGGCAGACA TCAAGCAGCT TCAGGAAGAT CTGCCACAGG AAAAGGCCGA TAACGAGGAA
GACCCGGACA AAAAAAAGGC GGCCCTCAAA GCCATCATTG CCGACTACAA CGACCGTTAC
AAAACCAACC ACCGCATCGA CGAATTCGAC CTCTACTACC AAGATGTGCA AAAGCGTATC
AAAGATCAGC AATACCCCAA CCAGGACTTG CCGCACGCGC AGAAGATCGA CCTGATAATC
GTGGTCGACA TGCTGCTCAC CGGCTTCGAC TCCAAATTCC TGAACACCCT CTACGTCGAC
AAAAACCTCA AGTACCACGG GCTCATTCAG GCCTTCTCAC GCACCAACCG CGTACTGAAC
GGCACCAAGC CCTACGGCAC CATTCTCGAC TTCCGCCAGC AGCAAAGCGC CGTCGATGAA
GCCATCAAAC TATTCTCTGG CGAACAAGCC GACCGCGCCA CCGAAATCTG GCTGGTCGAT
TCCGCACCGG TGGTCATCAA CAAACTCGAA ATTGCGGTTA AAAAGCTGGA CGAATTCATG
CGCTCTCAGG GGCTGGAAAG CGCCCCTCAA GAGGTGGCGA ACCTCAAGGG CGACGCCGCA
CGAGGCCAGT TCATCAACCT CTTCAAGGAG GTGCAGCGCC TCAAGACCCA GCTCGACCAA
TACACCGACC TGACGCCGGA AAATGCCGCC AGCATCAGCC GGGTCATCCC ACAAGAGCAA
TTGCAGGGCT TCCGTGGCGT CTATCTCGAA ACCGCCCAGC GCATGAAAGA AAAACAGAAA
AAAGGCAGCG ACGGCCCGGA AACAGAGCAG CTCGACTTTG AATTCGTGCT CTTTGCCTCA
GCCATGATTG ATTACGATTA CATCATGACC CTGATTACAA GCTACTCGCA GCAACTGCCC
GGCAAACAAA AGATGACCCG CACCGAGCTT ATCGGTCTCA TCGACTCCGA GGCCAACCTC
CTTGAAGTTC GCGAAGACAT TGCCGACTAT ATTGGCACCC TCAAGTCAGG CGAAGGGTTG
AAAGAGAGCG ACATCCGTCA GGGTTACGAA ACCTTCAAGG CCGAAAAGAG CGCCCAACAA
CTGGCCGAAA TTGCCGAAAA GCACGGACTG GAAACCTCCG TGCTCCAGGC CTTTGTCGAT
GGCATCATGC AGCGCATGAT CTTCGACGGC GAACACCTGA CCGATCTGCT CGCCCCGCTC
GGCCTCAACT GGAAACAGAG AAGGCAAAAC GAGCTGGCCC TGATGGAAGA GCTGATTCCC
GTGCTGCACA AACTTGCGCA AGGACGCGAA ATTTCAGGGC TGGAGGCGTA TGAGCAATAA
 
Protein sequence
MTTENQTELS LIDKLQDLKY SYRPDIRDRD ALEKNFREKF EALNQIHLTD AEFARLLDQI 
VTPDVFAASR HLRERNSFER DDGTPLFYTL VNIREWCKNS FEVVNQLRIN TNNSHHRYDV
LLLINGVPVV QIELKTLAIS PRRAMQQIVE YKNDPGNGYS KTLLCFLQLF IVSNRTDTWY
FANNNSRHFS FNADERFLPF YQFAGEDNKK ITHLDSFAEK FLAKCTLGEM ISRYMVLVTS
EQKLMMMRPY QIYAVKAIVE CIHQNCGNGY IWHTTGSGKT LTSFKASTLL KDNPDIDKCL
FVVDRKDLDR QTREEFNRFQ EKCVEENTNT ETLVQRLLSD DYANKVIVTT IQKLGLALDG
SNKRNYKERL ELLRKKRMVF IFDECHRSQF GENHKAIKEF FPNAQLFGFT GTPIFPENAS
YQQIEGEQAT WKTTEEIFQQ QLHAYTITHA IEDRNVLRFH IDYFKPEGKN TPKPGETLAK
KAIVEAILQK HDTATYGRKF NALLATASIN DAIAYHELFK SIQEEKQAKD ENFQPLNIAC
VFSPPAQAIA GDAGNRNEKN VADIKQLQED LPQEKADNEE DPDKKKAALK AIIADYNDRY
KTNHRIDEFD LYYQDVQKRI KDQQYPNQDL PHAQKIDLII VVDMLLTGFD SKFLNTLYVD
KNLKYHGLIQ AFSRTNRVLN GTKPYGTILD FRQQQSAVDE AIKLFSGEQA DRATEIWLVD
SAPVVINKLE IAVKKLDEFM RSQGLESAPQ EVANLKGDAA RGQFINLFKE VQRLKTQLDQ
YTDLTPENAA SISRVIPQEQ LQGFRGVYLE TAQRMKEKQK KGSDGPETEQ LDFEFVLFAS
AMIDYDYIMT LITSYSQQLP GKQKMTRTEL IGLIDSEANL LEVREDIADY IGTLKSGEGL
KESDIRQGYE TFKAEKSAQQ LAEIAEKHGL ETSVLQAFVD GIMQRMIFDG EHLTDLLAPL
GLNWKQRRQN ELALMEELIP VLHKLAQGRE ISGLEAYEQ