Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_0743 |
Symbol | |
ID | 6461389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 776460 |
End bp | 779459 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642726999 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002017654 |
Protein GI | 194335860 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.575824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACTG AAAACCAGAC CGAGCTGAGC CTGATCGATA AACTCCAGGA TCTCAAATAC AGCTACCGCC CCGACATTCG CGACCGGGAC GCACTCGAAA AAAACTTCCG CGAAAAGTTC GAAGCCCTCA ACCAGATTCA TCTCACCGAC GCCGAGTTTG CCCGGCTGCT CGATCAAATC GTCACCCCGG ACGTTTTCGC CGCCTCTCGT CATCTGCGCG AACGCAACAG CTTCGAGCGC GACGACGGCA CACCACTCTT CTACACTCTG GTCAACATCC GGGAGTGGTG CAAAAACAGC TTCGAGGTCG TCAACCAGCT CCGCATCAAC ACCAACAACA GCCATCACCG CTACGATGTG TTGCTCCTCA TCAACGGTGT GCCGGTGGTT CAGATCGAGC TGAAGACCCT CGCCATCAGC CCGCGCCGCG CCATGCAGCA GATTGTCGAG TACAAAAACG ACCCCGGCAA CGGCTACAGC AAAACCCTGC TCTGCTTTTT GCAACTCTTC ATCGTCAGCA ACCGCACCGA CACCTGGTAC TTCGCCAACA ATAACAGTCG CCACTTCAGC TTTAACGCCG ACGAGCGTTT TCTGCCGTTC TACCAGTTCG CCGGAGAAGA CAACAAAAAA ATCACCCATC TCGACAGCTT CGCCGAAAAG TTCCTCGCCA AATGCACCCT CGGCGAAATG ATCAGCCGCT ACATGGTGCT GGTGACGAGC GAGCAAAAGC TGATGATGAT GCGCCCCTAC CAGATCTATG CCGTCAAGGC TATCGTGGAG TGCATTCACC AGAACTGCGG TAACGGCTAC ATCTGGCACA CCACCGGCAG CGGCAAAACC CTCACCTCCT TCAAGGCATC AACCCTGCTC AAGGATAACC CGGATATCGA CAAATGCCTT TTCGTCGTTG ACCGCAAAGA CCTCGACCGG CAGACGCGGG AGGAGTTCAA CCGCTTTCAG GAGAAGTGCG TCGAAGAGAA CACCAACACC GAAACCCTGG TGCAGCGGTT GCTCTCCGAT GACTATGCCA ATAAAGTGAT CGTCACCACC ATCCAGAAGC TCGGCCTTGC CCTCGACGGC AGCAACAAAC GCAACTACAA GGAGCGGCTC GAACTGCTCC GCAAAAAGCG CATGGTTTTC ATCTTTGACG AATGCCACCG CTCCCAGTTC GGCGAAAACC ACAAGGCGAT CAAAGAGTTT TTCCCCAACG CCCAGCTCTT CGGCTTCACC GGCACACCAA TTTTCCCCGA AAACGCCAGC TACCAGCAGA TTGAAGGGGA GCAGGCCACC TGGAAAACCA CGGAAGAGAT CTTCCAGCAG CAACTGCATG CCTACACCAT TACCCACGCC ATCGAAGACC GCAACGTCCT GCGCTTCCAC ATCGACTATT TCAAGCCCGA AGGTAAGAAC ACGCCCAAGC CCGGCGAAAC GCTGGCAAAA AAAGCCATCG TTGAAGCCAT CCTTCAAAAA CACGATACCG CCACCTACGG GCGCAAGTTT AACGCCCTTC TGGCCACCGC ATCTATCAAC GACGCCATAG CTTATCACGA GCTGTTCAAG AGCATTCAGG AGGAAAAGCA GGCCAAAGAT GAAAATTTCC AGCCACTCAA CATCGCCTGC GTCTTTTCCC CGCCTGCCCA GGCCATTGCT GGTGATGCTG GGAATCGAAA CGAAAAGAAT GTGGCAGACA TCAAGCAGCT TCAGGAAGAT CTGCCACAGG AAAAGGCCGA TAACGAGGAA GACCCGGACA AAAAAAAGGC GGCCCTCAAA GCCATCATTG CCGACTACAA CGACCGTTAC AAAACCAACC ACCGCATCGA CGAATTCGAC CTCTACTACC AAGATGTGCA AAAGCGTATC AAAGATCAGC AATACCCCAA CCAGGACTTG CCGCACGCGC AGAAGATCGA CCTGATAATC GTGGTCGACA TGCTGCTCAC CGGCTTCGAC TCCAAATTCC TGAACACCCT CTACGTCGAC AAAAACCTCA AGTACCACGG GCTCATTCAG GCCTTCTCAC GCACCAACCG CGTACTGAAC GGCACCAAGC CCTACGGCAC CATTCTCGAC TTCCGCCAGC AGCAAAGCGC CGTCGATGAA GCCATCAAAC TATTCTCTGG CGAACAAGCC GACCGCGCCA CCGAAATCTG GCTGGTCGAT TCCGCACCGG TGGTCATCAA CAAACTCGAA ATTGCGGTTA AAAAGCTGGA CGAATTCATG CGCTCTCAGG GGCTGGAAAG CGCCCCTCAA GAGGTGGCGA ACCTCAAGGG CGACGCCGCA CGAGGCCAGT TCATCAACCT CTTCAAGGAG GTGCAGCGCC TCAAGACCCA GCTCGACCAA TACACCGACC TGACGCCGGA AAATGCCGCC AGCATCAGCC GGGTCATCCC ACAAGAGCAA TTGCAGGGCT TCCGTGGCGT CTATCTCGAA ACCGCCCAGC GCATGAAAGA AAAACAGAAA AAAGGCAGCG ACGGCCCGGA AACAGAGCAG CTCGACTTTG AATTCGTGCT CTTTGCCTCA GCCATGATTG ATTACGATTA CATCATGACC CTGATTACAA GCTACTCGCA GCAACTGCCC GGCAAACAAA AGATGACCCG CACCGAGCTT ATCGGTCTCA TCGACTCCGA GGCCAACCTC CTTGAAGTTC GCGAAGACAT TGCCGACTAT ATTGGCACCC TCAAGTCAGG CGAAGGGTTG AAAGAGAGCG ACATCCGTCA GGGTTACGAA ACCTTCAAGG CCGAAAAGAG CGCCCAACAA CTGGCCGAAA TTGCCGAAAA GCACGGACTG GAAACCTCCG TGCTCCAGGC CTTTGTCGAT GGCATCATGC AGCGCATGAT CTTCGACGGC GAACACCTGA CCGATCTGCT CGCCCCGCTC GGCCTCAACT GGAAACAGAG AAGGCAAAAC GAGCTGGCCC TGATGGAAGA GCTGATTCCC GTGCTGCACA AACTTGCGCA AGGACGCGAA ATTTCAGGGC TGGAGGCGTA TGAGCAATAA
|
Protein sequence | MTTENQTELS LIDKLQDLKY SYRPDIRDRD ALEKNFREKF EALNQIHLTD AEFARLLDQI VTPDVFAASR HLRERNSFER DDGTPLFYTL VNIREWCKNS FEVVNQLRIN TNNSHHRYDV LLLINGVPVV QIELKTLAIS PRRAMQQIVE YKNDPGNGYS KTLLCFLQLF IVSNRTDTWY FANNNSRHFS FNADERFLPF YQFAGEDNKK ITHLDSFAEK FLAKCTLGEM ISRYMVLVTS EQKLMMMRPY QIYAVKAIVE CIHQNCGNGY IWHTTGSGKT LTSFKASTLL KDNPDIDKCL FVVDRKDLDR QTREEFNRFQ EKCVEENTNT ETLVQRLLSD DYANKVIVTT IQKLGLALDG SNKRNYKERL ELLRKKRMVF IFDECHRSQF GENHKAIKEF FPNAQLFGFT GTPIFPENAS YQQIEGEQAT WKTTEEIFQQ QLHAYTITHA IEDRNVLRFH IDYFKPEGKN TPKPGETLAK KAIVEAILQK HDTATYGRKF NALLATASIN DAIAYHELFK SIQEEKQAKD ENFQPLNIAC VFSPPAQAIA GDAGNRNEKN VADIKQLQED LPQEKADNEE DPDKKKAALK AIIADYNDRY KTNHRIDEFD LYYQDVQKRI KDQQYPNQDL PHAQKIDLII VVDMLLTGFD SKFLNTLYVD KNLKYHGLIQ AFSRTNRVLN GTKPYGTILD FRQQQSAVDE AIKLFSGEQA DRATEIWLVD SAPVVINKLE IAVKKLDEFM RSQGLESAPQ EVANLKGDAA RGQFINLFKE VQRLKTQLDQ YTDLTPENAA SISRVIPQEQ LQGFRGVYLE TAQRMKEKQK KGSDGPETEQ LDFEFVLFAS AMIDYDYIMT LITSYSQQLP GKQKMTRTEL IGLIDSEANL LEVREDIADY IGTLKSGEGL KESDIRQGYE TFKAEKSAQQ LAEIAEKHGL ETSVLQAFVD GIMQRMIFDG EHLTDLLAPL GLNWKQRRQN ELALMEELIP VLHKLAQGRE ISGLEAYEQ
|
| |