Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_1130 |
Symbol | |
ID | 6462667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 1171304 |
End bp | 1174279 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642727376 |
Product | type III restriction system endonuclease |
Protein accession | YP_002018026 |
Protein GI | 194336232 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACTGC ACTTTGAACC GAACCTTGAC TATCAACTAC AGGCGGTTGA AGCGGTCTGC GACCTCTTTC GCGGGCAGGA GATCTGCCGC ACGGAGTTTA CCGTTACCAT GCCGACCACG AAAGACATGC AGTTTCCTGG CATGGAGAGT GATCTTGGCA TTGGCAACCG GCTTACGTTG CTTGATGATG ACCTGCTCAA AAGCCTGCAT GATATTCAGC TCCGCAACGG CCTGCCGCCT TCCGGAACCC TTGCTTCCGG CGACTTCACG GTAGAGATGG AGACCGGAAC GGGCAAAACC TACGTCTATC TGCGTACCAT CTTTGAACTC AACAAGCGGT ACGGCTTTAC CAAGTTTGTG ATTGTCGTCC CGTCAGTGGC TATCAAGGAG GGGGTTTACA AAACCCTTCA GATTACCGAA GATCACTTCA AGGGGCTCTA TGCCGGAGTA CCTTTTGAAT ATTTCCTCTA TGACTCCTCA AAACTGGGGC AGGTGCGTAA TTTTGCCACC AGCTCGCAAA TCCAGATTAT GGTGATGACG GTTGGCGCCA TCAACAAAAA GGATGTCAAT AATATTTACG GTAAGGGTGC CAATGAGTCT ACTGGCGGGG AAAAGCCGAT TGATCTTGTC AAGGCGGTGC GGCCCATCCT GATTATTGAT GAACCGCAAA GTGTGGATGG TGGTCTTTCC GGTGCTGGCA AAACCGCGCT CGATGCCATG AGCCCGCTTT GTACCTTGCG TTATTCCGCT ACGCATGTGG ACAAGCACCA GATGGTTTAT CGTCTTGATG CGGTTGATGC GCACGATAAG GGGCTGGTGA AGCAGATTGA GATTGCTGCC GCTACGGTGG AAGATGCCTT CAACAAGCCT TATGTTCGTC TCTTGAAGGT TGAGAACAAG CGGGGCAGGA TTTCCGCACT CGTTGAACTC GACAAGATGA CGGCCAGCGG TGTCAGGCGA AAAGAGAGTG TTGTGGTTGA TGGTGATAAT CTTGAAATGG TTACCGGACG GGCGATCTAT GCTGATTGCT CTATCGGTGA AATTCGTGTT GAAAAGGGCA AGGAGTTTAT GGAGTTGCGC TTCCCCGGTG GTGAGCAGTT CCTGAAACCT GGCGAGGCGT ATGGAGATGT GGATGCTCTT GCCGTTCAGC GCCAGATGAT TCGCCGTACC ATCAAGGAGC ACCTCGACAA GGAGCTGCGT CTGCGCCCGC AGGGTATCAA AGTGCTTTCG CTCTTCTTTA TTGATGAGGT TGCCAAATAC CGCTCTTACG ACAGTGACGG TAACTCAGTG AAGGGCGATT ATGCCCGCAT TTTCGAGGAG GAGTACCGCC GGGCGGCAAA GCTGCCTGAT TATTTTACAC TGTTCAGTGA GGTTGACCTG ACGCATGCCC CCGAAGAGGT GCATAACGGC TATTTCTCGA TCGACAAAAA AGGTGGCTGG ACCAATACCG AGGAGAACAA TCAGGGCAAT CGTGAAAGTG CTGAACGGGC CTACAACCTG ATCATGAAGG AGAAAGAGAA GCTGCTCTCA CTGGAGACAC CACTGAAATT CATCTTTTCT CACTCTGCAC TCAAGGAGGG GTGGGACAAC CCCAATGTCT TCCAGATTTG TGCACTGCGT GAAATGGGCT CAGAGCGTGA ACGCCGCCAG ACGATAGGGC GTGGTCTGCG GTTATGCGTC AATCAGCACG GTCTGCGACT GCGTGGATTT GAAGTCAATA CGCTGACGGT TATTGCAACG GAGAGTTATG AGCAGTTTGC CGAAAATCTT CAGAGAGAGA TTGAGGCTGA AACCGGTATT CGATTCGGGA TTGTCGAAGA ACACCAGTTT GCGGCGATTT CCGTTACTAC TGACGAAGGA GTCTCAGTTC CTCTTGGATT CGAGCAGTCG AAGCTCTTGT GGGAGCATCT GAAAGAGCAG AACTATATCG ACAGTAAGGG CAAGGTGCAG GATTCGCTGA AAAGTGCGCT CAAGGAGGGG ACTCTTCTTG TGCCGGAGCA ATTCAGGCCA CAGCTTGATC AGATTACCGC TGTACTGAAA AAGCTTGCAG GACGTCTGGA AATCAAGAAT GCGAATGAGC GGAGACAGGT ACACACCCGG CAGGCAGTGC TGCACAGTCC GGAATTCAAA GAGCTTTGGG AGCGTATCAA GCATAAAACG ACCTATCGGG TGGAGTTCGA CAATGAAAAG CTGGTGGAGA GCTGCATCAA GGCATTACAG AATGCTCCCG CCATTTTGAA GACACGCATG CAGTGGCACA AGGCTGAAAT CGTTATTGGT AAATCGGGCG TTGAAGCTAC TGAACGGAGT GGTGCCGCAA CGGTTGTGCT TGATGAAAAC GATATTGAGT TGCCTGACCT TCTTACGGAT CTGCAGGATC GTACCCGGCT GACCCGCCGC ACGATTCATC GTATTTTGAT CGGAAGTGAG CGACTCGACG ATTTTAAGCG TAACCCGCAA CAGTTCATTG AGCTGGCAGC CGAAGCCATC AAGCGGTGCA AGCAACTCGC TGTTGTGGAT GGCATCAAGT ATCACCGCCT TGGTGACGAA CATTTTTACG CCCAGGAGCT TTTTGAGCAA CAGGAACTTA CTGGTTACCT CAAGAACCTT ATCCCGGTTC AGAAATCAGT CTATGAGCAG GTGGTCTATG ACTCCGATCC CGAAGCAACT TTTGCTGATC AGTTGGAGAA GAATCTGGCC ATCAAGGTCT ATGCAAAGCT TCCGGGATGG TTCAGGATTC CAACACCACT CGATACCTAC AATCCCGACT GGGCGGTACT CGTCGAAAAG GATGGCGAAG AGCGAATCTA CTTTGTTGTC GAGACCAAGA GCAGCCTGTT TACCGATGAC CTGCGTGACA AGGAGAGTGC AAAGATTGAG TGCGGCAAAG CTCACTTCAA GGCGCTCAGT ATTTGCGAAA CCCCCGCAAG GTATGTTGTG GCCCGCTCTT TGGATGACGT GCTGAACAGG GTTTAA
|
Protein sequence | MKLHFEPNLD YQLQAVEAVC DLFRGQEICR TEFTVTMPTT KDMQFPGMES DLGIGNRLTL LDDDLLKSLH DIQLRNGLPP SGTLASGDFT VEMETGTGKT YVYLRTIFEL NKRYGFTKFV IVVPSVAIKE GVYKTLQITE DHFKGLYAGV PFEYFLYDSS KLGQVRNFAT SSQIQIMVMT VGAINKKDVN NIYGKGANES TGGEKPIDLV KAVRPILIID EPQSVDGGLS GAGKTALDAM SPLCTLRYSA THVDKHQMVY RLDAVDAHDK GLVKQIEIAA ATVEDAFNKP YVRLLKVENK RGRISALVEL DKMTASGVRR KESVVVDGDN LEMVTGRAIY ADCSIGEIRV EKGKEFMELR FPGGEQFLKP GEAYGDVDAL AVQRQMIRRT IKEHLDKELR LRPQGIKVLS LFFIDEVAKY RSYDSDGNSV KGDYARIFEE EYRRAAKLPD YFTLFSEVDL THAPEEVHNG YFSIDKKGGW TNTEENNQGN RESAERAYNL IMKEKEKLLS LETPLKFIFS HSALKEGWDN PNVFQICALR EMGSERERRQ TIGRGLRLCV NQHGLRLRGF EVNTLTVIAT ESYEQFAENL QREIEAETGI RFGIVEEHQF AAISVTTDEG VSVPLGFEQS KLLWEHLKEQ NYIDSKGKVQ DSLKSALKEG TLLVPEQFRP QLDQITAVLK KLAGRLEIKN ANERRQVHTR QAVLHSPEFK ELWERIKHKT TYRVEFDNEK LVESCIKALQ NAPAILKTRM QWHKAEIVIG KSGVEATERS GAATVVLDEN DIELPDLLTD LQDRTRLTRR TIHRILIGSE RLDDFKRNPQ QFIELAAEAI KRCKQLAVVD GIKYHRLGDE HFYAQELFEQ QELTGYLKNL IPVQKSVYEQ VVYDSDPEAT FADQLEKNLA IKVYAKLPGW FRIPTPLDTY NPDWAVLVEK DGEERIYFVV ETKSSLFTDD LRDKESAKIE CGKAHFKALS ICETPARYVV ARSLDDVLNR V
|
| |