Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PC1_0849 |
Symbol | |
ID | 8131774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | + |
Start bp | 987593 |
End bp | 990589 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644864130 |
Product | Type III site-specific deoxyribonuclease |
Protein accession | YP_003016436 |
Protein GI | 253687246 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCATC ATTCTCAAAA TACGGGCTTT CAATTTGATT CACATCAGCA ATATCAGCTC GATGCCATCA ACGCCGTGGT GGATTTGTTC GATGGTCAGC CAAAGGATGC TGACAAGATC GCGATTGCCT TGCGTGGCAG CGTTGCCAGC CGGGAAGGTG AGCTGGATTT AGGCATCGAG CAAGTAATTG GTGCTATCGG CAATAATCTG GTGCTGGATG AAGACGCTAT TCTTGCTAAT TTACAAACGG TACAGGATCG CAACGGGCTG GAGGTCAGCG AAAAACTGGT TGATGGCAAA CTGGATTTCG ACATCGAAAT GGAAACCGGT ACGGGTAAAA CTTACGTTTA TCTGCGTACC GTTTTCGAAC TGGCAAAGAA ATATGGCTTC ACCAAATTTA TCATTCTGGT TCCCAGCGTC GCCATTCGTG AAGGCGTGAA CACCAGTATT CGCCTGATGC GCGATCACTT CCGTAGCCTC TATCCGGCGC AGCCTTTCGA TGCGAATGTC TACAGAGGCG ACAAGGCCGA GGAAGTACAG GCTTTTGCTA CTGCAACCAA CGTGCAAATA CTGGTGATGA CGATTGATGC GGTGCGTAGT GACAAGAATA AAAACGAGAA TAAGCGCATT ATTCACAAAC CTTATGACAA GCTCAATGGC TTGCGTCCGT TAGATTACCT GAAGGGCACG CATCCCGTGG TGATCATGGA TGAACCACAA AATATGGAGT CGTTACTGTC GCAGTCGGCA GTGGTCGAGT TCGATCCCGT ATTCACGCTG CGCTACTCGG CCACGCATAA GCAGCGGAGA AACCAGGTCT ATCGGCTCGA TCCGGTTGAT GCACACGATC TAGGTCTGGT GAAACAGATC GTCGTGGCTG AAGTTGCACA GCACGGTGCG GATGTTGCGC CTTACGTCAA ACTACTGGAA GTGAACGACA CGAAAGGTGC GAAACTTGAA CTGGCTTGCC GCAAGGCCGA TGGCTCCATC GCGCGTCGTA GCAAAAAGGT CAAGCCGCAT CAGGAATTGT CCGATGTCAC GGGTAATCCG GCTTACAAAG ATTGGCGTAT TAATGCGTTG AGCATCGCCG CTTTCGGTGA ATCAGCCAGT ATTGAACTGA CGCCGAATGG AAACTTGCTG CACGAAGGTG AATCCCTGGG GGGCGCAACG GGCGCGGTTT ACAAGGAAAT GATTCGGGAA ACCGTGCGTG AGCATTTGCG TAAAGAGGCC ATGTTACGCC CGAAAGGGAT CAAGGTGCTG AGTCTGTTTT TCGTGGACAA GGTGGCAAGC TTCCTCGGCG ACGGGGTAAA CAACGAAGAC GCTAATGGCG AGTTCGTGAA ATGGTTTGAT GAAGTGTTCA TGGAAGAACG CGGCAAGTCT GACGTCTACC AAAAATTGCT GCCGCAGGAG CCTTGCGAAT TACGACGAGC CTATTTTTCT CAACTAAAAA CACGTGGCAG AACAACCTTT GTGGATTCAT CTGGTTCGGC TGTCAAAGAT GACGATGCCT ACAAACTCAT TATGCAGGAC AAGCAACGCC TGCTGGACAG CGATGAACCC GTACGCTTCA TCTTTAGCCA TTCCGCATTG CGTGAAGGCT GGGACAACCC CAACGTGTTT CAGATTTGTA CACTGCGTGA AATGGGCGCG GAAACGGAAC GTCGGCAGAC GCTGGGGCGT GGTCTGCGCC TGCCGGTAGC GAAAACAGTA AAAGGCTATG AGCGAGTTAC CGACCACAGC GTTGCACAGC TCACCGTGGT TGCTAACGAA TCTTATGCTA CGTTCGCTCA GAATCTGCAA GCTGAATACA AAGAGGCGGG TGTTGCCATT GGACAGGTGC GCTCCAATGA ATTTGCCAAA CTGATGAAAC GGGATATTCA TGGCGTGATG ACCGATGATA TGCTGGGTTT CAAGACATCT TCTGCCATCT TCAAACATCT GGAAAACGCG GGATTCATCA AGGATGGTAA GACAACTTCC ATGTTTCTGC CGGATGCAGA AGGATTCTCA TTACATCTAC CTGATGAGCT CAGGCCCTAC GAATCAGATA TTATCCGGTG CATCCTTAAT GCGGGTATTG AGAAATATGT GAAGCCTGTA AGTGCGCGGG CTAAGCGTAA ATTCAATAAA GAGCTCTATG CCTCGCCGGA TTTTGAGAGG CTTTGGAGCG CTATCAGCCA GAAAACTACT TATCGGGTGA CAGTTAAACA CCCTATGCTG GTTGAGGCGT GTATAAAGGC GATCAAGGCT GAACCCAAAA TCCAGCCGTT ACGTATTGAT GTCACCCGTG CGGGTGTCAG GGTGCTGCGT GGCGGCACGC AAGGGCAAGA ACTGGGCGCG CGGACTGTCG ACCTCAAAGG CAGCTATGAC CTGCCGGACA TTATCGGTGA GTTGCAGCAA GGTACGTCTC TCACGCGCAA GACGCTGGTG GATATTCTCA TCGGTAGCGG CAGGTTGGAC GAGTTCATTG CCAATCCGAA CGACTTTATG GCGATGGTTC GCCGTTGCGT AGAGGGCGAG CTACAGCGCG TCATTGTAGA AGGCGTTCAG TATGAAAAGA TCGGCGGCTC CGTCTATGAA CTCCGTGAGT TGCAAGCCGA CGGCCTTGCT GAAAATGATT TCTTCAAAGA GCATCTTTAC CGCGTTGAAC ACCCCGAGAA AACTGACTTT GATTATGTCG TGTTCGATGG CAGACCGGAT AGCCCGGAGC GAAAGTTTGC CGAGTTCCTC GACCACCGTG AGGATATCCG GCTGTTTATG AAGCTGCCGC CAAGATTCCA GATCGACACG CCGGTTGGCC CCTATAACCC GGACTGGGCT ATCATCAAAT CCGAAGACGG CGAAGACCGT ATCTATATGG TTCGCGAAAC CAAGAGCACC ATGGATGAAC AAAAAAGGAG ACCTTCCGAA AATGCCAAAA TCAAGTCTGC CGAGGCACAC TTCAGAGAAA TTGGCATCGA GTATGCGGTT TCAGTACCTG AACAGTGGAA TATCTGA
|
Protein sequence | MNHHSQNTGF QFDSHQQYQL DAINAVVDLF DGQPKDADKI AIALRGSVAS REGELDLGIE QVIGAIGNNL VLDEDAILAN LQTVQDRNGL EVSEKLVDGK LDFDIEMETG TGKTYVYLRT VFELAKKYGF TKFIILVPSV AIREGVNTSI RLMRDHFRSL YPAQPFDANV YRGDKAEEVQ AFATATNVQI LVMTIDAVRS DKNKNENKRI IHKPYDKLNG LRPLDYLKGT HPVVIMDEPQ NMESLLSQSA VVEFDPVFTL RYSATHKQRR NQVYRLDPVD AHDLGLVKQI VVAEVAQHGA DVAPYVKLLE VNDTKGAKLE LACRKADGSI ARRSKKVKPH QELSDVTGNP AYKDWRINAL SIAAFGESAS IELTPNGNLL HEGESLGGAT GAVYKEMIRE TVREHLRKEA MLRPKGIKVL SLFFVDKVAS FLGDGVNNED ANGEFVKWFD EVFMEERGKS DVYQKLLPQE PCELRRAYFS QLKTRGRTTF VDSSGSAVKD DDAYKLIMQD KQRLLDSDEP VRFIFSHSAL REGWDNPNVF QICTLREMGA ETERRQTLGR GLRLPVAKTV KGYERVTDHS VAQLTVVANE SYATFAQNLQ AEYKEAGVAI GQVRSNEFAK LMKRDIHGVM TDDMLGFKTS SAIFKHLENA GFIKDGKTTS MFLPDAEGFS LHLPDELRPY ESDIIRCILN AGIEKYVKPV SARAKRKFNK ELYASPDFER LWSAISQKTT YRVTVKHPML VEACIKAIKA EPKIQPLRID VTRAGVRVLR GGTQGQELGA RTVDLKGSYD LPDIIGELQQ GTSLTRKTLV DILIGSGRLD EFIANPNDFM AMVRRCVEGE LQRVIVEGVQ YEKIGGSVYE LRELQADGLA ENDFFKEHLY RVEHPEKTDF DYVVFDGRPD SPERKFAEFL DHREDIRLFM KLPPRFQIDT PVGPYNPDWA IIKSEDGEDR IYMVRETKST MDEQKRRPSE NAKIKSAEAH FREIGIEYAV SVPEQWNI
|
| |