Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1128 |
Symbol | |
ID | 4570335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1276385 |
End bp | 1279183 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765724 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_911592 |
Protein GI | 119356948 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACG AGCAGCAAAC CCGCATCGAA TTGATTGACA GGATGTTACT GCAGGCAAGC TGGAACGTGA ACGATCCTCT CCAGGTTGTG GAGGAGTTCG ATATTCTTGT CGGTTTGCCC GAAGGGGTGC AGGAACCCCG CACCTCTTAT GAAGGGCATC AGTTCAGCGA CTATGTGCTG CTTGGAAAGG ATGGCAAGCC TCTTGCCGTC GTTGAAGCAA AAAAAACAAG TAGGGATGCA GCCATTGGCC GTGAACAGGC CAAGCAGTAT TGCTGCAATA TCCGGAAACA GTTGGGCGTT GAGCTTCCAT TCTGTTTTTA TACCAATGGC CTTGAGACCT TTTTCTGGGA TATCGACAAC TACCCTCCAC GAAAGGTAAT CGGCTTTCCA ACCCGTGACG ACCTTGAGCG GTTCAGCTAT ATCCGCAGAA GCCGCAAGCC TCTCACCGGG GAACTGATCA ATACAGCCAT TGCCGGACGC GACTATCAGA TTCGCGCCAT TCGATCCGTC CTCGAAGCCA TTGAGCAGAA AAAGCGGGAC TTTCTGCTCG TTATGGCCAC CGGAACCGGT AAAACCCGCA CAGCTATCGC CATGGTTGAC GCCCTGATGC GTGGCGGACA TGCCGAAAAA ATTCTTTTTC TGGTCGATCG TATTGCCTTG CGTGAGCAGG CGCTCTCCGC CTTCAAGGAG CATTTGCCCC ACGAACCTCG CTGGCCAAAC AGCGGTGAAA AGGTTTTTGC CAAAGATCGC CGCATCTACA TTGCCACCTA CCCAACGATG CTCAACCTCA TCAGGGATGA ATCATCGTAC CTCTCACCCT GGTTTTTTGA CTTTATCGTT ATTGATGAGA GCCACCGCTC CATCTACAAC ACCTGGGGGG AAATCCTCGA TTACTTCAAA ACAATCACCC TGGGGCTGAC GGCAACCCCA ACCGATATTC TTGACCACAA CACCTTCAAC CTCTTTCACT GCGAGAATGG CCTTCCAACC TTTGCCTATA CCTATGAAGA GGCAGTGAAC AATATTCCTC CATATCTGTG CAATTTCCAG GTGATGAAAA TCCAGACGAA ATTCCAGATG GAGGGTATCA GCAAGCGGAC GATCTCCCTT GAAGATCAGA AAAAACTGAT TCTCGAAGGC AAGGATATCG AAGAGATCAA TTTTGAAGGG ACGCAGCTTG AAAAGCAGGT GATCAATCGG GGGACGAACA GCCTGATTGT CAAAGAGTTC ATGGAGGAGT GCATCAAAGA CCAGAACGGT GTTCTTCCCG GAAAAACCAT ATTCTTCTGC GCCACCATAG CGCATGCCCG CAGAATTGAG GAGATATTCG ACCGGCTCTA CCCGGAATAC AAAGGCGAAC TTGCCAAGGT TCTTGTCTCT GATGACCCCC GAGTCTACGG CAAGGGAGGC TTGCTCGATC AGTTCACGAA CAGCGATATG CCCCGCATCG CCATCAGTGT TGACATGCTC GATACCGGTA TAGACATCCG GGAACTCGTT AATCTCGTCT TTGTCAAGCC GGTCTACTCC TACACAAAAT TCTGGCAGAT GATTGGCCGA GGGACACGGC TTCTTGAACC CGCTAAAATC AAGCCATGGT GCACCAAAAA AGAGCTTTTC CTGATTCTTG ACTGCTGGGA CAACTTTGAG TACTTCAAGC TTCAGCCCAA AGGCAAAGAG CTGACACAGC AACTCCCGCT TCCGGTGAAA CTGTTCGGGC TGCGGCTCGA CAAAATTGAA TATGCGCTCT CAATCGGTAA CACGGCCATT GCTGAGCGGG AAACGGTAAA ACTGCGTAAA CAGATTGCCG GGCTTCCGCA TACCTCAGTG GTGATCAAAG AGGCCGCATC GCTTCTTCAC CCTCTCGAAG AGGAGAACTT CTGGATATCT CTCACACCCC AAAAGCTGGA AAATCTGAGA AGCGGGATCA AACCGCTCTT CAGAACCGTG TCGGATGCCG ACTTTAAAGC CATGCGTTTT GAGCGGGACG TTCTGGAGAG TTCACTGGCA CAACTTCGCG ACCAGAAAGA GCGCTACGGC ACGCTTAACG ACGGCATTGC CGAGCAGATC AGCCAGCTTC CCCTGAGCGT CGGCTTTGTG AAACAGGAAG AGGAACTGAT ACGGGCCGCT CAAACGAAAC ACTTCTGGAA CAAGGCTACG GAAGAGAGCT TCGACGAACT GATTGAAAAA CTCTCGCCGC TGATGAAATT TCGCGAGCCT GATAGCGGCG CAATCGGTCA AGTATACCTG AACTTGCAGG ATCTTCTGCA CCATAAAGAG ATGGTTGAAT TCGGCCCCCG GAATGAGGCC GTCAGCATTA CCCGCTACCG CGAAATGGTT GAATTGCTCA TTACCGAGCT GACAAAACAG AATCCGATTC TCTCCAGAAT CAAGGAGGGC AAAGAGATTT CGCCTGAAGA GGCCGCTGAA CTTGCGGAAA TGCTCCACGA AGAGCATCCG CACATCACCG AGGATCTGTT GCGCTCGGTC TATAACAACC GCAAGGCCCA TTTCATCCAG TTTATCCGCC ATATTCTCGG GCTCGAAATT CTCAAGAGTT TTCCTGAAAC GGTTGCCGAT GCTTTTGATC AGTTTATCAA AGAGCACTCA ACCTTCTCCA GCCGCCAACT CGACTTTTTA AACCTCCTCA AAAACGTTCT TGTAGAACGT GAAAAGATTG AAAAAAGAGA CCTGATCAAT GCCCCATTTA CGGTCATACA CCCGAAAGGC ATTCGCGGAG TCTTCAATCC GGCTGAAATC AATGAAATTC TGGCTCTGGC CCGGCAACTT GCAGCATAA
|
Protein sequence | MKNEQQTRIE LIDRMLLQAS WNVNDPLQVV EEFDILVGLP EGVQEPRTSY EGHQFSDYVL LGKDGKPLAV VEAKKTSRDA AIGREQAKQY CCNIRKQLGV ELPFCFYTNG LETFFWDIDN YPPRKVIGFP TRDDLERFSY IRRSRKPLTG ELINTAIAGR DYQIRAIRSV LEAIEQKKRD FLLVMATGTG KTRTAIAMVD ALMRGGHAEK ILFLVDRIAL REQALSAFKE HLPHEPRWPN SGEKVFAKDR RIYIATYPTM LNLIRDESSY LSPWFFDFIV IDESHRSIYN TWGEILDYFK TITLGLTATP TDILDHNTFN LFHCENGLPT FAYTYEEAVN NIPPYLCNFQ VMKIQTKFQM EGISKRTISL EDQKKLILEG KDIEEINFEG TQLEKQVINR GTNSLIVKEF MEECIKDQNG VLPGKTIFFC ATIAHARRIE EIFDRLYPEY KGELAKVLVS DDPRVYGKGG LLDQFTNSDM PRIAISVDML DTGIDIRELV NLVFVKPVYS YTKFWQMIGR GTRLLEPAKI KPWCTKKELF LILDCWDNFE YFKLQPKGKE LTQQLPLPVK LFGLRLDKIE YALSIGNTAI AERETVKLRK QIAGLPHTSV VIKEAASLLH PLEEENFWIS LTPQKLENLR SGIKPLFRTV SDADFKAMRF ERDVLESSLA QLRDQKERYG TLNDGIAEQI SQLPLSVGFV KQEEELIRAA QTKHFWNKAT EESFDELIEK LSPLMKFREP DSGAIGQVYL NLQDLLHHKE MVEFGPRNEA VSITRYREMV ELLITELTKQ NPILSRIKEG KEISPEEAAE LAEMLHEEHP HITEDLLRSV YNNRKAHFIQ FIRHILGLEI LKSFPETVAD AFDQFIKEHS TFSSRQLDFL NLLKNVLVER EKIEKRDLIN APFTVIHPKG IRGVFNPAEI NEILALARQL AA
|
| |