Gene Cpha266_1128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1128 
Symbol 
ID4570335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1276385 
End bp1279183 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content50% 
IMG OID639765724 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_911592 
Protein GI119356948 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACG AGCAGCAAAC CCGCATCGAA TTGATTGACA GGATGTTACT GCAGGCAAGC 
TGGAACGTGA ACGATCCTCT CCAGGTTGTG GAGGAGTTCG ATATTCTTGT CGGTTTGCCC
GAAGGGGTGC AGGAACCCCG CACCTCTTAT GAAGGGCATC AGTTCAGCGA CTATGTGCTG
CTTGGAAAGG ATGGCAAGCC TCTTGCCGTC GTTGAAGCAA AAAAAACAAG TAGGGATGCA
GCCATTGGCC GTGAACAGGC CAAGCAGTAT TGCTGCAATA TCCGGAAACA GTTGGGCGTT
GAGCTTCCAT TCTGTTTTTA TACCAATGGC CTTGAGACCT TTTTCTGGGA TATCGACAAC
TACCCTCCAC GAAAGGTAAT CGGCTTTCCA ACCCGTGACG ACCTTGAGCG GTTCAGCTAT
ATCCGCAGAA GCCGCAAGCC TCTCACCGGG GAACTGATCA ATACAGCCAT TGCCGGACGC
GACTATCAGA TTCGCGCCAT TCGATCCGTC CTCGAAGCCA TTGAGCAGAA AAAGCGGGAC
TTTCTGCTCG TTATGGCCAC CGGAACCGGT AAAACCCGCA CAGCTATCGC CATGGTTGAC
GCCCTGATGC GTGGCGGACA TGCCGAAAAA ATTCTTTTTC TGGTCGATCG TATTGCCTTG
CGTGAGCAGG CGCTCTCCGC CTTCAAGGAG CATTTGCCCC ACGAACCTCG CTGGCCAAAC
AGCGGTGAAA AGGTTTTTGC CAAAGATCGC CGCATCTACA TTGCCACCTA CCCAACGATG
CTCAACCTCA TCAGGGATGA ATCATCGTAC CTCTCACCCT GGTTTTTTGA CTTTATCGTT
ATTGATGAGA GCCACCGCTC CATCTACAAC ACCTGGGGGG AAATCCTCGA TTACTTCAAA
ACAATCACCC TGGGGCTGAC GGCAACCCCA ACCGATATTC TTGACCACAA CACCTTCAAC
CTCTTTCACT GCGAGAATGG CCTTCCAACC TTTGCCTATA CCTATGAAGA GGCAGTGAAC
AATATTCCTC CATATCTGTG CAATTTCCAG GTGATGAAAA TCCAGACGAA ATTCCAGATG
GAGGGTATCA GCAAGCGGAC GATCTCCCTT GAAGATCAGA AAAAACTGAT TCTCGAAGGC
AAGGATATCG AAGAGATCAA TTTTGAAGGG ACGCAGCTTG AAAAGCAGGT GATCAATCGG
GGGACGAACA GCCTGATTGT CAAAGAGTTC ATGGAGGAGT GCATCAAAGA CCAGAACGGT
GTTCTTCCCG GAAAAACCAT ATTCTTCTGC GCCACCATAG CGCATGCCCG CAGAATTGAG
GAGATATTCG ACCGGCTCTA CCCGGAATAC AAAGGCGAAC TTGCCAAGGT TCTTGTCTCT
GATGACCCCC GAGTCTACGG CAAGGGAGGC TTGCTCGATC AGTTCACGAA CAGCGATATG
CCCCGCATCG CCATCAGTGT TGACATGCTC GATACCGGTA TAGACATCCG GGAACTCGTT
AATCTCGTCT TTGTCAAGCC GGTCTACTCC TACACAAAAT TCTGGCAGAT GATTGGCCGA
GGGACACGGC TTCTTGAACC CGCTAAAATC AAGCCATGGT GCACCAAAAA AGAGCTTTTC
CTGATTCTTG ACTGCTGGGA CAACTTTGAG TACTTCAAGC TTCAGCCCAA AGGCAAAGAG
CTGACACAGC AACTCCCGCT TCCGGTGAAA CTGTTCGGGC TGCGGCTCGA CAAAATTGAA
TATGCGCTCT CAATCGGTAA CACGGCCATT GCTGAGCGGG AAACGGTAAA ACTGCGTAAA
CAGATTGCCG GGCTTCCGCA TACCTCAGTG GTGATCAAAG AGGCCGCATC GCTTCTTCAC
CCTCTCGAAG AGGAGAACTT CTGGATATCT CTCACACCCC AAAAGCTGGA AAATCTGAGA
AGCGGGATCA AACCGCTCTT CAGAACCGTG TCGGATGCCG ACTTTAAAGC CATGCGTTTT
GAGCGGGACG TTCTGGAGAG TTCACTGGCA CAACTTCGCG ACCAGAAAGA GCGCTACGGC
ACGCTTAACG ACGGCATTGC CGAGCAGATC AGCCAGCTTC CCCTGAGCGT CGGCTTTGTG
AAACAGGAAG AGGAACTGAT ACGGGCCGCT CAAACGAAAC ACTTCTGGAA CAAGGCTACG
GAAGAGAGCT TCGACGAACT GATTGAAAAA CTCTCGCCGC TGATGAAATT TCGCGAGCCT
GATAGCGGCG CAATCGGTCA AGTATACCTG AACTTGCAGG ATCTTCTGCA CCATAAAGAG
ATGGTTGAAT TCGGCCCCCG GAATGAGGCC GTCAGCATTA CCCGCTACCG CGAAATGGTT
GAATTGCTCA TTACCGAGCT GACAAAACAG AATCCGATTC TCTCCAGAAT CAAGGAGGGC
AAAGAGATTT CGCCTGAAGA GGCCGCTGAA CTTGCGGAAA TGCTCCACGA AGAGCATCCG
CACATCACCG AGGATCTGTT GCGCTCGGTC TATAACAACC GCAAGGCCCA TTTCATCCAG
TTTATCCGCC ATATTCTCGG GCTCGAAATT CTCAAGAGTT TTCCTGAAAC GGTTGCCGAT
GCTTTTGATC AGTTTATCAA AGAGCACTCA ACCTTCTCCA GCCGCCAACT CGACTTTTTA
AACCTCCTCA AAAACGTTCT TGTAGAACGT GAAAAGATTG AAAAAAGAGA CCTGATCAAT
GCCCCATTTA CGGTCATACA CCCGAAAGGC ATTCGCGGAG TCTTCAATCC GGCTGAAATC
AATGAAATTC TGGCTCTGGC CCGGCAACTT GCAGCATAA
 
Protein sequence
MKNEQQTRIE LIDRMLLQAS WNVNDPLQVV EEFDILVGLP EGVQEPRTSY EGHQFSDYVL 
LGKDGKPLAV VEAKKTSRDA AIGREQAKQY CCNIRKQLGV ELPFCFYTNG LETFFWDIDN
YPPRKVIGFP TRDDLERFSY IRRSRKPLTG ELINTAIAGR DYQIRAIRSV LEAIEQKKRD
FLLVMATGTG KTRTAIAMVD ALMRGGHAEK ILFLVDRIAL REQALSAFKE HLPHEPRWPN
SGEKVFAKDR RIYIATYPTM LNLIRDESSY LSPWFFDFIV IDESHRSIYN TWGEILDYFK
TITLGLTATP TDILDHNTFN LFHCENGLPT FAYTYEEAVN NIPPYLCNFQ VMKIQTKFQM
EGISKRTISL EDQKKLILEG KDIEEINFEG TQLEKQVINR GTNSLIVKEF MEECIKDQNG
VLPGKTIFFC ATIAHARRIE EIFDRLYPEY KGELAKVLVS DDPRVYGKGG LLDQFTNSDM
PRIAISVDML DTGIDIRELV NLVFVKPVYS YTKFWQMIGR GTRLLEPAKI KPWCTKKELF
LILDCWDNFE YFKLQPKGKE LTQQLPLPVK LFGLRLDKIE YALSIGNTAI AERETVKLRK
QIAGLPHTSV VIKEAASLLH PLEEENFWIS LTPQKLENLR SGIKPLFRTV SDADFKAMRF
ERDVLESSLA QLRDQKERYG TLNDGIAEQI SQLPLSVGFV KQEEELIRAA QTKHFWNKAT
EESFDELIEK LSPLMKFREP DSGAIGQVYL NLQDLLHHKE MVEFGPRNEA VSITRYREMV
ELLITELTKQ NPILSRIKEG KEISPEEAAE LAEMLHEEHP HITEDLLRSV YNNRKAHFIQ
FIRHILGLEI LKSFPETVAD AFDQFIKEHS TFSSRQLDFL NLLKNVLVER EKIEKRDLIN
APFTVIHPKG IRGVFNPAEI NEILALARQL AA