Gene Information |
Locus tag | Cpha266_1495 |
Symbol | |
ID | 4570284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1695958 |
End bp | 1698753 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639766078 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_911943 |
Protein GI | 119357299 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| |
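The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1695958
end_bp = 1698753
gene_length_bp = 2796
protein_length_aa = 931

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not counted
# as a residue in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # span matches Gene Length
print(span_bp % 3 == 0)                           # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)   # matches Protein Length
```

Under these assumptions the three fields agree: the span is a whole number of codons, and dropping the stop codon gives the listed protein length.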
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAC GATATTCTCT GAACCAGACT CCCGAGCAGA TTGCGCGGGA TGTTATCGAT GTACAGTTGC GGCTGGCAGG TTGGGCTGTT CAGGAAAAAA ATCGCATCGA CTGGCAGGTT TCGTCGGGTA TAGCCGTGAG GCATTATCCT ACACAGGATG GTCTTGAAGC CGATTATGTT CTGTTTGTCG ACCGCAGGCC GGTCGGGGTC ATCGAGGCGA AAAAAGAGGA TGAGGGGCAT CATCTTACCG TGGTTGAAGA GCAGTCTTTC GGATATGCCG AAAGCAAGCT GAAGCATCTC AACAACGATC CGCTGCCGTT TGTTTACGAA AGTACCGGCA CGTTGACCCG CTTTACCGAT TACCGCGATC CGAAACCCCG TTCACGACCC GTCTTTACTT TTCACCGTCC CGAGACATTT CGTGAATGGC TCGGCCAGGA GCGGAGCCTC CGGGAACGCC TTTATGATAT TCCCGGACTG AATCCTGCCG CTTTGCGGGA GTGCCAGACC ATGGCAATCA ACAACCTCGA AAGCTCTTTT CGGGACGGAC GACCCAGAGC GCTGATCCAG ATGGCGACCG GCTCCGGCAA GACCTTCGCT GCCATTACCT TTATTTACCG TTTGCTCAAA CATGCCGATG CCAAACGGAT ACTCTTTCTG GTCGATACCC GCAACCTCGG CGAACAGGCA GAGCAGGAGT TCAGGGCATA CACGCCGAAC GACGATAACC GAAAATTCGT AGAACTGTAC AACGTGCAGC GGTTGCAGTC AAGCTCGATT GCCGGCGACA GTCAGGTCTG CATCACCACC ATCCAGCGGC TCTATTCCAT CCTGAAAGGG GAGGAGCTTG ACGCTTCGCT TGAGGAGCAA AATCCTGCTG AAAAAAGCTG GCAGCCGAAG GAGCCTGTTC CGGTGGCATA CAATGCGAAG GTTCCCATAG AGTTTTTCGA CTTCATTGTC ATCGACGAGT GTCATCGCTC GATCTACAAT CTCTGGAAGC AGGTGCTCGA CTATTTCGAC GCATTCCTGA TTGGCCTGAC CGCAACGCCC GACAAGCGCA CCTTTGGTTT TTTCAACGAA AACATCGTGA GCGAATACAG TCACGAACGA GCCGTGGCAG ACGGGGTCAA CGTCGGTTAC GATGTCTATA CCATCGAGAC TGAAATAACC CGGAACGGCT CCCGGATAAG GGCTCGGGAG TTTATCGACA AACGTGAAAA ACTCTCTCGC CGCAAACGGT GGGAGCAGCT TGAAGATGAT GTTGTCTATA CGTCGTCGCA GCTTGACCGG GATGTGGTGA ACCCGAGCCA GATCCGCAAC GTCATCCGCG CGTTCCGTGA TGCACTTCCG GTTTTGTTTC CCGGACGAAC CGAGGTGCCC AAGACCCTTG TATTTGCAAA GACCGACAGC CATGCCGACG ATATCATCCA GATTATCCGT GAGGAGTTCA ACGAAGGGAA CGCATTCTGT AACAAGATAA CCTACAAGGC CGAAGACGAC CCGAAATCGC TGCTTGCCCG GTTCCGGAAC GAGTACAACC CGAGAATAGC CGTTACGGTC GATATGATAG CCACGGGCAC CGACGTCAAG CCGCTCGAAT GCCTGCTGTT CATGCGCGAC GTCAGAAGCA GCAACTATTT CGAGCAGATG AAAGGGCGGG GCACACGGAC GCTCAGTTTT GACGATCTGA AAAAGGTTAC CCCATCGGTT ACTTCCGCCA AAACTCATTT CGTGATCATC GACGCCGTAG GGGTGACAAA ATCCCTGAAG ACCGACAGCC GCCCGCTCGA ACGCAAGCCG ACGGCATCGC TGAAGGAGTT GCTTGAAGCT GTAACCTTCG GGGCACAGGA TGAGGATCTC TACACCTCGC TTGCCAACCG CCTTGCCCGG CTCGACAAGC AGATTACCGA ACAGGAGCGT GCGGCATTTA TCGACAAAAC CGGAGGCAAG AGCATCAATC AGGTTGTTCG CGAACTACTC GACTCATGGG ATCCCGACAG CATCAACCGG AAAGCCCGGG AGATGAACCC GGAGGCAGTT CAGGAGATGG GCGAAAGCCC CTCTGGCGAA ACAACCATGT TTCTTGAACA GGCGCAGCAG GCGCTCCTTC ACGAAGCCCG ATTGACCTTC AACGGCTCGC TGAACGAATT CATCGACACC GTTCGCCGGG TGCATGAGCA GATCATCGAT ACGGTCAATC TCGATCAGGT AACGAGGTCA GAATGGGCAG CAGAAAGTGG TGAAAAGGCA GCAGAACTGA TCGGGGAGTT CAAGGCCTAT CTTGAAGCGC ACAAGGACGA AATCACCGCG CTCGGAATAT TTTACAATCA GCCCTACCGG CGCAGGGAGC TGACCTTCAG GATGATCAGG GAAGTGCTTG ACCGTCTCAA AGCCGACAGG CCGATGCTTG CCCCGATGCG TATCTGGCAT GCCTATGAAC AGATCGAAAA GGTTAACGGT TCGAGCCCGA AAAACGAACT CATTGCCCTT GTTGCGCTCA TCCGCCGGGT AACCGGCATC GATCCGGTTC TGACCGTTTA CGACAGAACC GTTGACGCGA ATTTCAAGCA GTGGGTGTTC AGCAGGCACT CCGACGCCGG TGACAAGTTC ACCGAGGAGC AGATGAATTG GCTTCGCATG ATCAAGGAGC ATATCGCTTC AAGCATCCAC ATGGAGCAGG ACGACCTCGA TCTCACACCG TTCGACGCCT ATGGGGGTCG CGGCAGGATG TGGCAACTTT TCGGGGATCG TATGGATGGG ATTATCGACG AACTTAACGA AGCGTTGACG GTATGA |
Protein sequence | MQERYSLNQT PEQIARDVID VQLRLAGWAV QEKNRIDWQV SSGIAVRHYP TQDGLEADYV LFVDRRPVGV IEAKKEDEGH HLTVVEEQSF GYAESKLKHL NNDPLPFVYE STGTLTRFTD YRDPKPRSRP VFTFHRPETF REWLGQERSL RERLYDIPGL NPAALRECQT MAINNLESSF RDGRPRALIQ MATGSGKTFA AITFIYRLLK HADAKRILFL VDTRNLGEQA EQEFRAYTPN DDNRKFVELY NVQRLQSSSI AGDSQVCITT IQRLYSILKG EELDASLEEQ NPAEKSWQPK EPVPVAYNAK VPIEFFDFIV IDECHRSIYN LWKQVLDYFD AFLIGLTATP DKRTFGFFNE NIVSEYSHER AVADGVNVGY DVYTIETEIT RNGSRIRARE FIDKREKLSR RKRWEQLEDD VVYTSSQLDR DVVNPSQIRN VIRAFRDALP VLFPGRTEVP KTLVFAKTDS HADDIIQIIR EEFNEGNAFC NKITYKAEDD PKSLLARFRN EYNPRIAVTV DMIATGTDVK PLECLLFMRD VRSSNYFEQM KGRGTRTLSF DDLKKVTPSV TSAKTHFVII DAVGVTKSLK TDSRPLERKP TASLKELLEA VTFGAQDEDL YTSLANRLAR LDKQITEQER AAFIDKTGGK SINQVVRELL DSWDPDSINR KAREMNPEAV QEMGESPSGE TTMFLEQAQQ ALLHEARLTF NGSLNEFIDT VRRVHEQIID TVNLDQVTRS EWAAESGEKA AELIGEFKAY LEAHKDEITA LGIFYNQPYR RRELTFRMIR EVLDRLKADR PMLAPMRIWH AYEQIEKVNG SSPKNELIAL VALIRRVTGI DPVLTVYDRT VDANFKQWVF SRHSDAGDKF TEEQMNWLRM IKEHIASSIH MEQDDLDLTP FDAYGGRGRM WQLFGDRMDG IIDELNEALT V |
| |
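As a spot check on the two sequences above, the sketch below removes the 10-character grouping and translates the first three blocks and the last six bases of the gene sequence. It assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on translation table 11 sharing its amino acid assignments with the standard code. The helper name `translate` is illustrative, not part of the record.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order; translation
# table 11 uses the same assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna):
    """Translate a DNA string, ignoring the whitespace between blocks."""
    dna = "".join(grouped_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and final six bases of the Gene sequence row.
gene_prefix = "ATGCAGGAAC GATATTCTCT GAACCAGACT"
gene_suffix = "GTATGA"

# First block of the Protein sequence row.
protein_prefix = "MQERYSLNQT"

print(translate(gene_prefix) == protein_prefix)  # prefix agrees with the protein
print(translate(gene_suffix))                    # final residue followed by stop (*)
```

The same helper can be applied to the full gene sequence; the trailing `*` marks the stop codon, which the protein sequence row omits.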