Gene Cpha266_1495 details


Gene Information

Locus tag: Cpha266_1495
Symbol:
ID: 4570284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1695958
End bp: 1698753
Gene Length: 2796 bp
Protein Length: 931 aa
Translation table: 11
GC content: 54%
IMG OID: 639766078
Product: type III restriction enzyme, res subunit
Protein accession: YP_911943
Protein GI: 119357299
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
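
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch. It assumes 1-based inclusive coordinates (the values themselves suggest this, since 1698753 - 1695958 + 1 = 2796) and assumes that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1695958, 1698753)
print(length)                  # 2796
print(protein_length(length))  # 931
```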


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGAAC GATATTCTCT GAACCAGACT CCCGAGCAGA TTGCGCGGGA TGTTATCGAT 
GTACAGTTGC GGCTGGCAGG TTGGGCTGTT CAGGAAAAAA ATCGCATCGA CTGGCAGGTT
TCGTCGGGTA TAGCCGTGAG GCATTATCCT ACACAGGATG GTCTTGAAGC CGATTATGTT
CTGTTTGTCG ACCGCAGGCC GGTCGGGGTC ATCGAGGCGA AAAAAGAGGA TGAGGGGCAT
CATCTTACCG TGGTTGAAGA GCAGTCTTTC GGATATGCCG AAAGCAAGCT GAAGCATCTC
AACAACGATC CGCTGCCGTT TGTTTACGAA AGTACCGGCA CGTTGACCCG CTTTACCGAT
TACCGCGATC CGAAACCCCG TTCACGACCC GTCTTTACTT TTCACCGTCC CGAGACATTT
CGTGAATGGC TCGGCCAGGA GCGGAGCCTC CGGGAACGCC TTTATGATAT TCCCGGACTG
AATCCTGCCG CTTTGCGGGA GTGCCAGACC ATGGCAATCA ACAACCTCGA AAGCTCTTTT
CGGGACGGAC GACCCAGAGC GCTGATCCAG ATGGCGACCG GCTCCGGCAA GACCTTCGCT
GCCATTACCT TTATTTACCG TTTGCTCAAA CATGCCGATG CCAAACGGAT ACTCTTTCTG
GTCGATACCC GCAACCTCGG CGAACAGGCA GAGCAGGAGT TCAGGGCATA CACGCCGAAC
GACGATAACC GAAAATTCGT AGAACTGTAC AACGTGCAGC GGTTGCAGTC AAGCTCGATT
GCCGGCGACA GTCAGGTCTG CATCACCACC ATCCAGCGGC TCTATTCCAT CCTGAAAGGG
GAGGAGCTTG ACGCTTCGCT TGAGGAGCAA AATCCTGCTG AAAAAAGCTG GCAGCCGAAG
GAGCCTGTTC CGGTGGCATA CAATGCGAAG GTTCCCATAG AGTTTTTCGA CTTCATTGTC
ATCGACGAGT GTCATCGCTC GATCTACAAT CTCTGGAAGC AGGTGCTCGA CTATTTCGAC
GCATTCCTGA TTGGCCTGAC CGCAACGCCC GACAAGCGCA CCTTTGGTTT TTTCAACGAA
AACATCGTGA GCGAATACAG TCACGAACGA GCCGTGGCAG ACGGGGTCAA CGTCGGTTAC
GATGTCTATA CCATCGAGAC TGAAATAACC CGGAACGGCT CCCGGATAAG GGCTCGGGAG
TTTATCGACA AACGTGAAAA ACTCTCTCGC CGCAAACGGT GGGAGCAGCT TGAAGATGAT
GTTGTCTATA CGTCGTCGCA GCTTGACCGG GATGTGGTGA ACCCGAGCCA GATCCGCAAC
GTCATCCGCG CGTTCCGTGA TGCACTTCCG GTTTTGTTTC CCGGACGAAC CGAGGTGCCC
AAGACCCTTG TATTTGCAAA GACCGACAGC CATGCCGACG ATATCATCCA GATTATCCGT
GAGGAGTTCA ACGAAGGGAA CGCATTCTGT AACAAGATAA CCTACAAGGC CGAAGACGAC
CCGAAATCGC TGCTTGCCCG GTTCCGGAAC GAGTACAACC CGAGAATAGC CGTTACGGTC
GATATGATAG CCACGGGCAC CGACGTCAAG CCGCTCGAAT GCCTGCTGTT CATGCGCGAC
GTCAGAAGCA GCAACTATTT CGAGCAGATG AAAGGGCGGG GCACACGGAC GCTCAGTTTT
GACGATCTGA AAAAGGTTAC CCCATCGGTT ACTTCCGCCA AAACTCATTT CGTGATCATC
GACGCCGTAG GGGTGACAAA ATCCCTGAAG ACCGACAGCC GCCCGCTCGA ACGCAAGCCG
ACGGCATCGC TGAAGGAGTT GCTTGAAGCT GTAACCTTCG GGGCACAGGA TGAGGATCTC
TACACCTCGC TTGCCAACCG CCTTGCCCGG CTCGACAAGC AGATTACCGA ACAGGAGCGT
GCGGCATTTA TCGACAAAAC CGGAGGCAAG AGCATCAATC AGGTTGTTCG CGAACTACTC
GACTCATGGG ATCCCGACAG CATCAACCGG AAAGCCCGGG AGATGAACCC GGAGGCAGTT
CAGGAGATGG GCGAAAGCCC CTCTGGCGAA ACAACCATGT TTCTTGAACA GGCGCAGCAG
GCGCTCCTTC ACGAAGCCCG ATTGACCTTC AACGGCTCGC TGAACGAATT CATCGACACC
GTTCGCCGGG TGCATGAGCA GATCATCGAT ACGGTCAATC TCGATCAGGT AACGAGGTCA
GAATGGGCAG CAGAAAGTGG TGAAAAGGCA GCAGAACTGA TCGGGGAGTT CAAGGCCTAT
CTTGAAGCGC ACAAGGACGA AATCACCGCG CTCGGAATAT TTTACAATCA GCCCTACCGG
CGCAGGGAGC TGACCTTCAG GATGATCAGG GAAGTGCTTG ACCGTCTCAA AGCCGACAGG
CCGATGCTTG CCCCGATGCG TATCTGGCAT GCCTATGAAC AGATCGAAAA GGTTAACGGT
TCGAGCCCGA AAAACGAACT CATTGCCCTT GTTGCGCTCA TCCGCCGGGT AACCGGCATC
GATCCGGTTC TGACCGTTTA CGACAGAACC GTTGACGCGA ATTTCAAGCA GTGGGTGTTC
AGCAGGCACT CCGACGCCGG TGACAAGTTC ACCGAGGAGC AGATGAATTG GCTTCGCATG
ATCAAGGAGC ATATCGCTTC AAGCATCCAC ATGGAGCAGG ACGACCTCGA TCTCACACCG
TTCGACGCCT ATGGGGGTCG CGGCAGGATG TGGCAACTTT TCGGGGATCG TATGGATGGG
ATTATCGACG AACTTAACGA AGCGTTGACG GTATGA
 
Protein sequence
MQERYSLNQT PEQIARDVID VQLRLAGWAV QEKNRIDWQV SSGIAVRHYP TQDGLEADYV 
LFVDRRPVGV IEAKKEDEGH HLTVVEEQSF GYAESKLKHL NNDPLPFVYE STGTLTRFTD
YRDPKPRSRP VFTFHRPETF REWLGQERSL RERLYDIPGL NPAALRECQT MAINNLESSF
RDGRPRALIQ MATGSGKTFA AITFIYRLLK HADAKRILFL VDTRNLGEQA EQEFRAYTPN
DDNRKFVELY NVQRLQSSSI AGDSQVCITT IQRLYSILKG EELDASLEEQ NPAEKSWQPK
EPVPVAYNAK VPIEFFDFIV IDECHRSIYN LWKQVLDYFD AFLIGLTATP DKRTFGFFNE
NIVSEYSHER AVADGVNVGY DVYTIETEIT RNGSRIRARE FIDKREKLSR RKRWEQLEDD
VVYTSSQLDR DVVNPSQIRN VIRAFRDALP VLFPGRTEVP KTLVFAKTDS HADDIIQIIR
EEFNEGNAFC NKITYKAEDD PKSLLARFRN EYNPRIAVTV DMIATGTDVK PLECLLFMRD
VRSSNYFEQM KGRGTRTLSF DDLKKVTPSV TSAKTHFVII DAVGVTKSLK TDSRPLERKP
TASLKELLEA VTFGAQDEDL YTSLANRLAR LDKQITEQER AAFIDKTGGK SINQVVRELL
DSWDPDSINR KAREMNPEAV QEMGESPSGE TTMFLEQAQQ ALLHEARLTF NGSLNEFIDT
VRRVHEQIID TVNLDQVTRS EWAAESGEKA AELIGEFKAY LEAHKDEITA LGIFYNQPYR
RRELTFRMIR EVLDRLKADR PMLAPMRIWH AYEQIEKVNG SSPKNELIAL VALIRRVTGI
DPVLTVYDRT VDANFKQWVF SRHSDAGDKF TEEQMNWLRM IKEHIASSIH MEQDDLDLTP
FDAYGGRGRM WQLFGDRMDG IIDELNEALT V
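
The sequences above are displayed in space-separated blocks of 10 characters. As a rough illustration of how such a listing could be processed, the Python sketch below joins the blocks, computes GC content, and translates DNA using the codon assignments of NCBI translation table 11. That table shares its codon-to-amino-acid mapping with the standard code; its alternative start codons are not modeled here. The helper names `join_blocks`, `gc_fraction`, and `translate` are hypothetical, and the example input is only the first 30 bases of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_30 = join_blocks("ATGCAGGAAC GATATTCTCT GAACCAGACT")
print(translate(first_30))  # MQERYSLNQT
```

The printed residues match the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence should give a value close to the 54% GC content reported in the record, although that figure is not recomputed here.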