Gene Cag_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1124 
Symbol 
ID3747280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1515437 
End bp1518691 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content38% 
IMG OID637773657 
ProductType I site-specific deoxyribonuclease HsdR 
Protein accessionYP_379429 
Protein GI78189091 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACATC TCACCGAACA TAGCATAGAA ACATTTGCCA TTGAGTTACT CTATAAACTC 
GGCTACGAAT ATATCTATGC TCCCGATATT GCGCCCGACA CTTCGGCAGG CTCAGTGTCC
GAGATTCGAG AGAGCTTCGC ACAAGTTCTA TTGTTGAACA GGTTGCAAAA TGCCGTTAAA
AGAATCAATC ACAGTATCCC AGCCGATGCA CAGGCAGAAG CTATCAAAGA AATTCAACGC
ATTGCTTCGC CTGAATTGCT TACCAACAAT GAAACCTTTC ACCGTTTACT TACTGAAGGT
ATTCCCGTTT CAAAACGTGT AGATGGAGAC GATAGGGGCG ACAGAGTGTG GCTCATTGAT
TTTAAAAATC CCCACAATAA CGAATTTGTT GTAGCCAATC AATTTACCAT TATTGAAAAC
GGAAATAACA AACGCCCTGA TGTGATTCTG TTTGTCAATG GAATTCCGCT TGTAGTTATT
GAACTAAAAA ATGCTACCTA TGAAAATACC ACAATGCATT CAGCATTTAA GCAAATAGAC
ACCTATAAAA AAACTATTCC AAGTTTATTT ACGTATAACG GTTTTATCGT TATCTCTGAT
GGTTTAGAAG CCAAAGCAGG CACTATTTCG TCAGGTTTTA GTCGCTTTAT GGCATGGAAG
TCGGCAGATG GTAAAGCTGA AGCCTCGCAT TTAGTAAGCC AATTAGAAAC ATTGATTCAA
GGAATGTTGA ATAAAGAAAC CTTGATAGAC TTAATGAGGC ATTTCATTGT ATTTGAAAAA
TCAAAAAAGA TAGACGCTAA AACAGGTATT ACAACAATAT CAACCGTTAA AAAATTAGCA
GCTTATCATC AATACTATGC AGTAAATCGA GCAGTTGAGT CAACGTTAAG AGCGTCAGGT
TATCAATTGG TGAAAGAAAC GCCATTGAGT ATGGTCATGG AATCTCCTGA AAGCTATGGT
TTGCGTGGAG TAAAGAAGCA ACCCATTGGC GACAAAAAAG GTGGTGTGGT TTGGCATACG
CAAGGTAGCG GAAAATCACT CTCAATGGTT TTCTATACTG GTAAAATTGT ATTGGCTTTA
GACAACCCAA CCATTCTTGT AATTACCGAC CGAAACGATT TGGACGACCA ACTTTTTGAC
ACGTTTGCTG CATCAAAACA ATTAATAAGG CAAGAACCAG TTCAGGCAGA AGACAGAAAC
CAGTTAAAAG AATTATTAAA AGTTGCTTCG GGCGGTGTAG TATTTACAAC CATTCAAAAA
TTTCAACCCA ATGAAGGCAA CATTTATGAA AAGCTTTCTG ATAGAAAAAA CATTGTAGTT
ATAGCGGACG AAGCACATAG AACACAATAC GGATTTAAAG CAAAAACCAT TGATGCAAAA
GATGAAAAAG GGACGATTAT TGGCAAGAAA ATCGTTTACG GTTTTGCCAA ATATATGCGA
GATGCTTTGC CAAACGCAAC TTATTTAGGT TTTACAGGAA CGCCGATAGA AAACACCGAT
GTAAACACAC CAGCCGTTTT CGGAAACTAT GTGGACATTT ACGATATAGC TCAAGCCGTT
GAAGATGGAG CAACCGTTCG TATTTATTAC GAAAGCCGTT TAGCAAAAGT AAGTCTTAGC
GAAGAAGGCA AAAAATTAGT TGCCGAACTT GATGATGAAT TGGAAGAGGA AGAAGATGTA
AGGGCGTATA GCAATACGCC CCAACAAAAA GCAAAAGCTA AATGGACGCA GCTTGAAGCC
TTAGTTGGTA GTGAAAACCG AATTAGGAAT ATTGCCAAAG ACATTGTTGC ACACTTTAAC
CAACGGCAGG AAGTATGTAA TGGTAAAGGT ATGATTGTTG CTATGAGCCG CAGAATTGCA
GCCGATTTGT ATCAGGCAAT TATTAACCTA AAACCTGAAT GGCATTCAGA GGATTTGAAT
AAAGGCGTGA TAAAAGTGGT AATGACTTCG GCATCTTCTG ATGGTCCAAA AATTTCAAAA
CACCACACAA CTAAAGAGCA AAGAAGAACC TTAGCCGAAA GAATGAAAAA TCCTGATGAC
GCATTACAAT TGGTAATTGT GCGGGATATG TGGCTTACTG GTTTTGACGC ACCAAGTATG
CACACCCTTT ATATTGATAA ACCAATGAAA GGGCATAATT TGATGCAAGC AATTGCCCGT
GTTAATCGAG TTTATAACGA TAAACCCGGT GGTTTAATTG TTGACTATTT AGGCATTGCT
TCTGATTTAA AAAAAGCACT TGCTTTTTAT TCTGATGCAG GCGGAAAAGG CGACCCAACC
ATATTGCAAG AACAAGCCGT TCAATTGATG TTGGAGAAAT TAGAAGTAGT TTCTCAAATG
TATTACGGCT TTGCATATGA AACCTATTTT GAAGCCGACA CTTCAAAGAA ATTATCGCTA
ATACTTGCAG CCGAAGAACA TATTTTAGGT TTAGAAGACG GAAAGAAACG TTACATCAAC
GAAGTAACAG CACTTTCAAA AGCATTTGCC ATTGCTATAC CGCATGACCA AGCAATGGAT
GTAAAAGATG AGGTTTCGTT TTTCCAAACG GTAAAAGCAA GGTTAGCAAA GTTTGACGGA
ACCGGGTCAG GCAAAACAGA CGAAGAAATT GAAACAACCA TTCGACAAGT TATTGACAAA
GCACTCATTT CGGAACAAGT GATTGATGTG TTTGACGCAG CAGGAATAAA GAAACCCGAT
ATTTCTATTC TTTCAGAAGA TTTTTTAATG GAACTGAAAG GAATGGAACA TAAAAATGTT
GCCTTAGAAG TTTTGAAAAA ACTCTTGAAT GATGAAATAA AATCGAGAGC AAAAAAGAAC
CTCGTAAAAA GTAGAACATT TTTAGATATG TTGGAAAACT CCATTAAAAA ATATCATAAC
AAAATTCTTA CGGCTGCCGA AGTTATTGAT GAACTCATAA AACTTGGGAA AGAAATAGTT
GAAACGGATG ATGAAGCAAA ACGTATGGGT TTAACTGATT TTGAATATGC TTTTTATACC
GCAGTTGCCA ATAATGATAG CGCAAAAGAA CTCATGCAAC AAGATAAATT GAGAGAACTT
GCGATTGTAC TAACCGAAAC CATACGCCAA AACACATCTA TTGACTGGAC AATTAAAGAA
AGCGTAAAGG CTAAATTGAA AGTAGCGGTA AAAAGAGTGC TCAGAAAATA TGGCTATCCA
CCCGACATGC AATTGTTAGC AACAGAAACC GTATTAAAAC AAGCTGAAAT GATTGCTAAT
GAAATAACAA AATAA
 
Protein sequence
MIHLTEHSIE TFAIELLYKL GYEYIYAPDI APDTSAGSVS EIRESFAQVL LLNRLQNAVK 
RINHSIPADA QAEAIKEIQR IASPELLTNN ETFHRLLTEG IPVSKRVDGD DRGDRVWLID
FKNPHNNEFV VANQFTIIEN GNNKRPDVIL FVNGIPLVVI ELKNATYENT TMHSAFKQID
TYKKTIPSLF TYNGFIVISD GLEAKAGTIS SGFSRFMAWK SADGKAEASH LVSQLETLIQ
GMLNKETLID LMRHFIVFEK SKKIDAKTGI TTISTVKKLA AYHQYYAVNR AVESTLRASG
YQLVKETPLS MVMESPESYG LRGVKKQPIG DKKGGVVWHT QGSGKSLSMV FYTGKIVLAL
DNPTILVITD RNDLDDQLFD TFAASKQLIR QEPVQAEDRN QLKELLKVAS GGVVFTTIQK
FQPNEGNIYE KLSDRKNIVV IADEAHRTQY GFKAKTIDAK DEKGTIIGKK IVYGFAKYMR
DALPNATYLG FTGTPIENTD VNTPAVFGNY VDIYDIAQAV EDGATVRIYY ESRLAKVSLS
EEGKKLVAEL DDELEEEEDV RAYSNTPQQK AKAKWTQLEA LVGSENRIRN IAKDIVAHFN
QRQEVCNGKG MIVAMSRRIA ADLYQAIINL KPEWHSEDLN KGVIKVVMTS ASSDGPKISK
HHTTKEQRRT LAERMKNPDD ALQLVIVRDM WLTGFDAPSM HTLYIDKPMK GHNLMQAIAR
VNRVYNDKPG GLIVDYLGIA SDLKKALAFY SDAGGKGDPT ILQEQAVQLM LEKLEVVSQM
YYGFAYETYF EADTSKKLSL ILAAEEHILG LEDGKKRYIN EVTALSKAFA IAIPHDQAMD
VKDEVSFFQT VKARLAKFDG TGSGKTDEEI ETTIRQVIDK ALISEQVIDV FDAAGIKKPD
ISILSEDFLM ELKGMEHKNV ALEVLKKLLN DEIKSRAKKN LVKSRTFLDM LENSIKKYHN
KILTAAEVID ELIKLGKEIV ETDDEAKRMG LTDFEYAFYT AVANNDSAKE LMQQDKLREL
AIVLTETIRQ NTSIDWTIKE SVKAKLKVAV KRVLRKYGYP PDMQLLATET VLKQAEMIAN
EITK