Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1124 |
Symbol | |
ID | 3747280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1515437 |
End bp | 1518691 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637773657 |
Product | Type I site-specific deoxyribonuclease HsdR |
Protein accession | YP_379429 |
Protein GI | 78189091 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACATC TCACCGAACA TAGCATAGAA ACATTTGCCA TTGAGTTACT CTATAAACTC GGCTACGAAT ATATCTATGC TCCCGATATT GCGCCCGACA CTTCGGCAGG CTCAGTGTCC GAGATTCGAG AGAGCTTCGC ACAAGTTCTA TTGTTGAACA GGTTGCAAAA TGCCGTTAAA AGAATCAATC ACAGTATCCC AGCCGATGCA CAGGCAGAAG CTATCAAAGA AATTCAACGC ATTGCTTCGC CTGAATTGCT TACCAACAAT GAAACCTTTC ACCGTTTACT TACTGAAGGT ATTCCCGTTT CAAAACGTGT AGATGGAGAC GATAGGGGCG ACAGAGTGTG GCTCATTGAT TTTAAAAATC CCCACAATAA CGAATTTGTT GTAGCCAATC AATTTACCAT TATTGAAAAC GGAAATAACA AACGCCCTGA TGTGATTCTG TTTGTCAATG GAATTCCGCT TGTAGTTATT GAACTAAAAA ATGCTACCTA TGAAAATACC ACAATGCATT CAGCATTTAA GCAAATAGAC ACCTATAAAA AAACTATTCC AAGTTTATTT ACGTATAACG GTTTTATCGT TATCTCTGAT GGTTTAGAAG CCAAAGCAGG CACTATTTCG TCAGGTTTTA GTCGCTTTAT GGCATGGAAG TCGGCAGATG GTAAAGCTGA AGCCTCGCAT TTAGTAAGCC AATTAGAAAC ATTGATTCAA GGAATGTTGA ATAAAGAAAC CTTGATAGAC TTAATGAGGC ATTTCATTGT ATTTGAAAAA TCAAAAAAGA TAGACGCTAA AACAGGTATT ACAACAATAT CAACCGTTAA AAAATTAGCA GCTTATCATC AATACTATGC AGTAAATCGA GCAGTTGAGT CAACGTTAAG AGCGTCAGGT TATCAATTGG TGAAAGAAAC GCCATTGAGT ATGGTCATGG AATCTCCTGA AAGCTATGGT TTGCGTGGAG TAAAGAAGCA ACCCATTGGC GACAAAAAAG GTGGTGTGGT TTGGCATACG CAAGGTAGCG GAAAATCACT CTCAATGGTT TTCTATACTG GTAAAATTGT ATTGGCTTTA GACAACCCAA CCATTCTTGT AATTACCGAC CGAAACGATT TGGACGACCA ACTTTTTGAC ACGTTTGCTG CATCAAAACA ATTAATAAGG CAAGAACCAG TTCAGGCAGA AGACAGAAAC CAGTTAAAAG AATTATTAAA AGTTGCTTCG GGCGGTGTAG TATTTACAAC CATTCAAAAA TTTCAACCCA ATGAAGGCAA CATTTATGAA AAGCTTTCTG ATAGAAAAAA CATTGTAGTT ATAGCGGACG AAGCACATAG AACACAATAC GGATTTAAAG CAAAAACCAT TGATGCAAAA GATGAAAAAG GGACGATTAT TGGCAAGAAA ATCGTTTACG GTTTTGCCAA ATATATGCGA GATGCTTTGC CAAACGCAAC TTATTTAGGT TTTACAGGAA CGCCGATAGA AAACACCGAT GTAAACACAC CAGCCGTTTT CGGAAACTAT GTGGACATTT ACGATATAGC TCAAGCCGTT GAAGATGGAG CAACCGTTCG TATTTATTAC GAAAGCCGTT TAGCAAAAGT AAGTCTTAGC GAAGAAGGCA AAAAATTAGT TGCCGAACTT GATGATGAAT TGGAAGAGGA AGAAGATGTA AGGGCGTATA GCAATACGCC CCAACAAAAA GCAAAAGCTA AATGGACGCA GCTTGAAGCC TTAGTTGGTA GTGAAAACCG AATTAGGAAT ATTGCCAAAG ACATTGTTGC ACACTTTAAC CAACGGCAGG AAGTATGTAA TGGTAAAGGT ATGATTGTTG CTATGAGCCG CAGAATTGCA GCCGATTTGT ATCAGGCAAT TATTAACCTA AAACCTGAAT GGCATTCAGA GGATTTGAAT AAAGGCGTGA TAAAAGTGGT AATGACTTCG GCATCTTCTG ATGGTCCAAA AATTTCAAAA CACCACACAA CTAAAGAGCA AAGAAGAACC TTAGCCGAAA GAATGAAAAA TCCTGATGAC GCATTACAAT TGGTAATTGT GCGGGATATG TGGCTTACTG GTTTTGACGC ACCAAGTATG CACACCCTTT ATATTGATAA ACCAATGAAA GGGCATAATT TGATGCAAGC AATTGCCCGT GTTAATCGAG TTTATAACGA TAAACCCGGT GGTTTAATTG TTGACTATTT AGGCATTGCT TCTGATTTAA AAAAAGCACT TGCTTTTTAT TCTGATGCAG GCGGAAAAGG CGACCCAACC ATATTGCAAG AACAAGCCGT TCAATTGATG TTGGAGAAAT TAGAAGTAGT TTCTCAAATG TATTACGGCT TTGCATATGA AACCTATTTT GAAGCCGACA CTTCAAAGAA ATTATCGCTA ATACTTGCAG CCGAAGAACA TATTTTAGGT TTAGAAGACG GAAAGAAACG TTACATCAAC GAAGTAACAG CACTTTCAAA AGCATTTGCC ATTGCTATAC CGCATGACCA AGCAATGGAT GTAAAAGATG AGGTTTCGTT TTTCCAAACG GTAAAAGCAA GGTTAGCAAA GTTTGACGGA ACCGGGTCAG GCAAAACAGA CGAAGAAATT GAAACAACCA TTCGACAAGT TATTGACAAA GCACTCATTT CGGAACAAGT GATTGATGTG TTTGACGCAG CAGGAATAAA GAAACCCGAT ATTTCTATTC TTTCAGAAGA TTTTTTAATG GAACTGAAAG GAATGGAACA TAAAAATGTT GCCTTAGAAG TTTTGAAAAA ACTCTTGAAT GATGAAATAA AATCGAGAGC AAAAAAGAAC CTCGTAAAAA GTAGAACATT TTTAGATATG TTGGAAAACT CCATTAAAAA ATATCATAAC AAAATTCTTA CGGCTGCCGA AGTTATTGAT GAACTCATAA AACTTGGGAA AGAAATAGTT GAAACGGATG ATGAAGCAAA ACGTATGGGT TTAACTGATT TTGAATATGC TTTTTATACC GCAGTTGCCA ATAATGATAG CGCAAAAGAA CTCATGCAAC AAGATAAATT GAGAGAACTT GCGATTGTAC TAACCGAAAC CATACGCCAA AACACATCTA TTGACTGGAC AATTAAAGAA AGCGTAAAGG CTAAATTGAA AGTAGCGGTA AAAAGAGTGC TCAGAAAATA TGGCTATCCA CCCGACATGC AATTGTTAGC AACAGAAACC GTATTAAAAC AAGCTGAAAT GATTGCTAAT GAAATAACAA AATAA
|
Protein sequence | MIHLTEHSIE TFAIELLYKL GYEYIYAPDI APDTSAGSVS EIRESFAQVL LLNRLQNAVK RINHSIPADA QAEAIKEIQR IASPELLTNN ETFHRLLTEG IPVSKRVDGD DRGDRVWLID FKNPHNNEFV VANQFTIIEN GNNKRPDVIL FVNGIPLVVI ELKNATYENT TMHSAFKQID TYKKTIPSLF TYNGFIVISD GLEAKAGTIS SGFSRFMAWK SADGKAEASH LVSQLETLIQ GMLNKETLID LMRHFIVFEK SKKIDAKTGI TTISTVKKLA AYHQYYAVNR AVESTLRASG YQLVKETPLS MVMESPESYG LRGVKKQPIG DKKGGVVWHT QGSGKSLSMV FYTGKIVLAL DNPTILVITD RNDLDDQLFD TFAASKQLIR QEPVQAEDRN QLKELLKVAS GGVVFTTIQK FQPNEGNIYE KLSDRKNIVV IADEAHRTQY GFKAKTIDAK DEKGTIIGKK IVYGFAKYMR DALPNATYLG FTGTPIENTD VNTPAVFGNY VDIYDIAQAV EDGATVRIYY ESRLAKVSLS EEGKKLVAEL DDELEEEEDV RAYSNTPQQK AKAKWTQLEA LVGSENRIRN IAKDIVAHFN QRQEVCNGKG MIVAMSRRIA ADLYQAIINL KPEWHSEDLN KGVIKVVMTS ASSDGPKISK HHTTKEQRRT LAERMKNPDD ALQLVIVRDM WLTGFDAPSM HTLYIDKPMK GHNLMQAIAR VNRVYNDKPG GLIVDYLGIA SDLKKALAFY SDAGGKGDPT ILQEQAVQLM LEKLEVVSQM YYGFAYETYF EADTSKKLSL ILAAEEHILG LEDGKKRYIN EVTALSKAFA IAIPHDQAMD VKDEVSFFQT VKARLAKFDG TGSGKTDEEI ETTIRQVIDK ALISEQVIDV FDAAGIKKPD ISILSEDFLM ELKGMEHKNV ALEVLKKLLN DEIKSRAKKN LVKSRTFLDM LENSIKKYHN KILTAAEVID ELIKLGKEIV ETDDEAKRMG LTDFEYAFYT AVANNDSAKE LMQQDKLREL AIVLTETIRQ NTSIDWTIKE SVKAKLKVAV KRVLRKYGYP PDMQLLATET VLKQAEMIAN EITK
|
| |