Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0337 |
Symbol | |
ID | 6373997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 343420 |
End bp | 347457 |
Gene Length | 4038 bp |
Protein Length | 1345 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642682856 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_001958787 |
Protein GI | 189499317 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.038664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.920027 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACAC CCTCCGAACA TAAAACCGTC CAATCCCGTA TCCTCGCTTA TGCCGAGGAA ATCGGCTGGG CCGTTGTGTC TCGTGAGGAG GCCGAAAAAA GGAGGGCGGG TTTTCCAACC CGCCATTACA TTCAAAGTGG CAAGCAAGAA AATGGCGGAC AGGAATGTCC GCCCTCCGGT AAAGAATCTT TATCTCTATT CTTCGACGAT CTTCTTGACG CGAAGGTGCG GGAGTTCAAT CCTTGCTACG CGGAGGCTGA GGGCGCGTTG CTTGGGCAGT TCCGCCATCT TCATGCGGAC ATCTATGGCA ACCGGGAATT TGTGGAGCAA TTGCGCAACC GGGGCAAGTT TTTCGACCAT GAGGAGAAGC GCGAGCGCGA CCTGATTCTG ATCGATTACG ACGACCCGGC GAGGAACGTG TACGAAGTCA CCGAAGAGTG GGCCTTTCAC AACGGGCACT ACGGCACGCG GGAAGATGTC GTCTTTCTCA TCAACGGTAT CCCGGTGCTG GTGATCGAGT GCAAGAACGC CAGCAAGGAC GAGGCGATTG CCCTCGGGAT AGACCAGATT CGTCGGTACC ACCGTGAAAC CCCCGAGCTG TTCGTGCCGC AGCAGCTTTT TACCGCCACC GACGCCATCG GCTTTTCGTA CGGAGCCACT TGGAACACCG TGCGCCGGAA CATATTCGAG TGGAAGATAC TGGGGGGCGG ACATTCCTGT CCGCCATTAC ATTCAGAATT GGAAAGCGGA CATTCCTGTC CGCCATTACA TTCAGAATTG GAAAGCGGAC TTTCCAGTCC GCCATTACAT TCCGACACGG CAATCGGACT TTCCAACCCG CCATTACATT TAAAAGACGC TGAAGAGAAT GGCGGACAGG AATGTCCGCC CTCCTTTGAC GTTGAAGAGA ATGGCGGGTT GGAAAACCCG CCCTCCTTTC TTGACCCGGA GAGAGAGATT GGCATGACAC AGCATCGGTT GCCTCACTGG CAGCAGGGTG ATGTGTGGGT ATTCGTGACA TGGCGGCTGG CGGATTCACT GCCACAGTCG AAACTGGAGG AGTGGAAAGA AGAGCGGGAG ATCTGGCTTT CGAATCATCC CGAGCCATGG GATGAGAAAA CCGAAGAGGA GTATCATGAA CGCTTTTCGC GTCAGATCGA CGAATGGCTC GATCAGGGCA GTGGTTCGTG TCTGTTGAGG GAACCTGCTT ATGCTCAGAT CGTGGCGAAT GCGTTGCGGC ATTTTGACGG AGAACGCTAT CAGCTGGCTT CGTTCGTGGT GATGCCTAAT CATGTGCATG TGTTGTTCTG TCCGTCCGGA ACGCATTCGC TCGCCGGGAT TCTGAAATCC TGGAAGGGCT TCAGCGCGCG TGAGATCAAC AAACGATCGG GAAAGACGGG ATCGTTCTGG CAGGAAGAGT ATTGGGACCG TTTGATTCGA AGTGAGAAGC ACTTTTTCAG AGTAGCAAAG TACATTCGTG AGAATCCAAT AAAGGGGGGC GGACTTTCCA GTCCGCCATT ACATTCTGAT TTTGAAAGAG GATTAGAAAA CCCGCCATTA CATTCTGATG TGAATGTAGG ACTTTCCAAC CCGCCATTTT ATTACGAATG CGAATTGCTG TTCGGAAAAA ATGGCGGACA GGAATGTCCG CCCTCCGTTC CCGGTCGTCT CGAATCGAAG GTAAAAACCT TCTGCGCCAT CCCGCAGGTG CTCGCCTTCC TGAAGGAGTA CATTGTCTTT GCCGAGAAGG ACGAGGAGCT GAACAAGTAC ATTCTGCGCC AGCACCAGAC CGGTGCGGTG GATGCATCCG TCAATCGGGC ACTTGATCCT GTCCGTTCCC GTGGTCTCGT CTGGCATACG CAGGGGAGCG GCAAGACTTT CACGATGATC AAGGCGGCTG AACGGCTCTT CCGTGCGCCA GAGGCGGAAA AGCCGACCAT CCTTTTGATG ATCGACCGGA ACGAGCTGGA AGACCAGATG CTCAAAAACC TCGCCGCTCT CGGTCTCGGC AACCTGGAGC ACGCGAGCAG CATTGCAAGG CTGAACAAAC TGCTGAAGTA TGACTACCGG GGCATCATCG TCACGATGAT CCACAAATTC CGGGATATGC CGGGGAATAT CAATACCCGG TCAAACATCT ACGTCCTGAT CGACGAAGCC CATCGCACCA CCGGCGGCGA TCTCGGCAAC TACCTCATGG CCGGTTTGCC CAACGCCACC TTTATCGGTT TCACCGGTAC GCCGGTGGAC AAGACCGTCT ACGGCAGGGG CACCTTCAAG ACCTTCGGCT GCGAGGATGA CAAGGGGTAT CTGCACAAGT ATTCCATCGC TGACAGCATC GAAGACGGCA CCACGCTGCC GCTCTACTAC CAGCTTGCCC CCAATGACAT GCTCGTCCCG CACGAGACAC TGGACGCGGA GTTTCTCTCC CTCGCCGAAG CCGAGGGGGT CGCCGACATC GAAGAGCTGA ACAAAATTCT CGACCGGGCC GTGAACCTGA AGAACTTTCT CAAAGGCAGG GAGCGGATCG AGAAAGTCGC GCAGTTTGTT GCCGGGCATT ACCTCGCTAA CGTCGAACCG CTCGGCTACA AGGCTTTCCT TGTCGGGGTA GACCGGGAGG CCTGCGCCCA CTACAAGCAG GCCCTCGACC AGTTCCTTCC GTCCGATTAT TCCCGGGTGG TCTACACCGG CAACAACAAC GACTCCGTTC TGCTCAAGAA ATTTCATCTC GACCAGAAGC AGGAACGTCA GATCCGCAAA AGCTTCGGCA AGATCGACCA GCAGCCAAAA ATCCTCATCG TCACCGAGAA ACTCCTTACC GGATTCGATG CGCCCCTGCT CTACGCCATG TATCTCGACA AGCCGATGCG CGACCATACC CTGTTGCAGG CCATCGCAAG GGTAAACCGT CCCTACGAGA ACGAAGCGCA GGAGATGGTG AAGCCCCACG GTTTCGTCCT TGATTTTGTC GGTATCTTCG ACAAGCTCGA AAAAGCCCTC GCCTTCGACA GCAAGGAGAT TAACGCCATC GTCAAGGATA TCAAACTTCT GAAGATGCTT TTTCAGAACA AGATGGAGGC TATAGGGGGG CGGACATTCC TGTCCGCCAT TGGAGTACAA ACGGCTGATA TACGAAACGG GCATTGGATT TCGAATGAGG ATTCTGAATG TAATGGCGGA CTGGAAAGTC ATGAGGATAA ATGTAATGGC GGACTGGAAA GTCCGCCCCC CTTTACGTTC AACGACAGGG ATGTCGATAA TCTGATAGAG CATTTCCGAG ATCCGGAGCG GCGGAAAGCG TTTTTCAAGG AGTACAAAGA GATCGAGATG CTCTACGAGA TCATCTCTCC GGACGCCTTT CTCCGACCGT TTATCGAGAG GTATGCGACG CTTTCCGCCG TGTACGATGT GGTCCGCAAG GCCTATGCGA AGCGTATACA GGTTGATCGT GATTTCCAGC GGAAGACCAA CCTGTTAGTT CAGGAAAAGG TCGGCAGTTA CGGAGTTGGA GAGTTGCAGG GCGTGGTGAA AATAGATTCG AACGCCATCG ACATCATTAA CGCTCAGGCC GGAGGTGCTC CAACGAGGGT GATCAATCTT ATCAAGAGCA TTGAAAAGAT CGCAGATGAT CAAAGTGACG ATCTGTTCCT GATCGCCATG GCAGAACGGG CGCAGGCTGT GCAGGAGAGT TTCGAGAGCC GTCAGGTGAC GACGGCCGAA GCCCTGGAAC AGCTGATGCA AGCGGTCGAA GTGAACGAGG AGCGAAAAAA AGAGCAGGCC GCAAAGGGGT TCGATGGACT GACCTTCTTT GTCTACCGGA CCTTGCTTGA TGAGAAGATC GAACATGCCG AAGAGGTCAG CAGGCAGATC AAGACGGCGT TTGTGGAATT TCCCAACTGG CAGAAGAGCG AAGCTGCGCT GCGTGAGCTT CGCAAAAAGA TTACCTTTGC GATCTTTGCT CAATCAGACG ATCTCGAGCG AGTCACGGGT ATCGTGGACT ACCTGTTCCG CCTGCTGGAA AGGGCTAACC GGATCTGA
|
Protein sequence | MPTPSEHKTV QSRILAYAEE IGWAVVSREE AEKRRAGFPT RHYIQSGKQE NGGQECPPSG KESLSLFFDD LLDAKVREFN PCYAEAEGAL LGQFRHLHAD IYGNREFVEQ LRNRGKFFDH EEKRERDLIL IDYDDPARNV YEVTEEWAFH NGHYGTREDV VFLINGIPVL VIECKNASKD EAIALGIDQI RRYHRETPEL FVPQQLFTAT DAIGFSYGAT WNTVRRNIFE WKILGGGHSC PPLHSELESG HSCPPLHSEL ESGLSSPPLH SDTAIGLSNP PLHLKDAEEN GGQECPPSFD VEENGGLENP PSFLDPEREI GMTQHRLPHW QQGDVWVFVT WRLADSLPQS KLEEWKEERE IWLSNHPEPW DEKTEEEYHE RFSRQIDEWL DQGSGSCLLR EPAYAQIVAN ALRHFDGERY QLASFVVMPN HVHVLFCPSG THSLAGILKS WKGFSAREIN KRSGKTGSFW QEEYWDRLIR SEKHFFRVAK YIRENPIKGG GLSSPPLHSD FERGLENPPL HSDVNVGLSN PPFYYECELL FGKNGGQECP PSVPGRLESK VKTFCAIPQV LAFLKEYIVF AEKDEELNKY ILRQHQTGAV DASVNRALDP VRSRGLVWHT QGSGKTFTMI KAAERLFRAP EAEKPTILLM IDRNELEDQM LKNLAALGLG NLEHASSIAR LNKLLKYDYR GIIVTMIHKF RDMPGNINTR SNIYVLIDEA HRTTGGDLGN YLMAGLPNAT FIGFTGTPVD KTVYGRGTFK TFGCEDDKGY LHKYSIADSI EDGTTLPLYY QLAPNDMLVP HETLDAEFLS LAEAEGVADI EELNKILDRA VNLKNFLKGR ERIEKVAQFV AGHYLANVEP LGYKAFLVGV DREACAHYKQ ALDQFLPSDY SRVVYTGNNN DSVLLKKFHL DQKQERQIRK SFGKIDQQPK ILIVTEKLLT GFDAPLLYAM YLDKPMRDHT LLQAIARVNR PYENEAQEMV KPHGFVLDFV GIFDKLEKAL AFDSKEINAI VKDIKLLKML FQNKMEAIGG RTFLSAIGVQ TADIRNGHWI SNEDSECNGG LESHEDKCNG GLESPPPFTF NDRDVDNLIE HFRDPERRKA FFKEYKEIEM LYEIISPDAF LRPFIERYAT LSAVYDVVRK AYAKRIQVDR DFQRKTNLLV QEKVGSYGVG ELQGVVKIDS NAIDIINAQA GGAPTRVINL IKSIEKIADD QSDDLFLIAM AERAQAVQES FESRQVTTAE ALEQLMQAVE VNEERKKEQA AKGFDGLTFF VYRTLLDEKI EHAEEVSRQI KTAFVEFPNW QKSEAALREL RKKITFAIFA QSDDLERVTG IVDYLFRLLE RANRI
|
| |