Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0334 |
Symbol | |
ID | 6373994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 338924 |
End bp | 340216 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642682853 |
Product | restriction modification system DNA specificity domain |
Protein accession | YP_001958784 |
Protein GI | 189499314 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.284396 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACAA AGGGGGGCGG ACTTTCCAGT CCGCGATTAA AAAACTCAAG CGGGTTGGAA AACCCGCCCT ACGTTGAAGG TGATTCTGGA GGAAGTCTTG ATATGAGGGA TTCTAATCTA CTCATTGAAT CACTACCAGA TAGATGGAAA AACCACAAGT TTGGTGATTT ATGCGATCGA GTGAAAAACT CATATCAACC AGTCGATGGG GGTGAAAAAC CATATATCGG CCTTGAGCAT CTTGCCCAAG GATTCCCGGC GTTCATTGGT CGGGGAAAAG AATGTGAGGT AAAGTCCTCG AAAACAGTAT TTAAATCTGG TGACATTCTC TTTGGGAAAC TTCGTCCATA TCTGCGTAAG GGAGCTCAGG CGGACTTTGA TGGCATTTGT TCAACAGATA TTCTCGTGTT TCGAGCGAAA CCCATTTGTG AATCGAATTT TCTGAGATTC GTTATCCACA GTGAGGAATT TGTGGCTCAT GCAAAAACAA CGACAAGCGG AGTACGGCAT CCCAGGACAT CATGGCCATT GTTGCGTGAG TTTTACATAT CGCTCCCCCC ACTGCCCGAG CAGAAGAAAA TCGCACACAT CCTTTCGACA GTGCAGCGGG CAATCGAAGC ACAGGATCGG ATCATCCAGA CAACCACCGA GCTGAAAAAA GCCCTCATGC ACAAGCTTTT CACAGAAGGG CTGCGCAACG AACCGCAGAA AGAAGCCGAG ATTGGGCTGG TTCCGGAGAG TTGGGAGGTG GTGGAGATTG GCGACGTTTT CAAGTTCACC AGTGGTAAGA CAAAACCAAA GGACACTGCA CCTGAGCCAT CCGTTGAGCG GACAGTTCCC GTGTATGGAG GAAATGGAGT GCTGGGCTAT TCAGCGCAGA GCCTTCTCAA TGAGGATGTG TTAATCCTTG GCAGGGTTGG CGAGTATTGT GGATGTGCTC ACCTCACAAA GCCTGTCTCG TGGGTAACTG ATAACGCCCT CTACGCGAAG GAGGAGAAAC GGTCCGTGAA TCGTAGTTAC GCGCGGACTC ATTTCGCGCA CCTCAACCTC AACCAATACA GCAACAAGAT GGGGCAGCCG CTGATTACCC AAGGAATCAT CAATCGGGTG AAATTCGGAC TCCCGTCTCG CGAAGAACAA GACGAACTCG CGAACGCTTT TGAAACGCTC GACACCCGTA TCGAGCAGAT CAATGCAAAA AAGAAGTCTC TCCAAGACCT CTTCCACACA CTACTTCACG AACTGATGAC CGCGAAGATT AATGTGGGCC ATATATCCGA AAAAATTGCA TGA
|
Protein sequence | MNTKGGGLSS PRLKNSSGLE NPPYVEGDSG GSLDMRDSNL LIESLPDRWK NHKFGDLCDR VKNSYQPVDG GEKPYIGLEH LAQGFPAFIG RGKECEVKSS KTVFKSGDIL FGKLRPYLRK GAQADFDGIC STDILVFRAK PICESNFLRF VIHSEEFVAH AKTTTSGVRH PRTSWPLLRE FYISLPPLPE QKKIAHILST VQRAIEAQDR IIQTTTELKK ALMHKLFTEG LRNEPQKEAE IGLVPESWEV VEIGDVFKFT SGKTKPKDTA PEPSVERTVP VYGGNGVLGY SAQSLLNEDV LILGRVGEYC GCAHLTKPVS WVTDNALYAK EEKRSVNRSY ARTHFAHLNL NQYSNKMGQP LITQGIINRV KFGLPSREEQ DELANAFETL DTRIEQINAK KKSLQDLFHT LLHELMTAKI NVGHISEKIA
|
| |