Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2548 |
Symbol | |
ID | 6376246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2720415 |
End bp | 2721632 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642685026 |
Product | restriction modification system DNA specificity domain |
Protein accession | YP_001960923 |
Protein GI | 189501453 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACA GGTTCAAGGC TGTTTTGCTG GGCGATATCG TTATTCCTAA AGGAATACAA ACAGGGCCTT TTGGGAGTCA ATTGAAAGCT GAAGAATATA CGGAAGATGG CGTTCCAGTA GTAATGCCAA AAGATATTTG CAGCGGGTAT TTGACAAGTT CATTTATTTC AAGAGTCTCA CAATCCAAGG CAAATAAACT GAAGAAGCAT CAGATTAAGG AAGGGGATAT AATATTTCCC AGACGGGGAG ACCTGCGTCG AATCGGTGTG GCAAGAAAAG ACAATACTGG CTGGATTTGT GGGACGGGTT GCCTTAGGGC GCGATTGAAT AGTGTAGTTC ATTCTGACTT TTTGCACCAA TATGTGCTCT TAGACTCAGT TGGCAAATGG CTGGAAAGAA ATGCTCTTGG CCAGACCATG TTGAATCTTA GTACAGACAT TATTTCTAAC CTGCCTTTAA CTCTGCCCTT GCTTTCTGAG CAAAAAGCTA TTGCCGATTT GCTCTCTGCC TGGGATGAGG CTATCGAGAA AGCCGAACGA CTGATTCAGG AGAAGGAGAG GCGGTTTAGG TGGTTGCTTC GTGAGTTGAT CTCAGAACCA CGGAATACAC GGAAAGATGC GGAATGGAAA AAGGTGAGAA TGGGATCATT TTTGACTGAG AGTCGGATTC CCGATCGTGA AAATGATCCC AAGAAGCGAA TAAGTGTGAG GCTGCATTTG AGGGGCGTTG AGGTTCGAGA ATATCGAGGA ACTGAGTCTA ATGGAGCAAC TGCATATTTT ATCCGTAAAG CAGGTCAGTT TATCTACGGG AAACAAAATG TCTTTAGAGG GGCTGTTGGC ATAGTACCCC TTGAATTAGA TGGCTATAGC TCAACTCAGG ACATACCGGC ATTCGACATA GCTGATCATG TTGATAAGAG CTGGCTTCTA TTTCTCTTTT CATATACAAA CTTTTACAAA AAATTAGAGC TATATGCGAG CGGGTCGGGA TCAAAGCGAC TTCATCCCAA AGAGCTCTTC AGAATGAAAA TCACCTTACC AACATTCGGC GAACAGCAAC AAATTGCTGA GACATTATCC TCAGCCCAGT ACGAAATCGA CCTGCTGAAG CAGCTCGCAG AAAAATACAA AACCCAGAAA CGCGGCCTGA TGCAGAAGAT GCTCGCTGGC ACATGGCGGG TAAAACCGGA AATCGTTCAT CAATACATGG AGGCGTAA
|
Protein sequence | MSDRFKAVLL GDIVIPKGIQ TGPFGSQLKA EEYTEDGVPV VMPKDICSGY LTSSFISRVS QSKANKLKKH QIKEGDIIFP RRGDLRRIGV ARKDNTGWIC GTGCLRARLN SVVHSDFLHQ YVLLDSVGKW LERNALGQTM LNLSTDIISN LPLTLPLLSE QKAIADLLSA WDEAIEKAER LIQEKERRFR WLLRELISEP RNTRKDAEWK KVRMGSFLTE SRIPDRENDP KKRISVRLHL RGVEVREYRG TESNGATAYF IRKAGQFIYG KQNVFRGAVG IVPLELDGYS STQDIPAFDI ADHVDKSWLL FLFSYTNFYK KLELYASGSG SKRLHPKELF RMKITLPTFG EQQQIAETLS SAQYEIDLLK QLAEKYKTQK RGLMQKMLAG TWRVKPEIVH QYMEA
|
| |