Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1674 |
Symbol | |
ID | 6375360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1810462 |
End bp | 1811499 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642684168 |
Product | restriction endonuclease |
Protein accession | YP_001960074 |
Protein GI | 189500604 |
COG category | [V] Defense mechanisms |
COG ID | [COG1715] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.890961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000510843 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTGTCC AGAGCGCAAC TATCCAGGCA CTAAAAGAGG CGGGCAAGCC GCTGCATGCC AATGAAATCG CCAAGCTGAT CATTGAAGCT GGGCTCTGGA AGTCTGATGG CAAGACACCG GAGGCCACCG TTAGCGCCAG CCTCTATTCG GACATAAAGA AGAATGGCGA CAAGTCACTC TTTGTAAAAG TTGGCCCTCA AACCTTCGCA CTCCGGGATG TTCAGCTTGT TGCAGTTTGC GATATGAAGG AACCTGAACC AAAAATCGAG AGTGGCCCGG TCAGCTCCCA AGAGACGTAT TCATTTCTGG ATGCCGCAGA AAAGGTGCTG GATCAGTTTG GAAACCGGAA CTCCATGCAT TACCGGGACA TTACTGACAA GGCCCTGAAT CAGGGTTGGT TGAACACCAC TGGCAAGACA CCCGAAGCAA CAATGAATGC TCAACTGGTG ACAGAGCTCA AACGGGCCAA AGCCAGCGGG GAGCCCGGCC GATTTGTCCG CACATCACCC GGGTATTACA GCCTTGTTGA ATGGATGGGG ACTGGCTTGC CGTATCAAAT CTCAAAGCAC AATCGAGAGA TCCGGAAAAA GCTGCTGTCT CAGCTAATGG ATTTGAGCCC GGCTCAATTT GAGGAGCTCG TCGGACAACT GCTGGCCGAG ATGGGCTTTG AAAGCATCGA GGTAACCAAG TACGGAGGCG ACGGTGGCGT TGATGTCCGG GGCACGCTGC TGATCAGTGA TGTAGTCCGC ATTAAAATGG CTGTTCAAGC AAAACGCTGG AAAGGCAATA TTCAGAGTCC GACCGTTCAA CAGGTGCGTG GCAGCCTTGG TGCGCATGAG CAGGGCCTAA TCATCACCAC CAGCGACTTC AGCACCGGAG CCATCAAGGA GGCCAATCAG CCGGACAAAA CCCCCGTGGG CCTCATGAAT GGCGAGCAGC TGGTCACGCT TCTGATGGAA TACAACATAG GTGTCCGCCG CATGTCACAC GACCTTTTCG AGCTGGAGGA ACTGCCGATT GAAAAGGGAG ATGTATAG
|
Protein sequence | MSVQSATIQA LKEAGKPLHA NEIAKLIIEA GLWKSDGKTP EATVSASLYS DIKKNGDKSL FVKVGPQTFA LRDVQLVAVC DMKEPEPKIE SGPVSSQETY SFLDAAEKVL DQFGNRNSMH YRDITDKALN QGWLNTTGKT PEATMNAQLV TELKRAKASG EPGRFVRTSP GYYSLVEWMG TGLPYQISKH NREIRKKLLS QLMDLSPAQF EELVGQLLAE MGFESIEVTK YGGDGGVDVR GTLLISDVVR IKMAVQAKRW KGNIQSPTVQ QVRGSLGAHE QGLIITTSDF STGAIKEANQ PDKTPVGLMN GEQLVTLLME YNIGVRRMSH DLFELEELPI EKGDV
|
| |