Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0750 |
Symbol | |
ID | 6374415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 797634 |
End bp | 798944 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642683258 |
Product | putative type I restriction-modification system |
Protein accession | YP_001959184 |
Protein GI | 189499714 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.769779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.272821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCC CGCGCTATCC GAAATACAAG GCCAGCGGCG TGGAGTGGCT GGGGGAGGTG CCGGAGCATT GGCAGATGAT CAATAGCCGC CGGTTATTCC ACCAAGCAAA GGAATCACCC TTAACCGACG ACATCCAACT CTCAGCGACC CAAAAATACG GGGTTGTCCC TCAGAGCCTT TTTATGGAAA GCGACGGTAA AGTCGCTCTA GCGTTAAGCG GACTAGGGAA TTTTAAACAC GTCGAAGTAG ATGATTTTGT GATCAGTCTC CGCAGTTTTC AAGGAGGTAT CGAGCGGAGC AAATATTCAG GTTGCGTCAG CCCTGCTTAC ACCGTGCTGC GCCCGGCGGA ACCTATTGAT GGAAGCTATT GGGGCTTCCT ACTGAAATCA CGCCGATACG TCGAGATCCT CCAAACGATG AATGATGGGT TACGGGATGG CAAAAGTATC AGTTACCAGC AGTTCGGACA AATCCCCTTG CCTTCCCCGC CCCTCGCCGA GCAAACGGCC ATTGCGGAGT TTCTGGACCG GGAGACGGGG AAGATTGATG AGCTGGTGGC GGAGCAGCGG CGGCTGATGG AACTGCTGAA AGAAAAACGT CAGGCTGTCA TCTCCCACGC CGTCACCAAA GGCCTCAACC CCCACGCCCC CATGAAGCCT TCCGGCATCG AATGGCTCGG CGATGTGCCG GTGGGGTGGA GCGTGCTCAA ACTCGGAAAC ATTTCTCGTT TTAAAGGGGG GGCAGGGTTT CCCGATAGCT ACCAAGGTCA AACAGACAAC GAGATTCCGT TCTTTAAAGT CGGAGATATG GTGAACGCTG ACGACGCTCG CGTAATGCGG AGAGCTAATC ACACAATCAC TGAAGCTACA GCGAGAGAGC TGCGTGCTTT TGTCTTTCCT GAAAGCACCA TCGTGTTTGC CAAGGTTGGC GCTGCCTTAC TCCTGAAGCG ATACCGGTTA CTTGGGCAAA GGTCTTGCAT CGACAACAAC ATGATGGGAA TGACCGTGGG GGACGGTAGT TCGGTCGATT ATCTGCTCTA TGTTCTCCCG CTACTTGATC TCGAATTAAT TGTTAACCCC GGGGCCGTGC CATCAATCAA TGAAGGTCAG ATTTCTGGCC AACGGATCGC ACTTCCTCCG ATTGATGAGC AGCGAGAAAT TGTTGAATTT CTCACCTCAG TAACCGCCAA ATTCGACACC CTCACCGCCG AAGCGCAACG CACCATCGAC CTGCTGCAAG AACGCCGCAC CGCGCTCATC TCCGCCGCCG TCACCGGGCA GATTGATGTG CGCCAACCAC CCCGGAACTA A
|
Protein sequence | MSFPRYPKYK ASGVEWLGEV PEHWQMINSR RLFHQAKESP LTDDIQLSAT QKYGVVPQSL FMESDGKVAL ALSGLGNFKH VEVDDFVISL RSFQGGIERS KYSGCVSPAY TVLRPAEPID GSYWGFLLKS RRYVEILQTM NDGLRDGKSI SYQQFGQIPL PSPPLAEQTA IAEFLDRETG KIDELVAEQR RLMELLKEKR QAVISHAVTK GLNPHAPMKP SGIEWLGDVP VGWSVLKLGN ISRFKGGAGF PDSYQGQTDN EIPFFKVGDM VNADDARVMR RANHTITEAT ARELRAFVFP ESTIVFAKVG AALLLKRYRL LGQRSCIDNN MMGMTVGDGS SVDYLLYVLP LLDLELIVNP GAVPSINEGQ ISGQRIALPP IDEQREIVEF LTSVTAKFDT LTAEAQRTID LLQERRTALI SAAVTGQIDV RQPPRN
|
| |