Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1710 |
Symbol | |
ID | 4571070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1937703 |
End bp | 1938947 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766293 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_912152 |
Protein GI | 119357508 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTCG AAAATTTGCA ATGCGCTGAG GCGCGACTTG GTCAGAACAA TTGCTTGCAA CTTGAGGAGG ATGCGGGATG TGGGATGATG TATGAACTTC GCACAATCCA TATTGGTGAC TTGGGTCGTG TCTTGACTGG AAAGACGCCC CCAAGTGTCC GGCCTGAACT ATTTGGAGAT GATCACCCGT TTCTTACACC AACAGACATT GACGGTGCTT CGCGTTACAT TGAGCCCGAA CGTTTTCTTT CGCCAGAAGG GCGCAACTAC CAGCAAAGAC TTATGCTACC TGGGCGATCC GTTTGCGTTG TCTGTATTGG TGCTACCATT GGCAAAGTTT GCATGACTGG CAGACCGTCT TTCACAAATC AACAAATTAA TTCCGTTGTC GTGAATGAGC AAGAGCACGA TCCGTTCTTT GTTTATCACC TGATGACAAC GCTTCGCGAC GAGTTAAAAG CTAATGCTGG TGGGTCGGCG ACTCCCATTA TCAATAAGAC GGCGTTTTCA GAAATCAAAG TACGTGTTCC CCCGCTTCCA GTTCAACGGC GGATTGCGGG CATACTGTCA ACCTACGACG AACTGATTGA GAACAGTCAG CGGCGCATCA AGATTCTGGA GGAGATGGCC CGATCAGTCT ATCGTGAATG GTTCGTTCAC TTCCGCTTTC CCGGCCACGA AAATGTTTCG CTCGTTTCGT CTTCTCTTGG TGCTATTCCG CAGGGGTGGG AGGCTGGTCG TTTAGACGAT GTGCTTGTTC TTCAACGTGG CTTCGATTTG CCTAAAGCCA AGCGGATGGA GGGTACTGTG CCCATTTACG CAGCTACCGG AGTTACTGGA TTTCACTGCG AAGCTAAGGT CAAAGCACCT TGTGTTGTGA CCGGAAGATC AGGCACAATT GGAGATGTCA TCTATGTACA GGAAGATTTT TGGCCACTGA ATACCTCACT TTGGGCGAAG GGTTTTCCAA AGTCGGAACC GCTTTATGCA TACTACGTGC TCTCTTCAGT TGGCTTGAAG CAGTTCAATT CCGGGGCGGC TGTTCCGACG CTTAATCGAA ATGACCTTCA TGGTCTTGAC GTGCTGATTC CTCCATGCGT ATTGCAAAAA CGATTTCAAA AAATTGCCGG TGCAATGTTA TTACAAACCC GCAATCTTGA ACTGCAAATT CAAAACCTTC GTCGGACGCG CGATCTACTG TTGCCGCGTC TGCTATCGGG GCAGGTCAAT CCCAAGGAGA ATTGA
|
Protein sequence | MKVENLQCAE ARLGQNNCLQ LEEDAGCGMM YELRTIHIGD LGRVLTGKTP PSVRPELFGD DHPFLTPTDI DGASRYIEPE RFLSPEGRNY QQRLMLPGRS VCVVCIGATI GKVCMTGRPS FTNQQINSVV VNEQEHDPFF VYHLMTTLRD ELKANAGGSA TPIINKTAFS EIKVRVPPLP VQRRIAGILS TYDELIENSQ RRIKILEEMA RSVYREWFVH FRFPGHENVS LVSSSLGAIP QGWEAGRLDD VLVLQRGFDL PKAKRMEGTV PIYAATGVTG FHCEAKVKAP CVVTGRSGTI GDVIYVQEDF WPLNTSLWAK GFPKSEPLYA YYVLSSVGLK QFNSGAAVPT LNRNDLHGLD VLIPPCVLQK RFQKIAGAML LQTRNLELQI QNLRRTRDLL LPRLLSGQVN PKEN
|
| |