Gene Information |
Locus tag | Cag_1526 |
Symbol | |
ID | 3746585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2004998 |
End bp | 2006158 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637774066 |
Product | restriction endonuclease S subunits-like |
Protein accession | YP_379824 |
Protein GI | 78189486 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| |
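The coordinate and length fields in the table above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2004998
end_bp = 2006158

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1161 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) rows.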
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG AAGCACTTGG TAAACTCGTT GACATCAAGA CAGGAAAATT AGATGTAAAT GCAGGAACAG AATACGGTAA ATATCCCTTT TTTACTTGTG CCAAAACAGT TTACAGAATT AATCAATACG CATTTGATAA TGAAGCTATA CTTGTTGCTG GAAATGGCGA CTTGAACGTT AAGTACTTTA AAGGAAAATT CAATGCCTAT CAAAGAACCT ATGTAATTGA GAATAAAGAA GTAAATTTAT TATCCATGAA ATACTTGTAC TATTTTATGG AAACATATAT GATTCATCTA AGAAATGGAG CTATTGGAGG AATCATTAAA TACATTAAAA TTGATCACTT AACTAAAGCA GAAATCCCTC TCCCCCCACT TGACGACCAA AAACGCATTG CCCACCTACT CGGCAAAGTA GAGCGGCTAA TTGCCCAACG CAAACAACAT CTGCAACAGC TTGACCAACT GCTCAAAAGC GTTTTTCTGG AGATGTTCGG CTTCTTTGAT AAAACATATA CCAACTGGAC TATCGATACA TTAACATCGC ACACAGAGAT CGTATCGGGT ATTACAAAAG GAAAAAAATA CAAAACCGAT GAATTAATTG AAGTTCCGTA TATGCGTGTT GCAAATGTAC AAGACGAACA CTTCGTATTA GACGAAATCA AAACGATCTC TGTAACCAAA AACGAGATCA AGCAGTATCG GCTTCTTGCT GGCGATCTAT TATTAACAGA AGGTGGCGAT CCCGATAAGC TTGGGCGAGG CGCTGTTTGG CAAAACCAGA TTGAAAACTG TATTCATCAG AACCACATTT TTCGTGTTCG AGTAAACGAT AAATCCAGAA TTAACCCTGA CTATCTTAGC GCATTAATAG GATCTCCATA CGGAAAATCT TACTTCTTTC GTTCTGCAAA GCAGACAACT GGGATTGCCT CTATAAACTC AACTCAGTTG AAAAAATTTC CTATTGTAAT TCCCCCCATC GAACTCCAAA ACCGCTTCGC CACCATCGTT GAAAAAGTTG AAAGCATCAA AACGCACTAC CAACAAAGCC TCAACAACCT CGAAACACTT TACAACGCAC TAAGCCAAAA AGCCTTCAAA GGCGAGCTGG ATTTATCGCG CGTGGCGGTG CTGGTGGACG TTACACCTTA A
|
Protein sequence | MKKEALGKLV DIKTGKLDVN AGTEYGKYPF FTCAKTVYRI NQYAFDNEAI LVAGNGDLNV KYFKGKFNAY QRTYVIENKE VNLLSMKYLY YFMETYMIHL RNGAIGGIIK YIKIDHLTKA EIPLPPLDDQ KRIAHLLGKV ERLIAQRKQH LQQLDQLLKS VFLEMFGFFD KTYTNWTIDT LTSHTEIVSG ITKGKKYKTD ELIEVPYMRV ANVQDEHFVL DEIKTISVTK NEIKQYRLLA GDLLLTEGGD PDKLGRGAVW QNQIENCIHQ NHIFRVRVND KSRINPDYLS ALIGSPYGKS YFFRSAKQTT GIASINSTQL KKFPIVIPPI ELQNRFATIV EKVESIKTHY QQSLNNLETL YNALSQKAFK GELDLSRVAV LVDVTP
|
| |
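The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate a coding sequence. It assumes the input contains only A, C, G and T, and it uses the amino-acid assignments shared by the standard code and translation table 11; alternative start codons, which table 11 permits, are not handled. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODONS[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGAAAAAAG AAGCACTTGG TAAACTCGTT"
print(translate(prefix))  # prints: MKKEALGKLV
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` allows the GC content and Protein Length rows to be checked in the same way.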