Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1120 |
Symbol | |
ID | 3748338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1510560 |
End bp | 1512143 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637773651 |
Product | type I restriction-modification system specificity subunit |
Protein accession | YP_379425 |
Protein GI | 78189087 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAC AAAAAGCAAC CCTTCGGCAA ACTCAGGGAA AACAGGAAGA ACCTTTAGAA AAACAGCTTT GGAAAACCGC CGACAAACTC CGCAAGAACA TTGATGCGGC AGAATACAAA CACATTGTGT TAGGGTTAAT CTTCTTGAAA TATATTTCTG ATTCATTTGA AGAACTCTAT GCAAAGCTAC AAGCAGAGGA AGCAAACGGA GCAGACCCCG AAGACAAAGA TGAATACAAA GCTGAAAATG TATTCTTTGT TCCGCAGGAT GCACGATGGA ATTATTTGCA ATCGAAAGCA AAACAACCTG AAATTGGAAA GTTTGTGGAC GATGCAATGG ATGTTATAGA AAAAGAAAAT GCTTCACTAA AAGGAGTTTT ACCAAAAGTA TTTGCCCGAC AAAATCTTGA TCCAACAAGT TTGGGCGAAC TGATTGACTT GGTTGGAAAC ATTGCCTTAG GTGATGCAAA AGCAAGAAGT GCCGATGTGC TTGGGCATGT TTTTGAATAT TTCTTAGGTG AGTTTGCTCT TGCAGAAGGC AAAAAAGGTG GGCAGTTTTA TACGCCAAGA AGCGTTGTAG AATTATTGGT TGAAATGTTG GAGCCATACA AAGGAAGAGT TTTTGACCCT TGCTGTGGTT CGGGTGGAAT GTTTGTTCAC TCCGAAACGT TTGTAACAGA GCACCAAGGG AAAGTAAACG ACATCTCTAT TTACGGGCAG GAAAGCAACC AAACAACGTG GCGCTTATGC AAAATGAACC TTGCGATTCG AGGTATTGAT AGCTCACAAG TGAAATGGAA CAACGAAGGC TCTTTTTTAA ACGATGCACA TAAAGACCTG AAAGCCGATT ACATTATTGC TAATCCACCA TTCAACGTGA GTGATTGGGG TGGTGATTTA ATGCGAAGCG ATGGACGTTG GCAATATGGT ACGCCACCAA CAGGCAATGC CAACTTTGCA TGGATGCAAC ATTTTATTTA CCACTTAGCA CCCAATGGAC AAGCAGGTGT TGTATTAGCA AAAGGTGCTT TAACATCTAA AACTTCAGGT GAAGGCGATA TACGAAAAGC ATTAGTTGAA AACGGTTTGA TTGATTGTAT TGTAAACCTG CCTGCCAAGT TGTTTTTAAA TACACAGATT CCTGCTGCCT TATGGTTTCT TCGTAGAGAT GCAAAATTTT TCGTCTCTAC AAATGGAAAA TTTCGCGACC GAAGCAATGA AATATTATTT ATTGATACCC GAAACTTAGG GCATTTAATA AATCGCAGAA CCCGTGAACT ATCAAAGGAA GACATATATA AAATCGCCAG CACTTACCAC GCATGGAGAA CGCTGCCTGA GGCTCTCAAT GGCAGCGCCT ATGCAGATAT CCTTGGCTTT TGTGCATCCG TTGCCATAAG CAAAGTAGCC GAATTGGATT ATGTGCTTAC GCCAGGACGT TATGTAGGCT TACCCGATGA TGAAGATGAT TTTGATTTTG CGGAACGTTT TACAGCGTTA AAAGCCGAGT TGGAAATGCA ATTGCAAGAA GAAGCTCAAC TGAATGCAGT GATTTCCGCT AACCTTTTAA AGATTAAGTA TTGA
|
Protein sequence | MAKQKATLRQ TQGKQEEPLE KQLWKTADKL RKNIDAAEYK HIVLGLIFLK YISDSFEELY AKLQAEEANG ADPEDKDEYK AENVFFVPQD ARWNYLQSKA KQPEIGKFVD DAMDVIEKEN ASLKGVLPKV FARQNLDPTS LGELIDLVGN IALGDAKARS ADVLGHVFEY FLGEFALAEG KKGGQFYTPR SVVELLVEML EPYKGRVFDP CCGSGGMFVH SETFVTEHQG KVNDISIYGQ ESNQTTWRLC KMNLAIRGID SSQVKWNNEG SFLNDAHKDL KADYIIANPP FNVSDWGGDL MRSDGRWQYG TPPTGNANFA WMQHFIYHLA PNGQAGVVLA KGALTSKTSG EGDIRKALVE NGLIDCIVNL PAKLFLNTQI PAALWFLRRD AKFFVSTNGK FRDRSNEILF IDTRNLGHLI NRRTRELSKE DIYKIASTYH AWRTLPEALN GSAYADILGF CASVAISKVA ELDYVLTPGR YVGLPDDEDD FDFAERFTAL KAELEMQLQE EAQLNAVISA NLLKIKY
|
| |