Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1712 |
Symbol | |
ID | 4571072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1940022 |
End bp | 1942010 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639766295 |
Product | N-6 DNA methylase |
Protein accession | YP_912154 |
Protein GI | 119357510 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.857909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTGGA TCGCACCTTC CGAAAAAGAT ACCGCTACCG CCGCACTCGA AAAGCGCCTG TGGGATGCCG CCGACCAGCT TCGGGCGAAC TCCGGCCTCA AGGCCCAGGA GTATTCCGCA CCCGTTCTCG GGCTTATTTT CCTGCTTTTT GCCGACGTGC GGTTCGCCGC CCGGAGGGCT GAGCTTGAAT CGGCAAAGAG CAGTACTCGC CGGGGGAGCC GGGTGGACGA TCCGGCGGCC TATCATGCCG AAGGCGTGCT CTACCTTTCG CCAGAAGCGC GGTTTGTCTA CCTGCTCAAC CGTCCTGAAG CCGAGAACAT CGGCGTGATG GTCAACGAAG CCATGCGCGC TATCGAGAAG CACAATCCGC AGCTTGCCGG TGTGCTGCCG AAGACCTACT ACCTGTTCGA CAGCCCGCTT CTGAAGCAGT TGCTGAAAAA GGTGTCGGAG ATTCCCTCTT CGATGGATTA CGATGCGTTC GGACGCATCT ACGAGTACTT TCTCGGCGAA TTCGCCATGA GCGAAGGGCA GGGCGGGGGA GAGTTCTATA CGCCCGTCAG CATCGTTCGT CTCCTGACCG AAGTGATCGA GCCATATCAC GGGCGCATTC TCGACCCCGC ATGCGGTTCC GGCGGCATGT TCGTCTCCTC GGCCCGGTTC GTTGCCCAGC ACAAGCAGAA CCCTTCGGCA GAACTCTCCA TTCACGGCAT CGAGAAGACC GACGAGACGG GAAGGCTTTG TCGCCTGAAC CTTGCGGTGC ACGGGCTTGA AGGCCGTATC ATGCATGGCG GCAACGTGAA CAGCTACTAC GACGATCCGC ATGATGCAAC GGGGAATTTT GATTTCGTGC TGGCCAATCC GCCGTTCAAT GTTAACGCCG TTGACAAGGA ACGCCTGAAA GATTCGGTCG GTCCGGGACG ACGCTTTCCT TTCGGTCTTC CGCGAACCGA CAACGCGAAC TATCTCTGGA TACAGCTTTT CTACTCGGCA CTGAACGAAA GGGGGAGAGC CGGTTTCGTC ATGGCGAACT CGGCTTCCGA CGCCCGCTCC TCGGAGCAGG AAATCCGTCG CCAGCTTATC GAAAGCCGTA CGGTGGACGT AATGGTCGCA GTCGGGCCGA ACATGTTCTA CACCGTCACG CTGCCCTGCA CGCTGTGGTT TTTCGACAAG GCGAAAGCAA GGCTTTCGCC ACCCTCATCC CCGGCCCTTC TCCCAAAGGT AGAAGGGGGA GAAGAAGATT TACCATTATC CAGACGCATT CTCACAGAGC GCGACGGGGA AGGGAATGTA CCGAACAGGG CGGATACGGT GCTGTTTATC GATGCACGGC ACATCTACCG GCAGGTTGAC AGGGCTCATC GCGACTGGAC GCCCGCCCAG ATCGGCTTTA TGGCCAACCT TGTCCGTCTC TGGCGCGGCG AAGCGCTCGA CTACACGCTG GGTGGCGACG AAGCTCGCGA AAAGATCGAA GAGGTTTTCG CTGCCAAAAG TTCTGACCCG GCAGGCTTGA ACGGGCAGGA GGGGCATTCT GCCCACGCAC TGGCCGCCGA ATCCCCCGCT CCATACGGAT CAAGTGATGA AATTGAAAAA CTCCCTTCTA CCTTTGGGAG AAGGGCCGGG GATGAGGGTG CTGTCAAGCA TGAGGGTGCC GGGAACGTCG CCTATCGCGA TGAAGCAAAA GAACACCCTT CGCCCTCTGG GAGAAGGGCC GGGGATGAGG GTGCTGTCAA GCATGAGGGT GCCGGGAACG TCGCCTATAG CGACATTCCC GGCTTATGCA AGGCCGCCAC ATTAAAGGAA ATAGAGGCGC AGGGCTGGTC GCTCAATCCC GGCAGATATG TCGGCGTTGC TCCCGGCGAG GCAATCAGCG ACGAGGATTT CAAGGTCCAG CTCGAAACGC TGAACGAAGA ACTGGAACTT CTGAATGCGC AGGCGCGTGA ACTGGAGGCA ACGATTGCCG GAAATGTGGC GCAAATTTTG GAGACCTGA
|
Protein sequence | MHWIAPSEKD TATAALEKRL WDAADQLRAN SGLKAQEYSA PVLGLIFLLF ADVRFAARRA ELESAKSSTR RGSRVDDPAA YHAEGVLYLS PEARFVYLLN RPEAENIGVM VNEAMRAIEK HNPQLAGVLP KTYYLFDSPL LKQLLKKVSE IPSSMDYDAF GRIYEYFLGE FAMSEGQGGG EFYTPVSIVR LLTEVIEPYH GRILDPACGS GGMFVSSARF VAQHKQNPSA ELSIHGIEKT DETGRLCRLN LAVHGLEGRI MHGGNVNSYY DDPHDATGNF DFVLANPPFN VNAVDKERLK DSVGPGRRFP FGLPRTDNAN YLWIQLFYSA LNERGRAGFV MANSASDARS SEQEIRRQLI ESRTVDVMVA VGPNMFYTVT LPCTLWFFDK AKARLSPPSS PALLPKVEGG EEDLPLSRRI LTERDGEGNV PNRADTVLFI DARHIYRQVD RAHRDWTPAQ IGFMANLVRL WRGEALDYTL GGDEAREKIE EVFAAKSSDP AGLNGQEGHS AHALAAESPA PYGSSDEIEK LPSTFGRRAG DEGAVKHEGA GNVAYRDEAK EHPSPSGRRA GDEGAVKHEG AGNVAYSDIP GLCKAATLKE IEAQGWSLNP GRYVGVAPGE AISDEDFKVQ LETLNEELEL LNAQARELEA TIAGNVAQIL ET
|
| |