Gene Information |
Locus tag | Cag_1309 |
Symbol | |
ID | 3747398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1775083 |
End bp | 1777884 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637773846 |
Product | methylase |
Protein accession | YP_379612 |
Protein GI | 78189274 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
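
The coordinate, length, and protein-length fields above are mutually consistent, which can be checked with a little arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive (as the reported gene length implies) and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above (Cag_1309 on NC_007514).
length = gene_length(1775083, 1777884)
print(length, protein_length(length))  # prints: 2802 933
```

Both results match the Gene Length and Protein Length rows of the record.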
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00329115 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATAA CTAAAAAAGA AATTCGTGAT AATGCGTTGC GCTTTGCTCT TGAGTGGCGG GATGCTTCGC GTGAACGGGC TGAGGCGCAA ACGTTTTGGA ATGAATTTTT CCAGATTTTT GGCGTTTCAC GTCGGCGTGT GGCATCGTTT GAAGAGCCGG TTAAAAAGCT TGGCGAAAAG CGTGGTTCGA TTGATTTGTT TTGGAAAGGT ACGTTGGTTG TTGAACATAA ATCGCGTGGT GGAAATCTTG ATAAAGCATA TAATCAGGCG CTTGATTATT TCCCGGGATT GAAAGAGGAA GAGTTGCCGA AATATGTATT GGTTTCAGAC TTTGATCGCT TTTGTTTATA TGATTTAAAT GAAAACACTC AATGTTCTTT TCTACTCAAT GAATTACCTG AGCATATTGA CTTATTTGGC TTTATTTCGG GTCATCAAAT AAAAGTGTAC AAAGATGAAG ATCCCGTTAA TATTCAGGTT GCTGAAAAAA TGGGAGAGCT TCACGATGCG TTGTTGGATT CAGGATATGA TGGGCACGAT TTAGAGGTTT TTTTAGTGCG TTTGGTTTAT TGCTTATTTG CGGATGATAC GGGTATTTTT AATGCTAAAG GCGATTTTGA AGATTATCTT CGCAATAAAA CAAAGGAGAA TGGTTCTGAT ACAGGTTCTA TACTTGCTAA CATGTTTCAA GTATTAGATA CGCCTTACGA AAAACGCCAA AAAACGCTTG ATGAGGATTT AGTTAACTTT CCTTATGTGA ATGGTGATTT GTTTCGTGAA CCGTTGCGTA TTGCTCATTT TAATGGTGCA ATGCGTGAAC TCTTGCTGGA ATGTTGTTTG TTTGATTGGA GTAAAGTATC GCCTGCTATA TTTGGTTCGC TGTTTCAAAG TGTGATGGAT AGAAAACGGC GGCGCAATCT TGGCGCTCAT TACACATCAG AAAAAAATAT TCTTAAAGTT ATTTGTGGAT TGTTTTTAGA TGATCTTCGT CGTGAATTTG AAGCCATTAA AAATGATGCT CGCAAGGTTA CCGCTTTCCA TAATAAGATT GCTGCTATGC GTTTTTTTGA TCCTGCATGT GGTTGTGGCA ATTTTTTAGT TATTACCTAT CGTGAAATTC GTCAGCTTGA AATTGAGGTA TTGCAACAAT ATTTCATGCT GACCTCAAAA ATGTATAAAG CTGGTGTAAC CCAGCTTGAA ACTGATATAG AAACAATTTC AAAAATTGAT GTGAATCAGT TTTATGGCAT TGAAATTGAA GAGTTTCCTG CACGAATTGC GCAAGTTGCT TTGTGGTTAA CCGACCATCA AATGAATATG CGCCTTTCGC AAGCCTTTGG GCAAACCTAT GTGCGCTTAC CTCTTCAACA TGCGCCAAAT ATTATTTGCG ATAATGCGCT TCGCAAAGAT TGGGAAACAG TTATTCCATC AAAAGAGCAT CTCTATATTC TTGGCAATCC ACCATTTATT GGCAAGCAAA ATCGCAATGC AGGGCAAATG GCAGATATGG ATGTTATTTG CCAACCTTTG AAAGCAAAAG GGTTACCAAA TTATGGCGTG CTTGATTATG TAGCGTTGTG GTACATAAAA GCAGCTCTTT TTATAGAAAA CAGTAACGTG AAGGTTACAT TTGTTTCAAC CAACAGCATT ACACAAGGTG AGCAAGTTGC TGCATTATGG GAATTTTTAC TAAGAAAAGG TGTGAAGATA TTTTTTGCGC ACCGCACGTT TAAGTGGACA AATGAAGCAA GAGGTAATGC TCAGGTATTT TGTGTTATTA TTGGTTTTAC ATGGAATAAT 
ACGACACAGA AGAAACGATT GTTTGATTAT GAAACACCAC AAAGCGAATC GCATGAAATT GAAGCAAAAA ACATCAATCC TTATTTAATT GATGCAATTG ATATTGTTGT TTCATCGCGA AATAAGCCGC TTTGCAATGT TCCTGAAATG CTTTATGGCA GTAAGCCTGT TGATGATGGA AATCTTTTCT TTGATGATGA TGAAAAAGTA GAACTCTTAA AAAAAGAGCC TAAAGCAGAA CAATTTATAC GTCGAGTAAT CAGCGCTCAT GAATTTATTA ATGGAAAAAA TCGCTGGTGT TTATGGTTAA AAGATATTGC ACCAAATGAA TGGAGAAATT TACCAGAGCT TGTCAAGCTG GTTGAAGCGG TGAGAGGGTT TCGGTTGAAA AGCAAAAAAG CTGCAACAGT AAAATTGGCA GAGGTACCTT ATCTATTTGG AGAAATTCGC CAACCAGAAA CTAATTATAT TGTTATTCCT CTTCATTCTT CAGAACATAG AAAGTTTATT CCGACTGGAT ATTTTTCAAA GGATAATATT CTTCATAATT CTTGTTCTGC TGTGCCAAAT GCGACATTAT ATCATTTTGG CATATTAACC AGCACAATGC ACATGGTTTG GATGCGTACT GTTTGTGGGC GAATAAAAAG CGATTATCGC TATTCCAATA ATTTAGTTTA CAACAACTTT CTATTTCCTC ACGACATAAG CAACAAGCAA AAAGCAAAAG TTGAAGAAAA AGCGCAAGCC GTTTTAAACG CTCGCGAACT CTTTCCAAAC TCAACACTTG CCGATTTGTA CGATCCGCTT ACCATGCCCA AAGCACTCCT AACAGCCCAT CGCGAACTTG ACGCAGCCGT TGACGCTTGC TATCGCAAAA CGCCCTTCCA AAACGAGCTT GAACGGTTAG AATTTCTTTT TCAACTCTAC AGTTCTTACA CTCAACCACT TGTTCCAGCA ATGGATGCAA AACCAAAAAG AAAGCGGATG GGAAAGGGGT GA
|
Protein sequence | MPITKKEIRD NALRFALEWR DASRERAEAQ TFWNEFFQIF GVSRRRVASF EEPVKKLGEK RGSIDLFWKG TLVVEHKSRG GNLDKAYNQA LDYFPGLKEE ELPKYVLVSD FDRFCLYDLN ENTQCSFLLN ELPEHIDLFG FISGHQIKVY KDEDPVNIQV AEKMGELHDA LLDSGYDGHD LEVFLVRLVY CLFADDTGIF NAKGDFEDYL RNKTKENGSD TGSILANMFQ VLDTPYEKRQ KTLDEDLVNF PYVNGDLFRE PLRIAHFNGA MRELLLECCL FDWSKVSPAI FGSLFQSVMD RKRRRNLGAH YTSEKNILKV ICGLFLDDLR REFEAIKNDA RKVTAFHNKI AAMRFFDPAC GCGNFLVITY REIRQLEIEV LQQYFMLTSK MYKAGVTQLE TDIETISKID VNQFYGIEIE EFPARIAQVA LWLTDHQMNM RLSQAFGQTY VRLPLQHAPN IICDNALRKD WETVIPSKEH LYILGNPPFI GKQNRNAGQM ADMDVICQPL KAKGLPNYGV LDYVALWYIK AALFIENSNV KVTFVSTNSI TQGEQVAALW EFLLRKGVKI FFAHRTFKWT NEARGNAQVF CVIIGFTWNN TTQKKRLFDY ETPQSESHEI EAKNINPYLI DAIDIVVSSR NKPLCNVPEM LYGSKPVDDG NLFFDDDEKV ELLKKEPKAE QFIRRVISAH EFINGKNRWC LWLKDIAPNE WRNLPELVKL VEAVRGFRLK SKKAATVKLA EVPYLFGEIR QPETNYIVIP LHSSEHRKFI PTGYFSKDNI LHNSCSAVPN ATLYHFGILT STMHMVWMRT VCGRIKSDYR YSNNLVYNNF LFPHDISNKQ KAKVEEKAQA VLNARELFPN STLADLYDPL TMPKALLTAH RELDAAVDAC YRKTPFQNEL ERLEFLFQLY SSYTQPLVPA MDAKPKRKRM GKG
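
The protein sequence is the conceptual translation of the gene sequence under translation table 11. A minimal sketch of that check in plain Python, assuming the sequence is pasted in the space-separated 10-base blocks shown above. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the amino-acid assignments of table 11 are taken to be the same as those of the standard code, since the two tables differ in their permitted start codons rather than in codon meanings:

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGCCAATAA CTAAAAAAGA AATTCGTGAT"))  # prints: MPITKKEIRD
```

Pasting the full gene sequence into `translate` and `gc_percent` should allow the Protein sequence and GC content rows to be compared against the record in the same way.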