Gene Information |
Locus tag | Cpha266_1614 |
Symbol | |
ID | 4571137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1835686 |
End bp | 1837443 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639766195 |
Product | restriction endonuclease |
Protein accession | YP_912059 |
Protein GI | 119357415 |
COG category | [V] Defense mechanisms |
COG ID | [COG1715] Restriction endonuclease |
TIGRFAM ID | |
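The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1835686
END_BP = 1837443
GENE_LENGTH_BP = 1758
PROTEIN_LENGTH_AA = 585


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1837443 - 1835686 + 1 gives 1758 bp, and 1758 / 3 - 1 gives 585 aa, matching the table.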
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000775525 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAACAC GCATCGGCCA ATTGTCCCTA AACTACAGTG CACGCGGCAT TCCCACTTAC ACACTCGAAC TCTGGCACGA CGGACTGAAA AAGCATCGCC TTATTCGTGG CGAAAGTGAA TCAATCGTCA ATCTGAAGGC AACACTGCAA GTTGAAGAAT GGGAGGAACG CTGGGCAGTT ATTGACGCTA AAGAGCGTGA CCGCTCACAG AAACTCGCCG GAAAACGGCA GATTGAAGAA AACAAATCGC TAGCCGTGGA ACGGACTGCT GAAGCCCAGC AAGAACTCGA ACGGCTAAAT TCGCTTCTGA AGGCAACCTT GGCGGTTGAT GACACTATTG ATTGGGAGAA GCTAAAGGAT AAAACGCCCT ACCCGGAAAA AAAACCGGTG ATGCCACCCA CACCACGGGA GCCCGTATTG CCGCAAATGC CAAGCGAACC ATTACGAGGT GACCAAAAAT ATATTCCCTC ATTAGGAATT CTCGACAAGC TGATAGTCTC TCGAAAGGAA CGTGCGGTTT CGGAAAAGCT GGCACTGTTT GCCTCCGATC ACAAGTCGTG GCAGGACGAG GTAGCCATAA TTACACGCAC ACACACAGCA GCGCTTTTGG TACATGGAAA GTCCGTTGCC GCCATGCGTG AAGAACACGA AAAGCAAGTT TCAGCATGGG ACAAACGACG CAACGAGTAT TTAAACAAAC AATCTGCTAC ACATGCTGAA GTTGACGCAA AACGCACTAC CTATGAGTCC AGCGATCCTG ATGCAATTAC CGAATACTGC GATTTAGTTC TTTCATCCTC GCGCTATCCA GTCTATTTCC CGCAGGAATA TGATCTCGAC TATGACGCAG CAACTAAAAC AATCATCGTT GATTACCGGC TTCCCGCGCC AGACGATCTT CCACGTTTGA AGGCAGTTAA GTACGTTGCA AGCCGTGATG AGTTTGAAGA GCAGTATATT TCCGAGGCTC AATCATCCAA GCTCTACGAC GATATTTTAT ATCAAGTCGT CCTACGCACA GTTCACGAGT TGTTCGAAGC GGACATCATC TCTGCAATTG AGACAATTGT TTTCAATGGT ATTGTCACTT CAACGGATCG TACAACCGGT AAGCCAACGA CAGCATGCGT TCTCTCACTG CGTGCCAATC GTGCTGAGTT CTTGGAGATT AACCTTTCAC AAGTCGATCC GAAGGCGTGT TTTAAGTCGC TCAAAGGTGT CGGAAGCTCA AAGCTCCATG GCTTGTCGCC GGTTCCACCC ATCATGCAGC TTCGGAGGGA CGATGGACGA TTCGTATCCG CTTACGAAGT CGCCAATACG CTTGATAGCA GCGTAAATTT AGCTGCTATG GACTGGGAGG ACTTCGAACA TTTGATTCGT GAGATTTTTG AAAAGGAATT TTCATCATCT GGTGGCGAAG TTAAGGTAAC TCAAGCAAGT CGCGATGGAG GTGTTGATGC CATTGCTTTT GATCCTGATC CCATTAGGGG CGGAAAGATC GTTATTCAAG CAAAGCGATA TACCAACACG GTCGGCGTTG GTGCGGTACG TGATCTCTAC GGCACCGTAG TGAATGAAGG TGCAACAAAG GGTATTTTGG TTACTACGTC CGACTATGGC CCCGACTCTT ATGCCTTTGC CAATGGAAAA CCCCTTGTTC TTCTCAGCGG TGCTAACTTG TTACATATTC TGGAGAAACA TGGTCATCAA GCCCGCATTG ACATACAGGA AGCAAGAAAG CTTACTGCAA AGCTATGA
Protein sequence | MKTRIGQLSL NYSARGIPTY TLELWHDGLK KHRLIRGESE SIVNLKATLQ VEEWEERWAV IDAKERDRSQ KLAGKRQIEE NKSLAVERTA EAQQELERLN SLLKATLAVD DTIDWEKLKD KTPYPEKKPV MPPTPREPVL PQMPSEPLRG DQKYIPSLGI LDKLIVSRKE RAVSEKLALF ASDHKSWQDE VAIITRTHTA ALLVHGKSVA AMREEHEKQV SAWDKRRNEY LNKQSATHAE VDAKRTTYES SDPDAITEYC DLVLSSSRYP VYFPQEYDLD YDAATKTIIV DYRLPAPDDL PRLKAVKYVA SRDEFEEQYI SEAQSSKLYD DILYQVVLRT VHELFEADII SAIETIVFNG IVTSTDRTTG KPTTACVLSL RANRAEFLEI NLSQVDPKAC FKSLKGVGSS KLHGLSPVPP IMQLRRDDGR FVSAYEVANT LDSSVNLAAM DWEDFEHLIR EIFEKEFSSS GGEVKVTQAS RDGGVDAIAF DPDPIRGGKI VIQAKRYTNT VGVGAVRDLY GTVVNEGATK GILVTTSDYG PDSYAFANGK PLVLLSGANL LHILEKHGHQ ARIDIQEARK LTAKL
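Both sequence fields are displayed in space-separated blocks of ten characters, so they need to be cleaned before use in other tools. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and the sample string is only the first two blocks of the gene sequence above, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character display blocks and uppercase."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence (illustrative sample only).
sample = "ATGAAAACAC GCATCGGCCA"
cleaned = clean_sequence(sample)
print(cleaned, len(cleaned), gc_fraction(cleaned))
```

Applying the same two functions to the full Gene sequence field gives a value that can be compared with the GC content listed in the Gene Information table.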