Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1398 |
Symbol | |
ID | 4569307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1587158 |
End bp | 1588831 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765985 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_911851 |
Protein GI | 119357207 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC AGATCGCCAA CCTGCTTGAG CAGCACTTCG ATACAGCCTT TGCCGCCCCT GATGGCATCA AAAAGCTCCG TGAGCTGATT CTCACCCTCG CCATGCAGGG CAAGCTGGTG GAGCAGGACC CGAACGACCA GCCCGCCTCA GAACTGCTGA AAGAGATCGA AGCCGAGAAA CAGCGGCTGG TGACGGCAGG AAAAATCAAA AAGCCTAAAC CTCTGTCTGC AATCGTCAAT GATGAAATAC CATACGATAT TCCTTCAAGT TGGATATGGG TGCGATTCGG CGATATTGCC CGGCACAATT CTGGTAAGAC GCTGGACAAG GGGCGTAACA CAGGTGAGTC CCGTGATTAC ATAACCACGT CAAATCTCTA CTGGGGAAAG TTCGAACTCG AAAATGTTCG TCAAATGTTA ATCAGAGAGG ATGAACTCGA AAAATGCACA GCCAAAAAGG ACGATCTGTT GATTTGTGAA GGAGGGGAAG CGGGACGAGC AGCAATGTGG CCCTTCGATT CAGAAGTGTG TTTTCAGAAT CATATTCATC GTGCTCGATT TTACAAAGAC ATTGACCCGT ACTTTGTCTA CAGATTTTTC GAGAAATTGA GTGCCACTGG TGAGATCAAT CAGCATCGCA AAGGGGTTGG TATTTCCAAT ATGTCGAGTA AGTCTTTGGC CTCGATAGTG TTTCCATTGC CCCCCTTTTC CGAACAACAC CGCATCGTCG CTCGCATCGA CCAGTTGATG GCCCGTTGCA ATGAGCTGGA AAAGCTGCGC AAGGAACGGG AAGAGAAGCG GCTGATTGTC CACGCCGCCG CCATCAAGCA ACTGTTCGAT GCACCGGACG GTTCAGCTTG GGGTTTCATC CAGCAACATT TCAATGAACT CTACAGTGTC AAAGAAAACG TCGCCGAACT ACGCAAAGCC ATCCTCCAAC TCGCCGTCAT GGGTCGCCTC GTTCCCCAGG ATCAGAATGA TCCCCCAGCA TCGGAGTTGT TGAAGGAGAT CGAAAAAGAG AAGGCATCGC ACGAATGCAC GAAGTCACGA AGGAAAGGGG AGAAGCTACC GGAAATTTTT AATGAGGAAA TGCCCCACAA AATCCCGTCA AATTGGGCTT GGGTGCGATT TGGCGACATT GCTCAACACA ATTCCGGAAA AACGCTGGAC AAGGGGCGCA ACACAGGTCA ACCACGCGAA TATATCACTA CCTCAAATCT CTACAGGGGA AGGTTCGAGC TTGAAAATGT TCGTCAAATG TTAATCAGAG AGGATGAACT CGAAAAATGC ACAGCCAAAA AGGACGATCT GTTGATTTGT GAAGGAGGGG AAGCGGGGCG AGCAGCAGTG TGGCCCTTCG ATTCAGAAGT GTGTTTTCAG AATCATATTC ATCGTGCTCG ATTTTACAAA GACATTGACC CGTACTTTGC CTACAGATTT TTCGAGAAAT TGAGTGCCAC TGGTGAGATC AATCAGCATC GCAAAGGAGT AGGTATTTCC AATATGTCGA GTAAGGCTTT GGCCTCGATT GTGTTTCCAT TGCCTCCTCA GCCCGAACAA CATCGCATTG TAGCCCGAAC CGACCAGTTG ATGACGCTGT GCGACCAACT CGACCAGCAG ATCGATGACG CCGTTGGCAA ACAGACCGAA ATCCTGAATG CTGTGTTGGC GTAG
|
Protein sequence | MNKQIANLLE QHFDTAFAAP DGIKKLRELI LTLAMQGKLV EQDPNDQPAS ELLKEIEAEK QRLVTAGKIK KPKPLSAIVN DEIPYDIPSS WIWVRFGDIA RHNSGKTLDK GRNTGESRDY ITTSNLYWGK FELENVRQML IREDELEKCT AKKDDLLICE GGEAGRAAMW PFDSEVCFQN HIHRARFYKD IDPYFVYRFF EKLSATGEIN QHRKGVGISN MSSKSLASIV FPLPPFSEQH RIVARIDQLM ARCNELEKLR KEREEKRLIV HAAAIKQLFD APDGSAWGFI QQHFNELYSV KENVAELRKA ILQLAVMGRL VPQDQNDPPA SELLKEIEKE KASHECTKSR RKGEKLPEIF NEEMPHKIPS NWAWVRFGDI AQHNSGKTLD KGRNTGQPRE YITTSNLYRG RFELENVRQM LIREDELEKC TAKKDDLLIC EGGEAGRAAV WPFDSEVCFQ NHIHRARFYK DIDPYFAYRF FEKLSATGEI NQHRKGVGIS NMSSKALASI VFPLPPQPEQ HRIVARTDQL MTLCDQLDQQ IDDAVGKQTE ILNAVLA
|
| |