Gene Information |
Locus tag | Cphamn1_0749 |
Symbol | |
ID | 6374414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 794653 |
End bp | 797631 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642683257 |
Product | type III restriction protein res subunit |
Protein accession | YP_001959183 |
Protein GI | 189499713 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
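The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 794653
end_bp = 797631
gene_length_bp = 2979
protein_length_aa = 992

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under those assumptions the listed gene length (2979 bp) and protein length (992 aa) are consistent with the start and end positions.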
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.186322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGA TCACCTCCGA ATACGCCTTT GAGGAACACA TGGAAGAAGT CCTGCTGAAG TCGCATGGCT ATTTCCCCGC CCAGCAGGTG GACTACGACA AGGCCCTGTG CCTGCGGCCC GAGACGGTCA TCTCCTTCAT CCGCGCCACG CAGACCAAGA AGTGGGCCGA CTACTGCGAG TTGATCGGCG ACAAGGGGCA GGCGACCCGC AACCTGCTCA AGCGCATCAA GGAGGTGGTG GACAAGGAAG GCACGCTGCA CGCCCTGCGC AAAGGCTTCG ACATCCACGG CTCTGGGCAT TTTGACCTGT GCTACTTCGA GCCCACCAAC CCGGTGGCGG AGGAGAGTCG GCGGCTGTAT CAGGAAAACC TCCTGCACGT CCAGCGGCAG GTGAAGTTCT CGGAAAGCGA CGAGAAGTCG CTCGACATGG GCATTTTCCT CAACGGCCTG CCCATTTTCA CCATCGAGCT GAAGAACCAG ATCTCGGGCC AGAACGTCGG CCACGCGATG AAGCAATACA AGAACACCCG CGATCCCAAG GAGCCGCTGT TCCGCTTCAA GCGCTGCCTC GCCCACTTCG CGGTGGATAA CGATCTGGTC TATGTGGCCA CCGAACTCGC CGGAGCCAAG ACCCAGTTCC TGCCCTTCAA CCAGGGGAAC GACGGCGGCG CAGGCAACCC GCCCTGCAAG ATCGGCTATG CCACCAGCTA CCTGTGGCAG GAGGTCTGGA AGAAGCCGCG CATCCTCGAC CTGATCCAGC GTTTCATCCG CGTGATGGAT GTGCTGGATG ACAAGGGCAA GAAGACCGAC AAACAGCGGC AGATTTTCCC GCGCTTTCAC CAGCTCACCT GCGTGCGCGA ACTCGGGGCC GATGCCCAGC AGCACGGAGC GGGAAGGCGT TACCTGAACC AGCACAGCGC GGGCAGCGGC AAGACCAACT GCATTGCATG GCTGGCCAAC AGCCTTGCCA CCCTGCACAC CAAGGAGGGC CAGCCGGTCT TCAGTTCGGT GATCGTGATT TCCGACCGCC GGGTGATCGA CCGTCAGCTT CAGCGCACGC TTACGCAAGT GATCGAGACC CCCGGCATGT TGGTCAACAT CGCCGCCGAT GACGGCATGA CCTCCAAAGA TCTCAAGCGG GCGCTGGAGG ACGGCAAGCG CATCATCGTC ACTACGCTGC AAAAGTTTGG CGTGATCATG GACAGCATGG GCGATCTGCC GGGCGAACGC TTCGCGGTGA TCGTGGATGA GGCGCACTCG TCGCAGGCGG GGCAAAGCGC CCAGGCGGTG CAGAAGGTGC TCTCCTATTC CTCCGAAGAC GAGCAGAAGG ACGAAGAGGA GAAGACCACC GAAGACCGGA TTCTGGAAGA GCTGAAGACA CGCGGCCCGC AAAAGAACGT GAGTTATTTC GCTTTCACCG CCACGCCCAA GCCGGAGACC TTGCAGCAGT TCGGCACCAA GCATAAGGAC GGCACTTTCA AGCCATTCAG TCTCTACACC ATGAGGCAGG CGATTGAGGA AAAGTTCATC CTCGATGTGC TCAAGAACTA CACCACCTAC GACCAATACT GGGCCTTGCT CAAGAAGGTG AAGGACGATC CGGAGTTCGA GGAGGTAAAG GCCAAGTCGC TTCTCAAGCA GTTCGTCAGC CGCCATGAGC GCACTATCGC CAAGAAGGTC GCCATCATCG TGGCGCATTT CCAAGCATCG GTCAGCGACA AGCTGGGCGG CAAAGCCAAG GCGATGATCG TCACCTCCTC CCGCCTGCAC GCCGTGCGCT ACAAGCTGGC TGTGGATGCC 
TGTCTAAAAA AGAAGGGCAT TCCGTTCAAG GCGCTGGTGG CTTTCACCGA CGTGGTGAAG GATTCCAAGG ACGGCAAGGA ATACACCGAA GCCAACATGA ACGGCTTCCC CGAGGCCGCG ACGGCGGATC GCTTCGAGGC GGATGAATAC CGCTTCCTGA TCGTGGCGAG CAAGTTCCAG ACTGGCTTCG ACCAACCCAA ACTGCTGGCG ATGTATGTGG ACAAGAAGCT CAGCGGCGTG GCCTGTGTTC AAACGCTCTC GCGCCTCAAC CGCACCATGG TGGGCAAAGA AGAGACCTTC GTGCTGGACT TCGAGAACAC CGCCGAGGAC ATCGAGAAAG GCTTCCAGCC ATTCTACGAC CGCATCACGC TGTCCAAGGA AACCGACCCC AATCAACTCT ACAACATCCG CACCGACCTC GATAAATTCG GGATTCACAC AGAGGCGGAT TTGAAGGCAT TCGCCGACCA ATGGTTCGCC CCGAAGCCCA AAATCGAAAA GCTCCACGCG CTCACCGATC CTGTGGCCGA GAAGTGGAAG AAGGAAGGCG AGGAAGAGCA GGTGGACTAC AAATCCAAGG CGCGGGATTT CGTGAAGCTT TACGCCTTCA TGTCGCACCT CGTCCCGCTG AAGGATACCG GGCTGGCGAA GCTCGACGTG TTCCTGCGCT TCCTGCTCCC CAAGCTGCCC GCCAAGAAAG GCGAGACCCC GCTGGAAGTG CTTGGAATGG TGGATATGGA AAAGCTCGCC GTGCGCAAGA AGGACAAAAA AGACATCGGC CTGAAACGCG GCGAGGAAAA GGTGGACCCA CTCAACTACG GCGGCGGCGC TACCCTCTCG GAAGAGGAGC GGGAGCAGCT ATCCAAGATC ATCGAAGACC TCAACACCCG CTTCAACACC TCGTTCACCG AAGACGAGAT CATGGTGATC AAGCAGCTCG AAAAGAAGAT CGGCGAGGAC GAGGCCCTCC AACAACAGCT CAAGAACGGT TCGCCCCACG CGGTCGCAGC AACCTTTGAG CAGGTGGCGA AAGATGCCTT CGAGGATTTG GTGAACGACA ACTTCAAGTT CTACAAAAAG GTGAGCGAGG ACGACGAAGT CTCGAAGGAG TTTTTTGCTC GCCTGTTTGA GTGGTATGTC GAGGGGCGCA AGAAGACAAC GCCGAAGAAA GAAAAGTAG
|
Protein sequence | MAKITSEYAF EEHMEEVLLK SHGYFPAQQV DYDKALCLRP ETVISFIRAT QTKKWADYCE LIGDKGQATR NLLKRIKEVV DKEGTLHALR KGFDIHGSGH FDLCYFEPTN PVAEESRRLY QENLLHVQRQ VKFSESDEKS LDMGIFLNGL PIFTIELKNQ ISGQNVGHAM KQYKNTRDPK EPLFRFKRCL AHFAVDNDLV YVATELAGAK TQFLPFNQGN DGGAGNPPCK IGYATSYLWQ EVWKKPRILD LIQRFIRVMD VLDDKGKKTD KQRQIFPRFH QLTCVRELGA DAQQHGAGRR YLNQHSAGSG KTNCIAWLAN SLATLHTKEG QPVFSSVIVI SDRRVIDRQL QRTLTQVIET PGMLVNIAAD DGMTSKDLKR ALEDGKRIIV TTLQKFGVIM DSMGDLPGER FAVIVDEAHS SQAGQSAQAV QKVLSYSSED EQKDEEEKTT EDRILEELKT RGPQKNVSYF AFTATPKPET LQQFGTKHKD GTFKPFSLYT MRQAIEEKFI LDVLKNYTTY DQYWALLKKV KDDPEFEEVK AKSLLKQFVS RHERTIAKKV AIIVAHFQAS VSDKLGGKAK AMIVTSSRLH AVRYKLAVDA CLKKKGIPFK ALVAFTDVVK DSKDGKEYTE ANMNGFPEAA TADRFEADEY RFLIVASKFQ TGFDQPKLLA MYVDKKLSGV ACVQTLSRLN RTMVGKEETF VLDFENTAED IEKGFQPFYD RITLSKETDP NQLYNIRTDL DKFGIHTEAD LKAFADQWFA PKPKIEKLHA LTDPVAEKWK KEGEEEQVDY KSKARDFVKL YAFMSHLVPL KDTGLAKLDV FLRFLLPKLP AKKGETPLEV LGMVDMEKLA VRKKDKKDIG LKRGEEKVDP LNYGGGATLS EEEREQLSKI IEDLNTRFNT SFTEDEIMVI KQLEKKIGED EALQQQLKNG SPHAVAATFE QVAKDAFEDL VNDNFKFYKK VSEDDEVSKE FFARLFEWYV EGRKKTTPKK EK
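The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is a minimal illustration, not the database's own method: the helper names `clean`, `translate`, and `gc_content` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it assumes the listed gene sequence is already given in coding orientation, as suggested by its leading ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 shares these
# amino acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Assumption: the sequence starts in frame. Alternative start codons
    (translated as M at initiation) are not handled in this sketch.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGGCCAAGA TCACCTCCGA ATACGCCTTT"))  # MAKITSEYAF
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content to be compared against values recomputed from the nucleotides.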