Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1149 |
Symbol | |
ID | 6374824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1235518 |
End bp | 1238592 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642683651 |
Product | type III restriction protein res subunit |
Protein accession | YP_001959568 |
Protein GI | 189500098 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.511254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0779427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTGA TATTCAAACA ACAACCCTAC CAAACGGATG CCACTATGGC GGTTGTCGGC TGCTTTGAAG GGCAAAGCAA AGGCTTTAGA AAAGAAGTGG TTGGCAGAGA AACCCTCGAC CACGGGCTGT TTGGCACAGA AGTCAAAGTT GAAGAGATCT TCTCCAATAA AAAACTGGAA ATTACCGAAG TCGATATATT AAAAAATGTG CAAGCACTTC AAAAAGAACA GGGCTTAAAA ACCATCAGCA AACTGGATGG TCTCAACTTT ACCACCGAAA TGGAAACCGG AACCGGTAAA ACCTATGTCT ATACCAAAAC CATGTATGAA CTCAATAAAC ACTACGGCTG GAATAAGTTT ATTATCATGG TGCCATCAGT GGCGATTCGT GAGGGTGTAC ATAAATCGTT AGAAATTACC GCAGACCATT TTCAGGAGAT TTACGGTAAA AAAATTCGTT TTTCCATCTA CAACACCCAG AACAAATCTA ATCTGATAAA CATCAAGAGC TTTGCCAACA CTTCCAATAT CGAAGTGCTT ATCATGAACT ATCAGGCATT TGCCACCACC AGCCAGGAAT CCAGAAAGAT TTATCAAAAA CTCGATACAC TGCAAAGTGA ACGGCCTATT GATATTATTA AACGAGCCAG ACCCATTCTG ATTATTGATG AACCACAACG AATGGGTATT GAAGAAGGTA AAGTTTTCTC TGGGAATAAG CCATCAAAAG TAGTCGAAGC AATATATCGG AATAATGAAT TTAATCACCT TTGTACTCTA TTGTATTCAG CGACCCATAA AAAAGATTTC AATAAAATTT ATCGCCTCGA TGCAATTGAT GCCTACAACC AGAAACTGGT GAAAAAAATA AGTGTCAAAG GGATTGAGGT TGTCGGAAAC AGCGGAACCA ACAGCTATTT GTTTTTAGAT AAAATCCAAA TCAGCACGAG CCAATTTCCA GTTGCCTATG TTGAGTTGGA AGTTAAACAG GCTAACGGTA TTCAGAAAAA AATCAGACAG ATAAAAGAGA AAGATGACCT CTATGTATTC TCCAATGAAT TGAAACAATA CAAAGGCTTT GTGGTAAAGT CGATTAACGG TCTTACCAAT ACCGTTTCAT TCACCAACGG AATTACCCTC TCTGTAGGGC AAACCGCAGG AGACGTGGAC GAAGAGCATG TTCGCCGTAT ACAGATTCGG GAAACGATTA AATCCCATAT TGAAAAAGAA CGAATCATGT TTCAAAAGGG AATTAAAGTA CTGTCCCTTT TCTTTATCGA TGAAGTGGTA AAGTACCGTG ATTATTCAAG TCCCGACCAG AAAGGCATTT ATGCCAAAGT TTTTGAGGAT GAATACCGCC AGGCAATCAG CGAGCTAAGT CTTTTTGAAC CGGACTACAA CGACTATCTT AGTAAATTTA CTGTGGAAGA TATTCACAAA GGTTACTTCT CCGTTGATAA AAAAGGACTG TTTATAGATT CCAAAGAAAA ACGCGGCGAA AGCGGTAGTG ATGATGTGAG TGCCTATGAC CTCATTATGA AAAAGAAGGA AATACTTCTT GATCTAAAGG AATCTACCCG TTTTATATTT TCTCATTCTG CGCTTCGTGA AGGTTGGGAT AACCCCAATG TATTTCAGAT TTGCACTCTC AAGCACAGCC AATCAGAAAT CAGCAAGCGA CAGGAGATAG GACGTGGGCT TCGTATTTGT GTCAATTCAA AAGGCGAGCG CATGGATGCA TCGGTACTTG ATAGTGATTT TTTTGATGTG AATAAACTCA CGGTAGTTGC CAGTGAATCC TACGATTCCT TTGCCAAAGC CTTGCAGAAT GAAATTGTTG AATCACTTTC AGAGCGTCCA GTTACCCTTA CGATTGAGGT ATTAAACAAC CGTGTCATTC ATAACGAAAA AGGCGAAAAG TTTGTTTTTG ATAGCCAGTC CTCTATGGAT TTGATTTTCG ATATGAAAAC CAAGGGCTAT CTGGACGCAA ATTACCATAT TACCGAAGCC CTGATTACAG ATGTGGAAAA TAAAACTTAT GCTCTCCCGG AAAAGTTACA AGGTTTTGAA TCTTGCGTTG CTGAACTTAT GACCGGTATC TACACAACGG CAAACTTTAA AGCAGCAGAA AATGAAAATG CCAATAATAT CAATGAGGCT ATTCTAAAAC CTAACGATAA CTTTGCCAAA AAAGAGTTTC AGGATTTATG GAAGAAGATT AAAGTAAAAA CGGTCTATGA AGTGGATTTT GAAAGTGCAG AGCTTGTTGA TCTCTGCGTT AAGGCAATTG ACACCAATCT CACCGTTAAA AAAATATTGA TAAATATTAC TTCCGGTGAA CAGCAAGACA AGATTGACGA GGCGACACTC AAATCCGGCG AAAGCATGAA GAAAGAAAAA AGTGTTACCG AAAGAGCCGA GTCTTTGCTG GGTTCACTGA AGTACGATCT GGTAGCTGAA ATTGCCAAAG AGACAAATCT TACCCGCAAA ACGGTTGTTA AAATATTGCA GGCATTACGG CAGGATACCT TCCATTACTT CAGGGTAAAT CCAGAAAGCT TTATTCAGGG TGTAACCAAT ATCATCAATA GCGAAAAAGC TGCAACCCTT ATTAACAACA TTGTCTACTC AAAAACAGAC AAGACATACG ATGACAGTAT TTTTACCATC AATAACTTTA AGGGATCACT CACAAAGAAT ATTCTGGAAG TCAAAAAGCA TGTTCATAAC TATGTAAAGA CCGATTCTGA TATTGAAAGA AGATTTGCTA CTGATCTTGA ATGTGAAGAA GTGCTTGTCT ATGCAAAACT ACCAGGTGGC CCGAATGGTT TTAAAATACC AACACCACTG GGAAACTATA ATCCTGACTG GGCTATCGTT TTTAATACTG ACAAATTCAA GTATGTCTAT TTTATTGCAG AAACCAAGGG AAGCATGGAA ACCTTGCAAC TGAAAGAAAT AGAGCAGAAA AAAATCAGTT ACGCCAAAAA GCATTTTGAG GCATTAGGAC ATGCAGATAT TAAATACGAT GTGATTGATT CGTATCAGGC ATTGAGAGAC AAAATAATGA ATTAA
|
Protein sequence | MKLIFKQQPY QTDATMAVVG CFEGQSKGFR KEVVGRETLD HGLFGTEVKV EEIFSNKKLE ITEVDILKNV QALQKEQGLK TISKLDGLNF TTEMETGTGK TYVYTKTMYE LNKHYGWNKF IIMVPSVAIR EGVHKSLEIT ADHFQEIYGK KIRFSIYNTQ NKSNLINIKS FANTSNIEVL IMNYQAFATT SQESRKIYQK LDTLQSERPI DIIKRARPIL IIDEPQRMGI EEGKVFSGNK PSKVVEAIYR NNEFNHLCTL LYSATHKKDF NKIYRLDAID AYNQKLVKKI SVKGIEVVGN SGTNSYLFLD KIQISTSQFP VAYVELEVKQ ANGIQKKIRQ IKEKDDLYVF SNELKQYKGF VVKSINGLTN TVSFTNGITL SVGQTAGDVD EEHVRRIQIR ETIKSHIEKE RIMFQKGIKV LSLFFIDEVV KYRDYSSPDQ KGIYAKVFED EYRQAISELS LFEPDYNDYL SKFTVEDIHK GYFSVDKKGL FIDSKEKRGE SGSDDVSAYD LIMKKKEILL DLKESTRFIF SHSALREGWD NPNVFQICTL KHSQSEISKR QEIGRGLRIC VNSKGERMDA SVLDSDFFDV NKLTVVASES YDSFAKALQN EIVESLSERP VTLTIEVLNN RVIHNEKGEK FVFDSQSSMD LIFDMKTKGY LDANYHITEA LITDVENKTY ALPEKLQGFE SCVAELMTGI YTTANFKAAE NENANNINEA ILKPNDNFAK KEFQDLWKKI KVKTVYEVDF ESAELVDLCV KAIDTNLTVK KILINITSGE QQDKIDEATL KSGESMKKEK SVTERAESLL GSLKYDLVAE IAKETNLTRK TVVKILQALR QDTFHYFRVN PESFIQGVTN IINSEKAATL INNIVYSKTD KTYDDSIFTI NNFKGSLTKN ILEVKKHVHN YVKTDSDIER RFATDLECEE VLVYAKLPGG PNGFKIPTPL GNYNPDWAIV FNTDKFKYVY FIAETKGSME TLQLKEIEQK KISYAKKHFE ALGHADIKYD VIDSYQALRD KIMN
|
| |