Gene Information |
Locus tag | Cphamn1_0707 |
Symbol | |
ID | 6374372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 743860 |
End bp | 747159 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683219 |
Product | type III restriction protein res subunit |
Protein accession | YP_001959145 |
Protein GI | 189499675 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
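The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values taken from the Gene Information table for Cphamn1_0707.
length = gene_length_bp(743860, 747159)
print(length)                     # 3300
print(protein_length_aa(length))  # 1099
```

Both results match the Gene Length (3300 bp) and Protein Length (1099 aa) fields reported for this locus.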
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGAG GTAAATCAAA AGGTCTGCAA TCTCATGATT TTCGCAACAA GCTGCTGCTC AATCAATGGT TGATCAGCCT TTTTGGTATC GATCCACTTG TCGAGCACAT CCTTCGGGGT CAGAAGGTGA GGCCATTCCG TCTGCTTGCC GATCCGATCA GGGATGCCCG CCTTGAAGGG CGGGACAAGG ATAATCTGCA TTACTTCTAT CACTACCTCG GCGACAGCCC TCTGTTCTCA ATGGCTGGTG TTGAAAATGA CCGGTTCCGC ATCAGCCGCG AAATGCTGCT CAGATATGAG GAGAATATCG TAAGCCACAC CAGGGCGATC AACGAGAACC GCCATCGCCC GGTTATCTGG AAATATTACC AGTGGCTGAC GTTGCTGTTC ACCGAAATCT ATCTCGACCG CTTTTTCAGA GATCGAGAGG CCCTGCTTTC GGAACTGAAC GCCTTTGTCG AGCGCTTCAA CCAGCATTGT AGCAACTGGG AAAAGCCTTT CGCTGCGGTG CCGCCGTATC GTGACGATGA CCTCAACAAG CTTTGTCTTC AGAATGCCAC AGGCAGCGGC AAGACGCTTC TGATGCATGT CAATCTCCTC CAGTACCGCC ACTATGCTGA AAGGAGTGGA AAGGGAAGTG ATCTTTCGCG CGTGATTCTG CTTACACCGA ATGAGCGTCT TTCGGAACAG CATATCGCAG AGTTCAGGGA GAGCAGTTTG CTTGCCTCGC ATTTTGCGCA GGAGGGGCAT AACCTGTTTA CGCAGACCAC CGGGCTGAAA CGGGTCGATG TGCTTGAAAT CACCAAACTC GGCGAACAGG AAGGCCCCAA TACGATTGCT ACCCGCAGTC TTGGCGACCA GAACCTGCTT CTGGTCGATG AAGGCCATCG GGGAATGAGC GGTAAGGATG AAGGTGCCTG GTTCAGGCGG CGCTCAGAGC TTTGCGCCAA GGGATTCACC TTCGAGTATT CGGCAACATT TGAACAGGCG GTTCAGGCTT CCGGAAAAGC TGATTTTGAA AACGGGTATG CAAAAGCCGT GCTGTTCGAC TACTCCTACC GGTGGTTTTA CGAAGACGGG TTCGGCAAGG ACTATCAGAT TCTGAACATG CCCGCAACAT TCGCGCAGGT TCAGTTTGCC TACTTGACCG CCTGCTTGCT CAAGTTTTAC CAGCAGTTGC GGATTTATGA GGAGAAAACC CGCGAGTTCG AGGCCTTCAA TCTTGAGAAG CCGTTATGGG TGTTTGTCGG AAGCACCGTT TCCAAGGCGA AGGGAGGCAC CAGTGATGAA AAGGTTGTTG CTGCCGATGT CGCCCAGATT ATCTGTTTCA TCGCTGAGTT TCTTGAAAAG CAGGTCGAGA GCCAGAACGC CATCAATGCC TTGCTTACCG GCAAAGGTCA GGATACCGGC CTGTTGGACA AAGATGGGAA CGATATTTTT GCCGGAGCCT TTACCTACCT TGCCCAGGCA ATGAACGCAG GTGAGATCAT CGACGATCTT TTCCGGGATA TTCTCTCAAG GCTTTTCAAC AATGCTGCCG GGGGTACCCT GTTTCTTGAC CGGATCAAAG GCGAATCAGG TGAAGTGGCG CTGAGAGTTG GCAATGCAGA CGAGCCATTC GGCTTGATAA ATGTCGGTGA TGCCAAAAGC CTTTGTGAGC ATGTCGAGGA GGTCGCAAAG CAGAACGGCG TTCGCCTTCA GGTTGAGGAC AGTGACTTTA CCGAGGCGAT GTTCGCTTCA GTGAAGGATT CTTCTTCAAA GGTCAACCTG CTCATCGGCT CCAAGAAATT CATCGAAGGA 
TGGGATTGCT GGCGAGTCAG CACGATGGGC CTGATGCACG TCGGGAAGTC TGAAGGTGCA CAGATCATTC AGCTTTTTGG CCGTGGTGTC CGGTTGAAAG GGTATGAATG GAGCCTGAAG CGTAGTGGCC ACACTCATGC ACCGGTTAAG CCAAACTTCA TCGAGGAGCT GGAAACCCTG AATGTGTTCG GCATCGAAGC GGACTTCATG GAGAAGTTCC GCGAATTCCT GAGAGAAGAA GGACTTCCCG GCAACGAGCG TCGCAAAGTC ATTATGATTC CCCTGAATGT CACCTATGAC TTCGGTAAGA AGCTCAAAAT CCTTCGACCA AAACGGAAAG CCTCTGATGG AAAGGAATAC GACTTCAAAA AAGATGCTCC GGTTCCTGCT GTCGGTCATG TTCCGGATTA TATGATGCAC AATACCGTTG TTTCTGACTG GTATCCTCGC ATTCAAGCCA TTCGTTCACG AGGTGCGATT ACAGTAACCA GCAAAGACAA GGTTTCCTTG CGCGAACAGC ATCTGGCCCT GCTTGATTAC GACCAGCTCT TTTTTGAACT CGAACAGTTC AAGCGCGAAC GGAGCTGGTA TAACCTCAAC ATTACCAAGA AGGGAATTAC CAGCCTGCTC AGGAACCCCG GATGGTACAC GCTCTATTTG CCGGAAACCC GTCTGAATCC GACAAGCTTT GACGGCGTTC TTTTGTTGCA GCAGGTGGCC TCAGAGCTTC TGAAACGGTA TTGCGAGCAC TATTACAACT ACTGCAAGCG GGAGTACATC GAACCGCGTC TCGAACTTCG CGATCTGACA CTCGATGACG ACAATATTCC GCAGGAGGAG TTGTACCAGC TGATCGTCGA TGGCGATGAA GAGCAGGTTA TTCAGGGAAT CGAGCAGATC AAGAAGGATC TGGAGCAGAA GAAGGATGAC CTGCTGAAGG TTGGAGACCT GAACGCCTGT AATTTCGGCA AACACCTGTT CCAGCCGCTT TTTCATGTCC GTCGAGGCGG CAAAATAACC ATCCTGCCAG TAGCCCTCAA CGAAAGCGAA TATCAGTTTG TCACTGATCT GAAAGGCTGG TGCGATGAGC GCAAATCCGC TCTGGAAAAG GATGGAATGG AGCTTTTCCT GCTCAGAAAT ATGAGCCGGG GTAAAGGTGC CGGATTCTTC GAGGCTGGCA ACTTCCATCC GGATTTTATC CTCTGGTTGC TTGTTGGCGG AAAACAGTAT ATCACCTTCA TCGAACCTCA CGGCCTGCTG CACGAAGGCC CAGCCAGCGA GAAAGTACTG TTCCACAAAC GAATTAAGAG CATCGAGCAG CGCCTCAACG ACCCATCCGT GATTCTGAAC AGCTTTATCC TGTCATGGAC TCCATATCCG CAATTGAAAT GGGGAGACAC CCAGTCCGAG CTTGAGCAAC GGCATGTCCT TTTTATGACC GATGACCGTG ATGGGTATAT TGACAAGCTT TTTGCCAAGT TAATGGAGAT GGTGAAATGA
|
Protein sequence | MARGKSKGLQ SHDFRNKLLL NQWLISLFGI DPLVEHILRG QKVRPFRLLA DPIRDARLEG RDKDNLHYFY HYLGDSPLFS MAGVENDRFR ISREMLLRYE ENIVSHTRAI NENRHRPVIW KYYQWLTLLF TEIYLDRFFR DREALLSELN AFVERFNQHC SNWEKPFAAV PPYRDDDLNK LCLQNATGSG KTLLMHVNLL QYRHYAERSG KGSDLSRVIL LTPNERLSEQ HIAEFRESSL LASHFAQEGH NLFTQTTGLK RVDVLEITKL GEQEGPNTIA TRSLGDQNLL LVDEGHRGMS GKDEGAWFRR RSELCAKGFT FEYSATFEQA VQASGKADFE NGYAKAVLFD YSYRWFYEDG FGKDYQILNM PATFAQVQFA YLTACLLKFY QQLRIYEEKT REFEAFNLEK PLWVFVGSTV SKAKGGTSDE KVVAADVAQI ICFIAEFLEK QVESQNAINA LLTGKGQDTG LLDKDGNDIF AGAFTYLAQA MNAGEIIDDL FRDILSRLFN NAAGGTLFLD RIKGESGEVA LRVGNADEPF GLINVGDAKS LCEHVEEVAK QNGVRLQVED SDFTEAMFAS VKDSSSKVNL LIGSKKFIEG WDCWRVSTMG LMHVGKSEGA QIIQLFGRGV RLKGYEWSLK RSGHTHAPVK PNFIEELETL NVFGIEADFM EKFREFLREE GLPGNERRKV IMIPLNVTYD FGKKLKILRP KRKASDGKEY DFKKDAPVPA VGHVPDYMMH NTVVSDWYPR IQAIRSRGAI TVTSKDKVSL REQHLALLDY DQLFFELEQF KRERSWYNLN ITKKGITSLL RNPGWYTLYL PETRLNPTSF DGVLLLQQVA SELLKRYCEH YYNYCKREYI EPRLELRDLT LDDDNIPQEE LYQLIVDGDE EQVIQGIEQI KKDLEQKKDD LLKVGDLNAC NFGKHLFQPL FHVRRGGKIT ILPVALNESE YQFVTDLKGW CDERKSALEK DGMELFLLRN MSRGKGAGFF EAGNFHPDFI LWLLVGGKQY ITFIEPHGLL HEGPASEKVL FHKRIKSIEQ RLNDPSVILN SFILSWTPYP QLKWGDTQSE LEQRHVLFMT DDRDGYIDKL FAKLMEMVK