Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2543 |
Symbol | |
ID | 6376241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2714988 |
End bp | 2718200 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642685021 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_001960918 |
Protein GI | 189501448 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.111804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATT TTCAGATAAA CGAAAAACAT CTGTCCCAGA TTCCTGCTTT GCAGCTGCTG ATCGGGCTTG GCTATGAATA CCTGACTCCG GGCGAAGCAC TCCTTGAACG GCAGCAGAGG ATGTCCAATG TGCTGCTGGA AAATGTTCTG CGAAATCAAC TGAAGGCGAT TAACCGAATC AACTTCAAGG GTGGCGTGTA TCTGTTCAGC GAGGAGAATA TTCAGTCGGC CATCCAGAAA CTCAAGAATA TCAAGTACGA CGGTTTGCAG AAAACCAACG AGGCTGTCTA CGATCTCATC ACCCTCGGGA CAGCCATGGA GCAGACTATC GAGGGTGACT CGAAGAGTTT TACTCTGAAT TATGTTGATT GGCGAAATCC CGAGAGGAAC GCATTTCATG TGACGGTTGA ATACAGCGTC GAGCGGTCCC GGAGCACGGA AACGGTCCGG CCGGATATCG TTCTCTTCGT AAATGGCATT CCTTTCTGTG TTATCGAGTG TAAGTCCCCG CAGATCGAAG TCGATCAGGC TGTTTCCCAG TCGATCAGGA ACCAGAACGA CGCCTATATC CCGAAGCTGT TTGTCTATAC GCAGTTGCTA CTGGCGCTCA ACAAGAACAG AGCCATGTAC GCGACAACAG GAACGGCTGT CAGGTTCTGG AGCGTCTGGA AAGAGCCGGG GTTAACCACG GATGATGAAA CCACGAAAGA CACGAAAAGC ACGAAACTGA AAGAGCTGGA GAGGTTGATT CAGGAATCGT TGGACTATCG GGTTGACCCT GATGATTTTT CTTTGTCTTC CGTGCTTTCC GTGGCTCAAA ACCTAAGAAT AGTTACGGAG CAGGACAAGT CTCTCTATGC TCTTTGCCGC CCGGAAAGAC TTTTGGAACT GGCATGGAAG TTCACAGTTT TTGACGGAGG ATTGAAAAAG ATCGCCCGGT ATCAGCAGTA CTTCGTCATC AAGTCCACGC TGAACCGTGT CAAACAGTTC GCCTACAGCG GTTCCCGCAA GGGCGGGGTG ATCTGGCATA CGCAAGGGTC GGGAAAATCC CTGACCATGG TTATGCTGAC GCGCAATCTG GCCCTCGATC CGGAAATATC GAATCCTCGT ATCGTTCTCG TTACTGACAG GGATGATCTG GACAAGCAGT TGGGAAATAC TTTCGCGGCC TGCGGCCTCG AAGCCAATCG GGCGACTTCC GGCAGGAATC TGCTCGAACT GGTGGCGGAA AAGCAGTCAG GAATTGTCAC GACGCTCATT CATAAATTCG ACAAGGCCTA TGCCGTCAAC AAGTATCGGG ACGAGTCTTC CGATATCTTC GTGCTCGTTG ACGAGAGTCA CCGTACTCAG TTCGGCTCCT TTGCCGCCCG TATGCGCCAG ATGTTTCCCA ATGCCTGCTA TCTTGGTTTC ACGGGTACCC CGCTTCTGAA AAAAGAGAAA AACAATTTTA TCAGATTCGG AGAACTGGTA GAACCTCATT ACTCTATTTC GCAGGCGGTG GAAGACGGCG CGGTCGTTCC GCTTTTATAC GAGGGGCGGC ATGTTGAGAT GACCCAGAAC AGGGCGGCTG TGGATCTGTG GTTCGAGCGT CATACCCAAG GGTTGAGCAA GGAGCAGCAG GCGGATCTGA AACGCAAATA TGCTCGTGCT GAAATGCTGA ACAAGGCAGA TCAGGTCATT TACATGAGGG CCTTCGATAT CAGTGAACAT TTCCGGGCCA ATTGGCAGGG AACCGGCTTC AAGGCACAGC TTGTTGCGCC CGGCAAGGTT TCCGCGTTGA AATACCAGGA GTATTTGAAT GATATCGGCA TGGTCGGTTC CGAGGTCGTC ATTTCCCCTC CTGATATGCG TGAAGGTTTC GAGGAAACGG ACGATGAATC CAACGATGAG GTGGTGAAGT TCTGGCAGAA GATGATGAAG CGGTATGGAA GCGAAGAAGA GTACACCAAG CAACTCATCA ATCAGTTCAA ACACGGCAGC GATCCTGAAA TCCTGATTGT CGTCGACAAA CTACTGACCG GTTTTGACTC CCCCCGAAAT GCCGTACTCT ATCTATGCCG TGTTCTCAGG GAACATACTT TATTGCAGGC CATTGCACGG GTCAACCGGA TTCATGAAGG TAAGGAGTTC GGCTTTATCG TCGATTATGC AAATGTACTT GGCGAACTTG ATAAGGCCCT AACGATGTAT AGCGCGTTTG AAGGGTTCGA CGAATCGGAT CTTGTCGGCA CATTGACCTC GATCAACAGC GAGATTGAAA AACTGCCAGA CCGTCATTCG GATTTGTGGG ACCTCTTCAA GACCGTCAAA CATTCATACG ATGAAGAGGC GTATGAGCTT TTGCTTGCCG ATGACCAACT CAGGGAAGAG TTTTATACCC GTCTTGCCGA ATTTGCCAAA ACCCTCGCTA TCGCGCTCTC ATCCGAAAAG TTTTTAACTG AAACGGAGGA TAAAACGCTT TCCAGATACA AGTCGGATCT CCGGAAGTTC CAGTCTCTGA AGGTCGCCGT AAAACTCCGC TATGCTGAAT CTATCGATTA CCGCGATTAT GAGCCGAAAA TCAAAAAATT GCTCGATACC CATATTCAGG CAGACGAGGT TATCCAGCTT AACGAGCCGG TGAACATCTT TGATGAGCAT ATGTTCAACC AGGTCAAGGA AGAGCAGGGT GTTTATCAGG ATAAAAAAAC GACGGGCTCA AAAGCAGATA CGATTGCTCA TGCAACGAAA AAAGCGATCA CGGAAAAGAT GGATGAAGAT CCTGCGTTTT ATCAAAAGTT CTCGCTGCTC ATCCAGCAGG CAATCGAGGA TTTTCGGGCA AAACGGATAT CTGATCTGGA GTATCTGCAG CAGGTTGTCG ATATCCGGAA CAAGGTTGTC GGCAAGGTAC ACGATGATGT CCCGGACATC CTTTCAGGTA ATGAGGATGC CATGGCCTGC TATGGTGTGC TGAAACCCTT CTTCGAAGAT CATGCTCCGG GTCTTGAACC CGTCGTTGCA GATACGGCAA TTGCCATCTA TGACATTCTG GAAAAACACA AGAAAGTTCA TTTCTGGGAT GATGAAGATG CCCAGAAACA GGCGATCAAT GAGATTGAAG ATTACCTCTA TGACGAGCTG AAAACGGAAA AAGGTATTGC ATTGTCTCTC GATCAGATGG ATGAGATCAT TGAAAAGGTC ATGCAGGTTG CCAAACACAG GAGTTACACA TGA
|
Protein sequence | MSDFQINEKH LSQIPALQLL IGLGYEYLTP GEALLERQQR MSNVLLENVL RNQLKAINRI NFKGGVYLFS EENIQSAIQK LKNIKYDGLQ KTNEAVYDLI TLGTAMEQTI EGDSKSFTLN YVDWRNPERN AFHVTVEYSV ERSRSTETVR PDIVLFVNGI PFCVIECKSP QIEVDQAVSQ SIRNQNDAYI PKLFVYTQLL LALNKNRAMY ATTGTAVRFW SVWKEPGLTT DDETTKDTKS TKLKELERLI QESLDYRVDP DDFSLSSVLS VAQNLRIVTE QDKSLYALCR PERLLELAWK FTVFDGGLKK IARYQQYFVI KSTLNRVKQF AYSGSRKGGV IWHTQGSGKS LTMVMLTRNL ALDPEISNPR IVLVTDRDDL DKQLGNTFAA CGLEANRATS GRNLLELVAE KQSGIVTTLI HKFDKAYAVN KYRDESSDIF VLVDESHRTQ FGSFAARMRQ MFPNACYLGF TGTPLLKKEK NNFIRFGELV EPHYSISQAV EDGAVVPLLY EGRHVEMTQN RAAVDLWFER HTQGLSKEQQ ADLKRKYARA EMLNKADQVI YMRAFDISEH FRANWQGTGF KAQLVAPGKV SALKYQEYLN DIGMVGSEVV ISPPDMREGF EETDDESNDE VVKFWQKMMK RYGSEEEYTK QLINQFKHGS DPEILIVVDK LLTGFDSPRN AVLYLCRVLR EHTLLQAIAR VNRIHEGKEF GFIVDYANVL GELDKALTMY SAFEGFDESD LVGTLTSINS EIEKLPDRHS DLWDLFKTVK HSYDEEAYEL LLADDQLREE FYTRLAEFAK TLAIALSSEK FLTETEDKTL SRYKSDLRKF QSLKVAVKLR YAESIDYRDY EPKIKKLLDT HIQADEVIQL NEPVNIFDEH MFNQVKEEQG VYQDKKTTGS KADTIAHATK KAITEKMDED PAFYQKFSLL IQQAIEDFRA KRISDLEYLQ QVVDIRNKVV GKVHDDVPDI LSGNEDAMAC YGVLKPFFED HAPGLEPVVA DTAIAIYDIL EKHKKVHFWD DEDAQKQAIN EIEDYLYDEL KTEKGIALSL DQMDEIIEKV MQVAKHRSYT
|
| |