Gene Information |
Locus tag | Lferr_2012 |
Symbol | |
ID | 6878003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 53993 |
Kingdom | Bacteria |
Replicon accession | NC_011206 |
Strand | - |
Start bp | 2012555 |
End bp | 2015653 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642789884 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002220436 |
Protein GI | 198284115 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
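The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of the database.

```python
# Hypothetical helpers, not part of the database.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2012555, 2015653)
print(length, protein_length_aa(length))  # 3099 1032
```

Both values agree with the Gene Length and Protein Length rows of the record.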
Plasmid Coverage information |
Number of covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Number of covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCGG AAGGCCGCAC CCGGAAACAA ATCGACGCCC CAGACACCTA CAGTCGGCCC GCTGATAGTC CGAGTTTCGC CGTCCGCGAG GACACCATCG AATACGGCTT CATCGGCACC CTGCAAAACC TCAAGTACGA ATATCGCCCC GACATCCGCG ACCGCGCCGC GCTGGAGGCC AATTTCCGCC AGCACTTCGA AGCGCTCAAC CGCGTCCGCC TCACCGATGC CGAGTTCGCC CGCCTGCTGG ATGAGATCGT CACCCCGGAT GTTTTTACCG CCGCCAAGAC CCTGCGCAGC ATCAATGCCT TCACCCGCGA CGACGGCACG CCGCTGAACT ACAGCCTCGT CAACCTCAAG GACTGGTGCA AAAACACCTT CGAGGTCATC CACCAGTTGC GCATCAATAC CGACTACAGC CACCACCGCT ACGACGTCAT CCTGCTCATC AACGGTGTGC CCTGCGTGCA GATCGAGCTG AAAACCCTCG GCGTCCATCC GCGCCGGGCC ATGGAGCAGA TCGTCGAATA CAAGCACGAC CCCGGCAACG GCTACACCCG GACGCTGCTC TGCTTCATGC AGCTCTTCAT CGTCAGCAAC CGCGACCAGA CTTACTACTT CGCCAACAAC AACGCCCGCC ATTTCGCCTT CAACGCCGAC GAGCGCTTTT TGCCCGTCTA TGAGTTCGCG GACGAGGACA ATAGCAAGAT CAGACAGCTC GACGCCTTCG CCGAGCGCTT CCTGAAAAAG TGTAACCTCG GCCAGACCCT CAGCCGCTAC ATGGTGCTGC TGGCGGGCGA GCAAAAGCTC ATGATGATGC GACCCTATCA GGTCTATGCC GTGCAGCACA TGGTCAAGTG CATCGATGAA GACAACGGCA ACGGCTACAT CTGGCACACC ACCGGCAGCG GCAAGACGCT CACCTCCTTC AAGGCCGCCA CCCTGCTCAA GGAGAACGAG CACATCCATA AATGCGTGTT CGTTGTCGAC CGCAAGGACC TCGACCGCCA GACGCGGGAG GAATTCAACC GCTTCCAGGA AGGCTGTGTC GAAGAAAACA CCCACACCGG CGCCCTCGTG CGCCGCCTGC TGTCCGAGGA CTACGCCGAC AAGGTCATCG TCACCACCAT CCAGAAGCTC GGCCTCGCGC TCGACGAAAC GAGCCGCCGC AACAAGCAAC GCAGCAAAAA CGGCCAGGCC ACCTACAAGG AGCAGCTCGA AGCACTGCAG GACGAGCGCA TCGTCTTCAT CTTCGACGAA TGCCACCGCT CGCAGTTCGG TGAGAACCAC AAGGCCATCA AAGCCTTCTT CCCCCGCGCC CAACTCTTCG GCTTCACCGG CACGCCCATC TTCGAGGCCA ATGCCAGCCT GCAGAAGATC GAGGACAGCA CGGCCTCCAT GCGCACCACG GCAGACCTCT TCCAGAAACA GCTGCACGCC TACACCATCA CCCACGCCAT CGAAGACGGC AATGTGCTGC GCTTCCATGT CGATTACTTC AAGCCGGAAG GAAAGAACCC ACCCAAACCC GGCGAACCCA TCGCCAAGCG CGCGGTCATC GAAGCCATCC TCGCCAAGCA CGATGCCGCC ACCGGCGGGC GCCGCTTCAA CGCCCTCTTT GCCACCGCAT CCATCAACGA TGCCATCGAA TACCACGCGC TGTTCAAGAC CATGCAAGCC GAGAAACTGG CCGCCGATCC CGAGTTCAAG CCGCTGAATA TCGCCTGCGT CTTCTCTCCG CCTGCGCAGC TCGCGGAGAA TCCCGAAAGC AAAAAGGACA TCGACCAGCT CTCCGAAGAC 
CTCCCGCAAG AGCAGGAAGA CAACAAGGTC GAGCCAGAAG CGAAGAAGCA GGCGCTTGAA GGCATCCTCG CCGACTACAA CGCCCGCTAC GGCACCAACC ATCGCCTGAG CGAATTCGAT CTTTACTACC AGGATGTGCA AAAGCGGATC AAAGACCAGC AGTGGCCGAA TGCCGACTAC CCCCCGGCGC AGAAAATCGA CATCACTATC GTCGTGGACA TGCTGCTCAC CGGCTTCGAT TCCAAGTTCC TGAACACGCT CTACGTGGAC AAGAACCTCA AGCACCACGG CCTGATCCAG GCCTTCTCGC GCACCAACCG CGTGCTCAAC GCCACCAAGC CCTACGGCAA CATCCTCGAC TTCCGCCAGC AGCAGGATGC CGTCGATGCC GCCATCGCGC TGTTCTCCGG CGAAAAGACC GGCGAGCAGG CGCGCGAAAT CTGGCTGGTG GAAAAGGCGC CCGTGGTCAT CCAGAAACTG GAGGCCGCCG TGCAAAAGCT CGACGCCTTC ATGCAGTCCC AGGGGCTGGA TTGCACGCCC TCCACCGTGG CCAACCTGAA AGGCGATGCC GCCAAGACCG TTTTCATCGA GCGCTTCAAG GAAGTGCAGC GGCTCAAGAC CCAGCTCGAC CAATACACCG ACCTCACCGG GGAAAACAAA GCCGCCATCG AGCAGGTGCT GCCCGAAGAA ACCCTGCGCG GCTTCAAGGG CCAGTATCTG GACACCGCCA AAAAGCTCCG CGACGGGCGC AACAAGCCGG ACAAGCCCGG CGCCGACAAG CCTGCCGATC AGCTCGACTT CGAGTTCGTC CTCTTCGCCT CCGCCGTCAT TGATTACGAC TACATCATGA AGCTGATGAC CAGCTTCTCG GCCAAAGAGC CGGGCAAGGC CAAGATGACC CGCGAACAAC TCATCGGCCT CATCAGCAGT GACGCCAAGT TCATCAACGA GCGCGACGAC ATCGCCGAAT ATATCGGCAC GCTCCAGGCC GGCGAGGGGC TGAGTGAAAC CGCCATCCGC GACGGCTACA CCCGCTTCAA GGCGGAGAAG AACGCCCAGG AACTCGCCAC CATTGCCGCA AAGCACAATC TGGCCACCGC CGCCCTGCAA AGCTTCGTGG ACGGCATTTT TGAGCGCATG ATCTTCGACG GCGAACGCTT GAGCGACCTC ATGGCCCCGC TCGATCTGGG CTGGAAAGCC CGCAGCCAGG CCGAAATCGC GCTGATGGAA GATTTGTATC CGCTGCTGAC CCAACGCGCC GGGGGCCGCG ATATTTCAGG GCTTAGCGCC TATGAGTGA
|
Protein sequence | MKPEGRTRKQ IDAPDTYSRP ADSPSFAVRE DTIEYGFIGT LQNLKYEYRP DIRDRAALEA NFRQHFEALN RVRLTDAEFA RLLDEIVTPD VFTAAKTLRS INAFTRDDGT PLNYSLVNLK DWCKNTFEVI HQLRINTDYS HHRYDVILLI NGVPCVQIEL KTLGVHPRRA MEQIVEYKHD PGNGYTRTLL CFMQLFIVSN RDQTYYFANN NARHFAFNAD ERFLPVYEFA DEDNSKIRQL DAFAERFLKK CNLGQTLSRY MVLLAGEQKL MMMRPYQVYA VQHMVKCIDE DNGNGYIWHT TGSGKTLTSF KAATLLKENE HIHKCVFVVD RKDLDRQTRE EFNRFQEGCV EENTHTGALV RRLLSEDYAD KVIVTTIQKL GLALDETSRR NKQRSKNGQA TYKEQLEALQ DERIVFIFDE CHRSQFGENH KAIKAFFPRA QLFGFTGTPI FEANASLQKI EDSTASMRTT ADLFQKQLHA YTITHAIEDG NVLRFHVDYF KPEGKNPPKP GEPIAKRAVI EAILAKHDAA TGGRRFNALF ATASINDAIE YHALFKTMQA EKLAADPEFK PLNIACVFSP PAQLAENPES KKDIDQLSED LPQEQEDNKV EPEAKKQALE GILADYNARY GTNHRLSEFD LYYQDVQKRI KDQQWPNADY PPAQKIDITI VVDMLLTGFD SKFLNTLYVD KNLKHHGLIQ AFSRTNRVLN ATKPYGNILD FRQQQDAVDA AIALFSGEKT GEQAREIWLV EKAPVVIQKL EAAVQKLDAF MQSQGLDCTP STVANLKGDA AKTVFIERFK EVQRLKTQLD QYTDLTGENK AAIEQVLPEE TLRGFKGQYL DTAKKLRDGR NKPDKPGADK PADQLDFEFV LFASAVIDYD YIMKLMTSFS AKEPGKAKMT REQLIGLISS DAKFINERDD IAEYIGTLQA GEGLSETAIR DGYTRFKAEK NAQELATIAA KHNLATAALQ SFVDGIFERM IFDGERLSDL MAPLDLGWKA RSQAEIALME DLYPLLTQRA GGRDISGLSA YE
|
| |
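The gene and protein sequences above are related through translation table 11, whose amino acid assignments match the standard genetic code. The following is a minimal sketch of how the space-grouped nucleotide text could be cleaned, translated, and summarized by GC fraction. All function names are hypothetical, and only the first 30 bases of the record are used as input so the block stays short.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third position
# varies fastest). Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGAAACCGG AAGGCCGCAC CCGGAAACAA"
print(translate(prefix))  # MKPEGRTRKQ
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.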