Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lferr_0969 |
Symbol | |
ID | 6876934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 53993 |
Kingdom | Bacteria |
Replicon accession | NC_011206 |
Strand | + |
Start bp | 940548 |
End bp | 943469 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642788847 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002219422 |
Protein GI | 198283101 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.301095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACCC CCTCCGAACA CAAAACCGTC CAATCCCGCA TCCTCCGCTA CGCCGAGGCC ATCGGTTGGA CCTTTGTGTC CCGCGAGGCA GCCGAGCAGC GGCGCGGGTT TGATCCGGAC GTGCCGCCCG CCGACCGCGC CAAGAACCGT TCGCTTTTCT TCGACGACCT GCTCGACGCC AAGCTGCGGG AATTCAACCC GCGCTACGCC GAGGCCGAGG GCGCCTTGCT CGGGCAGTTC CGCCATCTGC ACGCCGACAT CTACGGCAAC CGGGAATTCG TCGAGCACCT GCGCAACCGG GGCAAGTTCT TCGACCATGA GGAAAAGCGC GAGCGCGACC TGATCCTGAT CGATTACGAC GACCCGGCGC GCAATGTTTT TGAGGTCACC GAGGAGTGGG CCTATAACAA CGGCCACTAC GGCACACGGG AGGATGTGGT CTTTCTCATC AACGGCATCC CGGTGCTGGT GATCGAGTGC AAGAACGCCA ACAAGGATGA GGCGATTGCC CTGGGGGTGG ATCAGATTCG CCGTTACCAC CGCGAGACGC CCGAGCTGTT CGTGTCGCAG CAGCTCTTTA CTGCCACCGA CGCCATCGGT TTTTCCTACG GGGTGAGCTG GAACACGGTG CGGCGGAACA TTTTCAACTG GAAGGACGGG GAAGTCGGCA AGCTGGAGGC CAAGGTGAAG AGCTTCTGCG CCATCCCGCA GGTGCTCGCC TTCCTGAAGG ACTACATCGT CTTTGCCGAG AAGGACGAGG AACTCAACAA ATACATCCTG CGCCAGCACC AGACCGGCGC GGTGGAGGCG GCCGTCAGCC GCGCCCTCGA TCCCCAACGC ACGCGCGGCT TGGTCTGGCA CACCCAGGGC AGCGGCAAGA CCTTCACCAT GATCAAGGCG GCGGAGCGGC TGTTCCGCGC GCCCGGTGCG GACAAGCCGA CCGTGCTGCT GATGATCGAC CGCAACGAGC TGGAAGACCA GATGCTCAAG AACCTCGCCG CGCTCGGCCT GGGCAATCTG GAGCACGCCA GCAGCATCGC CCGGCTCAAC CAGCTGCTGC GCGACGACTA CCGGGGCATC ATCGTGACGA TGATCCACAA ATTCCGCGAC ATGCCGGCGG ACTTGAACAC GCGCTCGAAC ATCTACGTGC TGATCGACGA AGCCCACCGC ACCACCGGCG GCGACCTCGG CAACTTCCTG ATGGCCGGCC TGCCGAACGC GACCTTTATC GGCTTCACCG GAACACCGGT GGACAAGACA GTGTATGGCA AGGGCACCTT CAAGACCTTC GGCTGCGAGG ACGACCAGGG CTACCTGCAC AAGTATTCCA TCGCCGACAG CATCGAGGAC GGCACCACGT TGCCGCTGTA CTACCAGCTC GCCCCCAACG AAATGCTGGT GCCGCACGAA ACGCTGGACA AGGAGTTCCT GTCGCTGGCC GAAGCCGAAG GCGTGGCCGA CATCGAGGAA CTGAACAAGA TTCTCGAACG GGCGGTGAAC CTGAAGAACT TCCTCAAGGG CAAGGAGCGG ATTCAGCACG TGGCGCAATT CGTCGCGGCG CATTACCGCG AGAACGTCGA GCCGCTGGGC TATAAGGCCT TCCTCGTCGG CGTCGACCGC GAAGCCTGCG CCCATTACAA GCGCGCCCTC GACCAGTTCC TGCCGCCCGA GTATTCCGAG GTCGTCTACA CCGGCAGCAA CAACGATTCC AAGCTGCTGA AAGAATTCCA CCTCGACCCG AAGCGGGAGC GGCAGATTCG CAAGAGCTTC GGCAAGCTCG ACCAGATGCC GAAAATCCTC ATCGTCACCG AGAAACTGCT CACCGGCTTC GACGCGCCGG TGCTCTACGC CATGTATCTC GACAAACCGA TGCGCGACCA CACGCTGCTG CAGGCCATCG CCCGGGTGAA TCGCCCCTAC GAGAACGAAG TGCAGGAGAT GGTGAAGCCC CATGGCTTCG TGCTGGATTT TGTCGGCATC TTCGACAAGC TCGAAAAGGC GCTGGCCTTC GACAGCGACG AGATCAACGC CATCGTCAAG GACCTGAAGC TGCTGAAGGT GCTGTTCAAG AACAAGATGG AGTCCAAGGC CCCGGACTAC CTCGGCCTGA TCGAGCGCAA CTTCAACGAC AAGGACGTCG ATACCCTCAT CGAGCACTTC CGCGATCCCG AGCGGCGCAA GGAGTTCTTC AAGGAATACA AAGAGATCGA GATGCTCTAC GAGATCATCT CGCCCGATGC CTTCCTGCGC CCATTCATTG CAGACTATGG CACCTTATCG GCGATCTACC AGGTCGTCCG CAAGGCCTAC ACCCGAACCG TGATGGTTGA CCGCGAGTTT CAGGCCAAGA CCAACCATCT GGTGCGGGAG CAGGTGGGCA GTTATGGCGT AGGTGGTTTG GGTGAGATTG TCGCGATCGA CGGCAATACC ATCGAGCTGA TCAAGAACAA GCGCGGCGGG GATGGCACCA AAGTCATCAA CCTGATCAAA AGCATTGAAA GGCTTGCCGA GGAGGGCAGC GATGACCCTT ACCTCATCGC CATGGCGGAG CGCGCGAGGG CTGTGCAGGA GAGCTTCGAG AGCCGCCAAA CCAGCACTGC CGAGGCGCTG GCCGAATTGC TGCGTGAAGT GGAAGGCAAC GAAACGCGGA AGAAGGAACA GGCCGAAAAG TCCTTCGATG GCCTGACCTA TTTTGTCTAC CGCAGCCTGC TCGATGCGAA GATTCAGAAC GCCGAGGCGG TTAGCCGGAA GATTCGCCAC GCATTCACCG AGTTTCCGAA CTGGAAGCGC AGCGAAAACG CCCTGCGCGA ACTCCGCAAG AAAGTGACCT TTGCCCTCTT TGCCGAAACG GAGGACCTTG ACCGGGTGAC GGCGATGGTG GATGAGTTGT TTACCCTGCT GGAAAAGGCG GATCGGATTT GA
|
Protein sequence | MPTPSEHKTV QSRILRYAEA IGWTFVSREA AEQRRGFDPD VPPADRAKNR SLFFDDLLDA KLREFNPRYA EAEGALLGQF RHLHADIYGN REFVEHLRNR GKFFDHEEKR ERDLILIDYD DPARNVFEVT EEWAYNNGHY GTREDVVFLI NGIPVLVIEC KNANKDEAIA LGVDQIRRYH RETPELFVSQ QLFTATDAIG FSYGVSWNTV RRNIFNWKDG EVGKLEAKVK SFCAIPQVLA FLKDYIVFAE KDEELNKYIL RQHQTGAVEA AVSRALDPQR TRGLVWHTQG SGKTFTMIKA AERLFRAPGA DKPTVLLMID RNELEDQMLK NLAALGLGNL EHASSIARLN QLLRDDYRGI IVTMIHKFRD MPADLNTRSN IYVLIDEAHR TTGGDLGNFL MAGLPNATFI GFTGTPVDKT VYGKGTFKTF GCEDDQGYLH KYSIADSIED GTTLPLYYQL APNEMLVPHE TLDKEFLSLA EAEGVADIEE LNKILERAVN LKNFLKGKER IQHVAQFVAA HYRENVEPLG YKAFLVGVDR EACAHYKRAL DQFLPPEYSE VVYTGSNNDS KLLKEFHLDP KRERQIRKSF GKLDQMPKIL IVTEKLLTGF DAPVLYAMYL DKPMRDHTLL QAIARVNRPY ENEVQEMVKP HGFVLDFVGI FDKLEKALAF DSDEINAIVK DLKLLKVLFK NKMESKAPDY LGLIERNFND KDVDTLIEHF RDPERRKEFF KEYKEIEMLY EIISPDAFLR PFIADYGTLS AIYQVVRKAY TRTVMVDREF QAKTNHLVRE QVGSYGVGGL GEIVAIDGNT IELIKNKRGG DGTKVINLIK SIERLAEEGS DDPYLIAMAE RARAVQESFE SRQTSTAEAL AELLREVEGN ETRKKEQAEK SFDGLTYFVY RSLLDAKIQN AEAVSRKIRH AFTEFPNWKR SENALRELRK KVTFALFAET EDLDRVTAMV DELFTLLEKA DRI
|
| |