Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_1393 |
Symbol | |
ID | 8323475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | - |
Start bp | 1449743 |
End bp | 1452724 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644952524 |
Product | type III restriction protein res subunit |
Protein accession | YP_003109990 |
Protein GI | 256372166 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGC ACTTTGAACC GAACCTCGAC TACCAGTGGC AGGCCATAGA GGCTGTGTGC GATCTGTTCC GCGGGCAGGA GATCTGCCGC ACCGAGTTCA CGGTCACGCG CGATCCCGCG GACACCCAGA TGCGCCTCGG CGTTGCGCAG AACGAGCTTG GCATCGGCAA CCGTCTGATG TTGCTCGACG ACGAGCTGCT CAAGAACCTG AACGACATCC AACTCCGCAA CGGCCTGCCG CCCTCGGAAT CACTCGCATC AGGTGACTTC ACTGTGGAGA TGGAGACCGG CACCGGCAAG ACCTATGTGT ATCTGCGCAC CATTTTCGAG CTGAACAAAC GCTACGGCTT TACCAAGTTC GTCATCGTGG TGCCGTCGAT TGCCATCAAG GAGGGTGTGT ACAAGACGCT TCAGATCACC GAAGACCATT TCAAGAGCCT GTACTCCGGC GTGCCTTTCG ACTATTTCCT CTACGACTCC TCCAAACTCG GGCAGGTGCG CAACTTCGCC ACCAGCGCGA ACATCCAGAT CATGGTGATG ACCGTCGGGG CCATCAACAA GAAAGACGTC AACAACCTCT ACAAGGAAAG CGAGAAGACA GGCGGTGAAA AGCCCATCGA CCTGATCAAG GCCACCCGGC CCATTGTGAT CGTCGATGAG CCTCAGAGCG TGGACGGCGG CCTTCAGGGG CGCGGCAAGG AAGCGCTCGA CGCCATGAAT CCGCTTTGCA CCCTGCGTTA CTCGGCCACC CACGTGGATA AGCACCACAT GGTCTACCGG CTGGACGCTG TGGACGCCTA CGAGAAGCGG CTGGTCAAGC AGATCGAGGT GGCCTCGGCC ACGGTCGAAC ATGCCCACAA CAAGCCCTAC GTGCGCCTGG TTTCGGTGTC GAACAAGCGC GGCATCATTT CAGCGAAGGT CGAGCTGGAC GTGAAGACCG ACAGCGGTGG GGTCAAGCGG CGGGAGGTGA AGGTTCAGGA CGGCGACGAC CTGGAAGAGA TCGCCGGCCG CGCGATCTAC GCCGGCTACC GTATCGGCGA AATCCGGGTC GAAAAGGGAA ATGAGTACAT GGAGCTGCGC GTCCCCGGCG GCGAGCACTT CCTCAAGCCG GGGCAGGCCT GGGGAGACGT GGACATGCTG GCCGTTCAGC GCCAGCTGAT TCGCCGAACC ATTCGCGAGC ACCTGGACAA GGAGATGCGC CTGCTACCAA AGGGCATCAA GGTGCTGTCG CTCTTTTTTA TCGACGAAGT CGCCAAGTAT CGCCAGTATG ACGCGGCCGG GAACCCCGTG AAAGGTGATT ACGCCCGCAT CTTCGAGGAA GAGTACCGAC GAGCTGCCAA CCTGCCAGAG TATGAACCGC TCTTCAAAGG GGTGGATGTG AGCCGCGAAG CCGAGAAGGT TCACAACGGC TACTTTTCCA TCGACAAGAA AGGCGGCTGG ACGGATACTG CCGAGAACAA CGAGGCGGGC CGCGACAACG CCGAGCGGGC CTACAACCTC ATCATGAAGG ACAAGGAAAA GCTCCTCTCC CTCGAGACGC CGATCAAGTT TATTTTCTCG CACTCGGCGC TGCGCGAAGG CTGGGACAAC CCCAACGTCT TTCAAATCTG CACCCTGCGA GACATCCGTA CGGAGCGCGA ACGGCGGCAA ACCATCGGGC GCGGCTTGCG TTTGTGCGTG AATCAGTACG GCGAGCGTGT GCGGGGCTTC GATGTCAATA CACTGACCGT GATCGCTACG GAGAGCTACG AGGACTTCGC CGAGAACTTG CAGAAGGAAA TCGAGGAGGA TACCGGCATC CGGTTCGGCA TTGTCGAGGC GGACCAGTTC GCTCACATCG TGGTCACCGA TGCGAATGGC AAAGCCTCGC CGCTCGGCAT CGAACGGTCG AAGGCGTTGT GGGAGTCCTT CAAGGCCTAT GGCTACATCG ACGCCAAAGG ACGGGTGCAG GATTCACTCA GAAGGGCACT CAGGGACGGC ACCGTGGAAG TGCCCGAGGA GTTCGCCGCG CAGCGTGACC AAATCACGGA TGTCCTCAAG AAGGCGGCCG GGCGGCTTGA GATCAAGAAC GCCGACGAGC GTCGGCAGGT GCGTGTTCGC GGGGAGGTCC ATATCAGTGA TGAGTTCAAG GCGCTGTGGG ACCGCATCAA GGACAAAACC ACATACCGGG TGAAATTCGA CAACGAAGCG CTCGTGGAGG CGTGCATCAA AGCGCTCAAA GAAATGCCGG CGATCCCGAA GGTGAGCCTC CAGTGGCGAA AGGCGGATAT CGCCATCGGC AAGGCCGGCG TGGTAGCCGC CGAGCGCGAG CGTGGGGCGC CCGTAGGGCT CGAGGAAACG GACATCGATC TGCCGGACCT GCTTACCGAG CTTCAAGACC GGACCCAACT CACGCGCCGC ACGATTTACC GCGTCCTGAC GGAAAGTAGG CGTCTGGCCG ACTTCAAGCG CAATCCCCAG CGGTTCATCG AGCTGGCGGC AGAGACTATT AACCGCTGCA AGCGGAAGGC GATGGTTGAT GGCATCAAAT ACCAGCGCCT GGGTGACGAG CACTACTACG CGCGGGAGCT GTTCGAAAGT GAGGAGCTGA CCGGCTACCT GAAGAATCTC TTCGAGGCCA AGAAGTCGGT CTATGAGCAG GTGGTCTACG ACTCCGACAC CGAGCTAACG TTTGCCCAGC AGCTCGAGTA CAACCGTGAT ATCAAGGTCT ACGCAAAACT CCCGGGCTGG TTCACCGTGC CGACGCCGCT GGGCGGATAC AACCCTGACT GGGCGGTGGT CGTCGAGCAG GAAGGCGAGG AGCGCCTCTA CTTTTACTTT GTCGTTGAGA CCAAGGGCAG TGTGGTCGCC GACGACTTGC GTGGCAAGGA AAAGGCGAAG ATTGACTGCG GCAAGGCGCA TTTCGAGGCG CTCAAGGATC GCGAGAACCC GGCACAGTAC AAAGTAGCGA GTTCGGTTGA TGATCTATTT ATCCGAGAGT GA
|
Protein sequence | MKLHFEPNLD YQWQAIEAVC DLFRGQEICR TEFTVTRDPA DTQMRLGVAQ NELGIGNRLM LLDDELLKNL NDIQLRNGLP PSESLASGDF TVEMETGTGK TYVYLRTIFE LNKRYGFTKF VIVVPSIAIK EGVYKTLQIT EDHFKSLYSG VPFDYFLYDS SKLGQVRNFA TSANIQIMVM TVGAINKKDV NNLYKESEKT GGEKPIDLIK ATRPIVIVDE PQSVDGGLQG RGKEALDAMN PLCTLRYSAT HVDKHHMVYR LDAVDAYEKR LVKQIEVASA TVEHAHNKPY VRLVSVSNKR GIISAKVELD VKTDSGGVKR REVKVQDGDD LEEIAGRAIY AGYRIGEIRV EKGNEYMELR VPGGEHFLKP GQAWGDVDML AVQRQLIRRT IREHLDKEMR LLPKGIKVLS LFFIDEVAKY RQYDAAGNPV KGDYARIFEE EYRRAANLPE YEPLFKGVDV SREAEKVHNG YFSIDKKGGW TDTAENNEAG RDNAERAYNL IMKDKEKLLS LETPIKFIFS HSALREGWDN PNVFQICTLR DIRTERERRQ TIGRGLRLCV NQYGERVRGF DVNTLTVIAT ESYEDFAENL QKEIEEDTGI RFGIVEADQF AHIVVTDANG KASPLGIERS KALWESFKAY GYIDAKGRVQ DSLRRALRDG TVEVPEEFAA QRDQITDVLK KAAGRLEIKN ADERRQVRVR GEVHISDEFK ALWDRIKDKT TYRVKFDNEA LVEACIKALK EMPAIPKVSL QWRKADIAIG KAGVVAAERE RGAPVGLEET DIDLPDLLTE LQDRTQLTRR TIYRVLTESR RLADFKRNPQ RFIELAAETI NRCKRKAMVD GIKYQRLGDE HYYARELFES EELTGYLKNL FEAKKSVYEQ VVYDSDTELT FAQQLEYNRD IKVYAKLPGW FTVPTPLGGY NPDWAVVVEQ EGEERLYFYF VVETKGSVVA DDLRGKEKAK IDCGKAHFEA LKDRENPAQY KVASSVDDLF IRE
|
| |