Gene Afer_1393 details


Gene Information

Locus tag: Afer_1393
Symbol:
ID: 8323475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 1449743
End bp: 1452724
Gene Length: 2982 bp
Protein Length: 993 aa
Translation table: 11
GC content: 60%
IMG OID: 644952524
Product: type III restriction protein res subunit
Protein accession: YP_003109990
Protein GI: 256372166
COG category: [V] Defense mechanisms
COG ID: [COG3587] Restriction endonuclease
TIGRFAM ID:
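
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1449743
end_bp = 1452724

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2982 0 993
```

Under these assumptions the result agrees with the listed Gene Length (2982 bp) and Protein Length (993 aa).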


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTGC ACTTTGAACC GAACCTCGAC TACCAGTGGC AGGCCATAGA GGCTGTGTGC 
GATCTGTTCC GCGGGCAGGA GATCTGCCGC ACCGAGTTCA CGGTCACGCG CGATCCCGCG
GACACCCAGA TGCGCCTCGG CGTTGCGCAG AACGAGCTTG GCATCGGCAA CCGTCTGATG
TTGCTCGACG ACGAGCTGCT CAAGAACCTG AACGACATCC AACTCCGCAA CGGCCTGCCG
CCCTCGGAAT CACTCGCATC AGGTGACTTC ACTGTGGAGA TGGAGACCGG CACCGGCAAG
ACCTATGTGT ATCTGCGCAC CATTTTCGAG CTGAACAAAC GCTACGGCTT TACCAAGTTC
GTCATCGTGG TGCCGTCGAT TGCCATCAAG GAGGGTGTGT ACAAGACGCT TCAGATCACC
GAAGACCATT TCAAGAGCCT GTACTCCGGC GTGCCTTTCG ACTATTTCCT CTACGACTCC
TCCAAACTCG GGCAGGTGCG CAACTTCGCC ACCAGCGCGA ACATCCAGAT CATGGTGATG
ACCGTCGGGG CCATCAACAA GAAAGACGTC AACAACCTCT ACAAGGAAAG CGAGAAGACA
GGCGGTGAAA AGCCCATCGA CCTGATCAAG GCCACCCGGC CCATTGTGAT CGTCGATGAG
CCTCAGAGCG TGGACGGCGG CCTTCAGGGG CGCGGCAAGG AAGCGCTCGA CGCCATGAAT
CCGCTTTGCA CCCTGCGTTA CTCGGCCACC CACGTGGATA AGCACCACAT GGTCTACCGG
CTGGACGCTG TGGACGCCTA CGAGAAGCGG CTGGTCAAGC AGATCGAGGT GGCCTCGGCC
ACGGTCGAAC ATGCCCACAA CAAGCCCTAC GTGCGCCTGG TTTCGGTGTC GAACAAGCGC
GGCATCATTT CAGCGAAGGT CGAGCTGGAC GTGAAGACCG ACAGCGGTGG GGTCAAGCGG
CGGGAGGTGA AGGTTCAGGA CGGCGACGAC CTGGAAGAGA TCGCCGGCCG CGCGATCTAC
GCCGGCTACC GTATCGGCGA AATCCGGGTC GAAAAGGGAA ATGAGTACAT GGAGCTGCGC
GTCCCCGGCG GCGAGCACTT CCTCAAGCCG GGGCAGGCCT GGGGAGACGT GGACATGCTG
GCCGTTCAGC GCCAGCTGAT TCGCCGAACC ATTCGCGAGC ACCTGGACAA GGAGATGCGC
CTGCTACCAA AGGGCATCAA GGTGCTGTCG CTCTTTTTTA TCGACGAAGT CGCCAAGTAT
CGCCAGTATG ACGCGGCCGG GAACCCCGTG AAAGGTGATT ACGCCCGCAT CTTCGAGGAA
GAGTACCGAC GAGCTGCCAA CCTGCCAGAG TATGAACCGC TCTTCAAAGG GGTGGATGTG
AGCCGCGAAG CCGAGAAGGT TCACAACGGC TACTTTTCCA TCGACAAGAA AGGCGGCTGG
ACGGATACTG CCGAGAACAA CGAGGCGGGC CGCGACAACG CCGAGCGGGC CTACAACCTC
ATCATGAAGG ACAAGGAAAA GCTCCTCTCC CTCGAGACGC CGATCAAGTT TATTTTCTCG
CACTCGGCGC TGCGCGAAGG CTGGGACAAC CCCAACGTCT TTCAAATCTG CACCCTGCGA
GACATCCGTA CGGAGCGCGA ACGGCGGCAA ACCATCGGGC GCGGCTTGCG TTTGTGCGTG
AATCAGTACG GCGAGCGTGT GCGGGGCTTC GATGTCAATA CACTGACCGT GATCGCTACG
GAGAGCTACG AGGACTTCGC CGAGAACTTG CAGAAGGAAA TCGAGGAGGA TACCGGCATC
CGGTTCGGCA TTGTCGAGGC GGACCAGTTC GCTCACATCG TGGTCACCGA TGCGAATGGC
AAAGCCTCGC CGCTCGGCAT CGAACGGTCG AAGGCGTTGT GGGAGTCCTT CAAGGCCTAT
GGCTACATCG ACGCCAAAGG ACGGGTGCAG GATTCACTCA GAAGGGCACT CAGGGACGGC
ACCGTGGAAG TGCCCGAGGA GTTCGCCGCG CAGCGTGACC AAATCACGGA TGTCCTCAAG
AAGGCGGCCG GGCGGCTTGA GATCAAGAAC GCCGACGAGC GTCGGCAGGT GCGTGTTCGC
GGGGAGGTCC ATATCAGTGA TGAGTTCAAG GCGCTGTGGG ACCGCATCAA GGACAAAACC
ACATACCGGG TGAAATTCGA CAACGAAGCG CTCGTGGAGG CGTGCATCAA AGCGCTCAAA
GAAATGCCGG CGATCCCGAA GGTGAGCCTC CAGTGGCGAA AGGCGGATAT CGCCATCGGC
AAGGCCGGCG TGGTAGCCGC CGAGCGCGAG CGTGGGGCGC CCGTAGGGCT CGAGGAAACG
GACATCGATC TGCCGGACCT GCTTACCGAG CTTCAAGACC GGACCCAACT CACGCGCCGC
ACGATTTACC GCGTCCTGAC GGAAAGTAGG CGTCTGGCCG ACTTCAAGCG CAATCCCCAG
CGGTTCATCG AGCTGGCGGC AGAGACTATT AACCGCTGCA AGCGGAAGGC GATGGTTGAT
GGCATCAAAT ACCAGCGCCT GGGTGACGAG CACTACTACG CGCGGGAGCT GTTCGAAAGT
GAGGAGCTGA CCGGCTACCT GAAGAATCTC TTCGAGGCCA AGAAGTCGGT CTATGAGCAG
GTGGTCTACG ACTCCGACAC CGAGCTAACG TTTGCCCAGC AGCTCGAGTA CAACCGTGAT
ATCAAGGTCT ACGCAAAACT CCCGGGCTGG TTCACCGTGC CGACGCCGCT GGGCGGATAC
AACCCTGACT GGGCGGTGGT CGTCGAGCAG GAAGGCGAGG AGCGCCTCTA CTTTTACTTT
GTCGTTGAGA CCAAGGGCAG TGTGGTCGCC GACGACTTGC GTGGCAAGGA AAAGGCGAAG
ATTGACTGCG GCAAGGCGCA TTTCGAGGCG CTCAAGGATC GCGAGAACCC GGCACAGTAC
AAAGTAGCGA GTTCGGTTGA TGATCTATTT ATCCGAGAGT GA
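
The blocked gene sequence above can be turned into a plain string, and then its GC fraction and translation can be computed. The following is a minimal Python sketch under stated assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built from the amino acid assignments that NCBI translation table 11 shares with the standard code, and alternative start codons are not handled (this gene starts with ATG, so that does not matter here). Only the first and last lines of the sequence are used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = clean("ATGAAACTGC ACTTTGAACC GAACCTCGAC TACCAGTGGC AGGCCATAGA GGCTGTGTGC")
last_line = clean("AAAGTAGCGA GTTCGGTTGA TGATCTATTT ATCCGAGAGT GA")

print(translate(first_line))  # prints: MKLHFEPNLDYQWQAIEAVC
print(translate(last_line))   # prints: KVASSVDDLFIRE
print(gc_fraction(first_line))
```

The two translated fragments correspond to the first 20 and last 13 residues of the protein sequence listed in the record. Applying `gc_fraction` to the full cleaned sequence, rather than a single line, would be the way to compare against the GC content field.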
 
Protein sequence
MKLHFEPNLD YQWQAIEAVC DLFRGQEICR TEFTVTRDPA DTQMRLGVAQ NELGIGNRLM 
LLDDELLKNL NDIQLRNGLP PSESLASGDF TVEMETGTGK TYVYLRTIFE LNKRYGFTKF
VIVVPSIAIK EGVYKTLQIT EDHFKSLYSG VPFDYFLYDS SKLGQVRNFA TSANIQIMVM
TVGAINKKDV NNLYKESEKT GGEKPIDLIK ATRPIVIVDE PQSVDGGLQG RGKEALDAMN
PLCTLRYSAT HVDKHHMVYR LDAVDAYEKR LVKQIEVASA TVEHAHNKPY VRLVSVSNKR
GIISAKVELD VKTDSGGVKR REVKVQDGDD LEEIAGRAIY AGYRIGEIRV EKGNEYMELR
VPGGEHFLKP GQAWGDVDML AVQRQLIRRT IREHLDKEMR LLPKGIKVLS LFFIDEVAKY
RQYDAAGNPV KGDYARIFEE EYRRAANLPE YEPLFKGVDV SREAEKVHNG YFSIDKKGGW
TDTAENNEAG RDNAERAYNL IMKDKEKLLS LETPIKFIFS HSALREGWDN PNVFQICTLR
DIRTERERRQ TIGRGLRLCV NQYGERVRGF DVNTLTVIAT ESYEDFAENL QKEIEEDTGI
RFGIVEADQF AHIVVTDANG KASPLGIERS KALWESFKAY GYIDAKGRVQ DSLRRALRDG
TVEVPEEFAA QRDQITDVLK KAAGRLEIKN ADERRQVRVR GEVHISDEFK ALWDRIKDKT
TYRVKFDNEA LVEACIKALK EMPAIPKVSL QWRKADIAIG KAGVVAAERE RGAPVGLEET
DIDLPDLLTE LQDRTQLTRR TIYRVLTESR RLADFKRNPQ RFIELAAETI NRCKRKAMVD
GIKYQRLGDE HYYARELFES EELTGYLKNL FEAKKSVYEQ VVYDSDTELT FAQQLEYNRD
IKVYAKLPGW FTVPTPLGGY NPDWAVVVEQ EGEERLYFYF VVETKGSVVA DDLRGKEKAK
IDCGKAHFEA LKDRENPAQY KVASSVDDLF IRE