Gene AFE_2790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAFE_2790 
Symbol 
ID7134994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 23270 
KingdomBacteria 
Replicon accessionNC_011761 
Strand
Start bp2480037 
End bp2483105 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content59% 
IMG OID643531143 
Producttype I restriction-modification enzyme, R subunit 
Protein accessionYP_002427160 
Protein GI218665978 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCCC TCAACGAATC CACCATCGAA CACCACGCCA TTGCCCTATT CAAAGAACTG 
GGCTACGCCT ACGCATTCGG CCCCGACATC GGCCCGGATA GCAACCAAAA AGAACGCGCT
GACTACGAAA ACCCCCTGTT ACTGGAACGC CTGCGTACAG CCATCCATCG CCTGAATCCA
GAAGCCCCCA AGGAAATCCG CGACGAAGCC CTACGCCATG TGCAGGCTAT TGCCGTGGGT
GGACTGATGA CCGAGAACAG CCTATTTCAG CAGATGCTCG TGGAAGGGGT ACGGGTGGCG
CACATGGTGG ACGGCGAAGA ACGCGGCAGC ATCGTCCGCC TGATCGACTA CGACCATCCT
GAAAAGAATG ATTTGCTGGT GGTCAACCAG TACACCGTCA TCCACAACCG CATCAACCGC
CGCGCCGATC TAGTGGTTTT CGTGAATGGC CTGCCCTTGG GCCTCTTTGA ACTGAAAAAC
ATGGCGGACG CCAACGCCAC CACCCGCCGG GCATGGAATC AACTACAGAC CTATCAGGCC
GACATTCCCA ACCTCATGGC CTACAACGCC ATACTGGTCA TCAGCGATGG GCTGAAGGCC
GCCGTAGGCT GTCTGGGTGC CCCTTATGAA CGCTTTCTGC CCTGGAAAAC CATCGACGGT
CAGGAACTCA TGGACCGGGG GGATGACCCT TTACGTTTCC TGATTCACGG GGTTTTTGAA
CCCGTCCGTT TTCTGGACTT GATCCGCTAT TTCATCGCCA TTAGTGACGA TGGCCGCAAA
CTCAGCAAGA AGATCGCCGC CTATCATCAG TTCCACGCCG CCCAGAAGGC CCTGCTGACC
ACCCTGGCAT CGGCAGGCAT CGGTGGTGAT CGCCGGGGTG GTGTGGTCTG GCATACCCAG
GGGTCCGGCA AGAGCCTCAC CATGCTCTTT TTCGCGGGCA TGCTCATCCA GCACCCGGCG
CTGGAAAACC CCACCATCCT CATGCTCACC GACCGCACGA ACCTGGACAC CCAGTTGTTC
GATACCTTTG CCGCAGGCAA GCAGCTTTTA CGGCAAGACC CGCAGCAGGC CGACAGCGTC
AGTGATCTTC GGGAACTGCT GCAACGGGCT TCCGGCGGTG TACTGTTCAG CACCATCCAG
AAATTCCAGA AAGACCGCGA CGAACCTTTG GGCAAGCACC CGGTATTGTC CGAGCGGAAG
AACATCATCG TCATGGTGGA TGAAGCCCAG CGTAGCCAGT ATGAGATTCT GAAGGGATAT
GCCGCCAATC TGCGGGCGGC CTTGCCCAAC GCGACCTTCG TGGCCTTCAC CGGCACACCG
CTGGAACTGG ATGACCGGGA CACGCGGGTG GTGTTCGGCG ATTACATCGA CATCTACGAT
ATTCAGCGGG CCGTCGAGGA TGGGGCCACC GTCCCGATTT ATTACGAAAG CCGCCTGGTG
CGCCTCAACC TGCCGGAAGA CCAGCGCCCC CTCATCGACA GCACCTTTGA GGAAATCACC
GAAGACGACG AGCAGGAAAA CAAGGAACGC CTGAAAACCC GCTGGGCGCA ACTGGAAGCC
CTGGTCGGCG CCCCCAAGCG AGTGGAACAG ATTGCCGCTG ACCTGCTGGA CCATTGGGAA
AAGCGCAAGA GCATCCTGTC CGGCAAGGCC ATGATCGTCG CCATGAGCCG TCGCATTGCC
GTGGAACTTT ATGATGCCAT CGTCCGTATC CATCCCGACT GGCGCGGTGA CGACGACACG
CAAGGCCGGA TCAAGGTGGT CATGACCGGT TCCGCCAGCG ACCCCATCGA ATGGAAGCCG
CATATCCGCA GCAAGGCGGG CAATGAAGCC ATTGCCGACC GCCTGAAAGA CCCGGATGAT
CCGTTAGAAA TCGTCATCGT GCGGGATATG TGGCTGACGG GCTTCGACGC CCCGGCCCTG
AATACGCTCT ATGTGGACAA ACCCATGCGC GGGCACACCC TCATGCAGGC CATCGCCCGC
GTCAACCGGG TCTTCACCAA CAAGTCCGGT GGACTGGTGG TGGATTACAT CGGCATTGCC
CAGGACCTGA AAAACGCCAT TGCCACCTAC ACCCGTTCCG GTGGCAAGGG CAATCCGGCG
GAAACCGTGC AGACCGCCAT CAAGGTGCTG CTGGAAAAAC TCGACATCTG CCGCGGTATT
TTCCATGGCT GTGACTATCA GGATTATCTG ACTGACGACG GATCAGAAAA GGTGGCCACC
CTGCGCAAGG CCACCGAATT CATTCTGTCT CAGGAACTCA CCGAAGAAGG CGTCAAGCGC
CGTTTCCGCA ACGCCACGTC CGGCCTCAAA AAAGCGGCGA CTTTAGCCGC CGGCGACGAC
ATTGTGGAAC GCTGCCGGAC TGAGATCATT TTCTTCCTGA GCGTCCGTGC CACTCTGGAC
AAGTCCAGCG AGGCCGAGTC TCTGGTGGAA GAACAGGAAT ACGCCATCCG CCAACTTATC
GACCAATCCA TCGCCCCGGA AGGCGTGGTG GACCTGTATG CGGTGGCGGG GCTACCCAAA
AGCGAAATAT CCCTGCTGTC CGATGAATTC ATCCACCAGA TCGAGGCTAT GCCCGAAAAG
AATCTGGCGC TGGAAATGCT GCGTCGATTA CTGCAGGACA AGGTCCGCAA GGAAGGCAAG
GGCAACCTGA TCCAGAGCAA AGCCTTCTCG GAAAAGCTGG AAGAAGCCCT GCGCAAATAC
CACAACCGTT CGGTGGATAC CGTCGAGGTG ATGCAGGAAC TCATCGAACT GGCCAAAGCA
TTACAGGCGC TGGATCAACG CCATGCGGAA CTGGGCCTCA GCAAGGACGA GGCCGCCTTT
TATGATGCCC TGGCGACCAA CGATTCCGCC GTGCAGGCCA TGGGGGACGA GGCCCTGAAA
ATGATCGCCA AGGAAGTGGC CGATACTGTG CGCCAAAACA CCCGCATCGA CTGGTCCATC
CGGGAACAGG CAAGGGCGCA TCTGCGGCGC ATGGTGAAGC GGGTGCTACG CAAACACGGC
TATCCACCCG ACATGCAGGA AGGGGCGGTG AATCTGGTCA TTGAACAGGC TGAGGGGTTG
GTGGGGTGA
 
Protein sequence
MTPLNESTIE HHAIALFKEL GYAYAFGPDI GPDSNQKERA DYENPLLLER LRTAIHRLNP 
EAPKEIRDEA LRHVQAIAVG GLMTENSLFQ QMLVEGVRVA HMVDGEERGS IVRLIDYDHP
EKNDLLVVNQ YTVIHNRINR RADLVVFVNG LPLGLFELKN MADANATTRR AWNQLQTYQA
DIPNLMAYNA ILVISDGLKA AVGCLGAPYE RFLPWKTIDG QELMDRGDDP LRFLIHGVFE
PVRFLDLIRY FIAISDDGRK LSKKIAAYHQ FHAAQKALLT TLASAGIGGD RRGGVVWHTQ
GSGKSLTMLF FAGMLIQHPA LENPTILMLT DRTNLDTQLF DTFAAGKQLL RQDPQQADSV
SDLRELLQRA SGGVLFSTIQ KFQKDRDEPL GKHPVLSERK NIIVMVDEAQ RSQYEILKGY
AANLRAALPN ATFVAFTGTP LELDDRDTRV VFGDYIDIYD IQRAVEDGAT VPIYYESRLV
RLNLPEDQRP LIDSTFEEIT EDDEQENKER LKTRWAQLEA LVGAPKRVEQ IAADLLDHWE
KRKSILSGKA MIVAMSRRIA VELYDAIVRI HPDWRGDDDT QGRIKVVMTG SASDPIEWKP
HIRSKAGNEA IADRLKDPDD PLEIVIVRDM WLTGFDAPAL NTLYVDKPMR GHTLMQAIAR
VNRVFTNKSG GLVVDYIGIA QDLKNAIATY TRSGGKGNPA ETVQTAIKVL LEKLDICRGI
FHGCDYQDYL TDDGSEKVAT LRKATEFILS QELTEEGVKR RFRNATSGLK KAATLAAGDD
IVERCRTEII FFLSVRATLD KSSEAESLVE EQEYAIRQLI DQSIAPEGVV DLYAVAGLPK
SEISLLSDEF IHQIEAMPEK NLALEMLRRL LQDKVRKEGK GNLIQSKAFS EKLEEALRKY
HNRSVDTVEV MQELIELAKA LQALDQRHAE LGLSKDEAAF YDALATNDSA VQAMGDEALK
MIAKEVADTV RQNTRIDWSI REQARAHLRR MVKRVLRKHG YPPDMQEGAV NLVIEQAEGL
VG