Gene AFE_2381 details


Gene Information

Locus tag: AFE_2381
Symbol:
ID: 7136690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 23270
Kingdom: Bacteria
Replicon accession: NC_011761
Strand:
Start bp: 2127023
End bp: 2130121
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 61%
IMG OID: 643530746
Product: type I restriction-modification system, R subunit
Protein accession: YP_002426770
Protein GI: 218667282
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
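
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2127023
END_BP = 2130121
GENE_LENGTH_BP = 3099
PROTEIN_LENGTH_AA = 1032


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 3099
    print(coding_codons(GENE_LENGTH_BP))   # 1032
```

Under these assumptions the reported gene length and protein length both agree with the coordinates.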


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCGG AAGGCCGCAC CCGGAAACAA ATCGACGCCC CAGACACCTA CAGTCGGCCC 
GCTGATAGTC CGAGTTTCGC CGTCCGCGAG GACACCATCG AATACGGCTT CATCGGCACC
CTGCAAAACC TCAAGTACGA ATATCGCCCC GACATCCGCG ACCGCGCCGC GCTGGAGGCC
AATTTCCGCC AGCACTTCGA AGCGCTCAAC CGCGTCCGCC TCACCGATGC CGAGTTCGCC
CGCCTGCTGG ATGAGATCGT CACCCCGGAT GTTTTTACCG CCGCCAAGAC CCTGCGCAGC
ATCAATGCCT TCACCCGCGA CGACGGCACG CCGCTGAACT ACAGCCTCGT CAACCTCAAG
GACTGGTGCA AAAACACCTT CGAGGTCATC CACCAGTTGC GCATCAATAC CGACTACAGC
CACCACCGCT ACGACGTCAT CCTGCTCATC AACGGTGTGC CCTGCGTGCA GATCGAGCTG
AAAACCCTCG GCGTCCATCC GCGCCGGGCC ATGGAGCAGA TCGTCGAATA CAAGCACGAC
CCCGGCAACG GCTACACCCG GACGCTGCTC TGCTTCATGC AGCTCTTCAT CGTCAGCAAC
CGCGACCAGA CTTACTACTT CGCCAACAAC AACGCCCGCC ATTTCGCCTT CAACGCCGAC
GAGCGCTTTT TGCCCGTCTA TGAGTTCGCG GACGAGGACA ATAGCAAGAT CAGACAGCTC
GACGCCTTCG CCGAGCGCTT CCTGAAAAAG TGTAACCTCG GCCAGACCCT CAGCCGCTAC
ATGGTGCTGC TGGCGGGCGA GCAAAAGCTC ATGATGATGC GACCCTATCA GGTCTATGCC
GTGCAGCACA TGGTCAAGTG CATCGATGAA GACAACGGCA ACGGCTACAT CTGGCACACC
ACCGGCAGCG GCAAGACGCT CACCTCCTTC AAGGCCGCCA CCCTGCTCAA GGAGAACGAG
CACATCCATA AATGCGTGTT CGTTGTCGAC CGCAAGGACC TCGACCGCCA GACGCGGGAG
GAATTCAACC GCTTCCAGGA AGGCTGTGTC GAAGAAAACA CCCACACCGG CGCCCTCGTG
CGCCGCCTGC TGTCCGAGGA CTACGCCGAC AAGGTCATCG TCACCACCAT CCAGAAGCTC
GGCCTCGCGC TCGACGAAAC GAGCCGCCGC AACAAGCAAC GCAGCAAAAA CGGCCAGGCC
ACCTACAAGG AGCAGCTCGA AGCACTGCAG GACGAGCGCA TCGTCTTCAT CTTCGACGAA
TGCCACCGCT CGCAGTTCGG TGAGAACCAC AAGGCCATCA AAGCCTTCTT CCCCCGCGCC
CAACTCTTCG GCTTCACCGG CACGCCCATC TTCGAGGCCA ATGCCAGCCT GCAGAAGATC
GAGGACAGCA CGGCCTCCAT GCGCACCACG GCAGACCTCT TCCAGAAACA GCTGCACGCC
TACACCATCA CCCACGCCAT CGAAGACGGC AATGTGCTGC GCTTCCATGT CGATTACTTC
AAGCCGGAAG GAAAGAACCC ACCCAAACCC GGCGAACCCA TCGCCAAGCG CGCGGTCATC
GAAGCCATCC TCGCCAAGCA CGATGCCGCC ACCGGCGGGC GCCGCTTCAA CGCCCTCTTT
GCCACCGCAT CCATCAACGA TGCCATCGAA TACCACGCGC TGTTCAAGAC CATGCAAGCC
GAGAAACTGG CCGCCGATCC CGAGTTCAAG CCGCTGAATA TCGCCTGCGT CTTCTCTCCG
CCTGCGCAGC TCGCGGAGAA TCCCGAAAGC AAAAAGGACA TCGACCAGCT CTCCGAAGAC
CTCCCGCAAG AGCAGGAAGA CAACAAGGTC GAGCCAGAAG CGAAGAAGCA GGCGCTTGAA
GGCATCCTCG CCGACTACAA CGCCCGCTAC GGCACCAACC ATCGCCTGAG CGAATTCGAT
CTTTACTACC AGGATGTGCA AAAGCGGATC AAAGACCAGC AGTGGCCGAA TGCCGACTAC
CCCCCGGCGC AGAAAATCGA CATCACTATC GTCGTGGACA TGCTGCTCAC CGGCTTCGAT
TCCAAGTTCC TGAACACGCT CTACGTGGAC AAGAACCTCA AGCACCACGG CCTGATCCAG
GCCTTCTCGC GCACCAACCG CGTGCTCAAC GCCACCAAGC CCTACGGCAA CATCCTCGAC
TTCCGCCAGC AGCAGGATGC CGTCGATGCC GCCATCGCGC TGTTCTCCGG CGAAAAGACC
GGCGAGCAGG CGCGCGAAAT CTGGCTGGTG GAAAAGGCGC CCGTGGTCAT CCAGAAACTG
GAGGCCGCCG TGCAAAAGCT CGACGCCTTC ATGCAGTCCC AGGGGCTGGA TTGCACGCCC
TCCACCGTGG CCAACCTGAA AGGCGATGCC GCCAAGACCG TTTTCATCGA GCGCTTCAAG
GAAGTGCAGC GGCTCAAGAC CCAGCTCGAC CAATACACCG ACCTCACCGG GGAAAACAAA
GCCGCCATCG AGCAGGTGCT GCCCGAAGAA ACCCTGCGCG GCTTCAAGGG CCAGTATCTG
GACACCGCCA AAAAGCTCCG CGACGGGCGC AACAAGCCGG ACAAGCCCGG CGCCGACAAG
CCTGCCGATC AGCTCGACTT CGAGTTCGTC CTCTTCGCCT CCGCCGTCAT TGATTACGAC
TACATCATGA AGCTGATGAC CAGCTTCTCG GCCAAAGAGC CGGGCAAGGC CAAGATGACC
CGCGAACAAC TCATCGGCCT CATCAGCAGT GACGCCAAGT TCATCAACGA GCGCGACGAC
ATCGCCGAAT ATATCGGCAC GCTCCAGGCC GGCGAGGGGC TGAGTGAAAC CGCCATCCGC
GACGGCTACA CCCGCTTCAA GGCGGAGAAG AACGCCCAGG AACTCGCCAC CATTGCCGCA
AAGCACAATC TGGCCACCGC CGCCCTGCAA AGCTTCGTGG ACGGCATTTT TGAGCGCATG
ATCTTCGACG GCGAACGCTT GAGCGACCTC ATGGCCCCGC TCGATCTGGG CTGGAAAGCC
CGCAGCCAGG CCGAAATCGC GCTGATGGAA GATTTGTATC CGCTGCTGAC CCAACGCGCC
GGGGGCCGCG ATATTTCAGG GCTTAGCGCC TATGAGTGA
 
Protein sequence
MKPEGRTRKQ IDAPDTYSRP ADSPSFAVRE DTIEYGFIGT LQNLKYEYRP DIRDRAALEA 
NFRQHFEALN RVRLTDAEFA RLLDEIVTPD VFTAAKTLRS INAFTRDDGT PLNYSLVNLK
DWCKNTFEVI HQLRINTDYS HHRYDVILLI NGVPCVQIEL KTLGVHPRRA MEQIVEYKHD
PGNGYTRTLL CFMQLFIVSN RDQTYYFANN NARHFAFNAD ERFLPVYEFA DEDNSKIRQL
DAFAERFLKK CNLGQTLSRY MVLLAGEQKL MMMRPYQVYA VQHMVKCIDE DNGNGYIWHT
TGSGKTLTSF KAATLLKENE HIHKCVFVVD RKDLDRQTRE EFNRFQEGCV EENTHTGALV
RRLLSEDYAD KVIVTTIQKL GLALDETSRR NKQRSKNGQA TYKEQLEALQ DERIVFIFDE
CHRSQFGENH KAIKAFFPRA QLFGFTGTPI FEANASLQKI EDSTASMRTT ADLFQKQLHA
YTITHAIEDG NVLRFHVDYF KPEGKNPPKP GEPIAKRAVI EAILAKHDAA TGGRRFNALF
ATASINDAIE YHALFKTMQA EKLAADPEFK PLNIACVFSP PAQLAENPES KKDIDQLSED
LPQEQEDNKV EPEAKKQALE GILADYNARY GTNHRLSEFD LYYQDVQKRI KDQQWPNADY
PPAQKIDITI VVDMLLTGFD SKFLNTLYVD KNLKHHGLIQ AFSRTNRVLN ATKPYGNILD
FRQQQDAVDA AIALFSGEKT GEQAREIWLV EKAPVVIQKL EAAVQKLDAF MQSQGLDCTP
STVANLKGDA AKTVFIERFK EVQRLKTQLD QYTDLTGENK AAIEQVLPEE TLRGFKGQYL
DTAKKLRDGR NKPDKPGADK PADQLDFEFV LFASAVIDYD YIMKLMTSFS AKEPGKAKMT
REQLIGLISS DAKFINERDD IAEYIGTLQA GEGLSETAIR DGYTRFKAEK NAQELATIAA
KHNLATAALQ SFVDGIFERM IFDGERLSDL MAPLDLGWKA RSQAEIALME DLYPLLTQRA
GGRDISGLSA YE
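
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the forward reading frame starting at the first base, and it uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = (
        "ATGAAACCGG AAGGCCGCAC CCGGAAACAA "
        "ATCGACGCCC CAGACACCTA CAGTCGGCCC"
    )
    print(translate(first_line))  # MKPEGRTRKQIDAPDTYSRP
    print(round(gc_fraction(first_line), 2))
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to the same functions would allow the reported protein length and GC content to be checked.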