Gene AFE_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAFE_1023 
Symbol 
ID7135528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 23270 
KingdomBacteria 
Replicon accessionNC_011761 
Strand
Start bp926062 
End bp927426 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content62% 
IMG OID643529421 
ProductHNH endonuclease domain protein 
Protein accessionYP_002425496 
Protein GI218667415 
COG category[V] Defense mechanisms 
COG ID[COG1403] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGTAT TGGTACTCGA TAAAAGAAAG AAGCCGTTGA TGCCGTGCTC GGAGAAACGG 
GCGCGGCTGC TGCTGGAGCG TGGCCGGGCG CGGGTGCATC GCATGGTGCC GTTTACCATC
CGGCTGGTGG ATCGCTTGCA GGAAGATTCC ACCTTGCAAC CCGTCCGGCT CAAGCTCGAC
CCAGGTAGTA AAACAACCGG CATGGCTCTG GTTCGGGAAC AGGAGTCTGT GGACGAAGAT
ACCGGCGAAA TCCAGCGCAA GGCCATAGTG TTGATGCTGC TGGAGTTGCA GCATCGGGGC
TATGCCATTC GCGACGCGCT CACCCAGAGG CGGGCTTTTC GGCGGCGGCG GCGCGGGAAT
CTGCGCTACC GTCCGGCCCG CTTCGACAAT CGCGCCAAAC CAGAAGGCCG TTTGGCTCCG
AGCTTGCAGC ACCGGGTCGA TACGACGATG GCTTGGGTGC AGAGGCTGTT GCGCTGGGCG
CCGGTATCTG CCCTGTCCAC CATGCTGCAC CGCTTCGATA CCCAGGCACT CCAGAATCCC
GAGATCAGCG GGATCGAGTA CCAGCGCGGC GAACTGTTTG GCTACGAGGT CCGCGAGTAC
CTATTGGAGA AGTGGGGCCG CAAGTGCGCC TACTGCGATG CCCAGAATAC CCCATTGACC
ATCGATCATA TCCACCCCAG GAGCGCGGGC GGCTCGGATC GGGTATCGAA TCTCACCCTG
GCCTGTTTCC CCTGCAACCA GCGCAAGAGC AACCGGGACG TGCGGGAGTT TCTGGCGCAC
GACCCGAAAC GTCTGACCCG CATCGAGGCA AGCCGCAAGG CACCCCTCAA GGACACCGCT
GCCGTCAACA GTACCCGTTG GGCGCTTTGG CGGCAACTGG TGGCTACCGG TCTCGATGTC
GAGGTCGGCA CCGGCGGCAG GACGAAGTGG AATCGCAGTC GGCTACAAAT CCCCAAAGAA
CATTGTCTGG ACGCTGCCTG CGTGGGGCAT GTCGATGGTC TCGAACACTG GCAGCAGCCG
GTACTCGGTA TCAAAGCGAC GGGGCGCGGA AGCTACCAGC GCACGCGGCT GACAAAGCAC
GGCTTCCCGC GTGGCTATCT CACCCGCAGC AAGAGTGCTT TCGGGTTCCA GACGGGCGAT
ATGGTCAAGG CGGTAGTGAC GAAAGGCAAG AAGGTAGGCA CCTATCTGGG CCGCGTTGCC
ATCCGGGCCA GCGGCAGCTT CAACATCCAG ACCGGGAACG GACTGGTGCA ACACATCCAT
TACCGATTCT GCAAACTGGT TCAGCGCGGC GATGGTTACG GATACCACTG GTCGCTTCTC
CACCCCGCGC TGAACCACGG GATTGCCGAA GCTGGGAGGA ACTGA
 
Protein sequence
MAVLVLDKRK KPLMPCSEKR ARLLLERGRA RVHRMVPFTI RLVDRLQEDS TLQPVRLKLD 
PGSKTTGMAL VREQESVDED TGEIQRKAIV LMLLELQHRG YAIRDALTQR RAFRRRRRGN
LRYRPARFDN RAKPEGRLAP SLQHRVDTTM AWVQRLLRWA PVSALSTMLH RFDTQALQNP
EISGIEYQRG ELFGYEVREY LLEKWGRKCA YCDAQNTPLT IDHIHPRSAG GSDRVSNLTL
ACFPCNQRKS NRDVREFLAH DPKRLTRIEA SRKAPLKDTA AVNSTRWALW RQLVATGLDV
EVGTGGRTKW NRSRLQIPKE HCLDAACVGH VDGLEHWQQP VLGIKATGRG SYQRTRLTKH
GFPRGYLTRS KSAFGFQTGD MVKAVVTKGK KVGTYLGRVA IRASGSFNIQ TGNGLVQHIH
YRFCKLVQRG DGYGYHWSLL HPALNHGIAE AGRN