Gene AFE_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAFE_1646 
Symbol 
ID7134150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 23270 
KingdomBacteria 
Replicon accessionNC_011761 
Strand
Start bp1416685 
End bp1418169 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content51% 
IMG OID643530029 
Productserine protease, DO/DeqQ family 
Protein accessionYP_002426068 
Protein GI218667609 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.17535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA TCCAATGGAA AAGACCACGC TATATAGTAT CCCTTGCTGT TGCCGTTGCG 
TTGGGTTTCG GACTGGGAGC AACGGGATGG GTTTTTGGGG AATCAGGTCA TACCATTACG
CCTCTGCCCT CACCAAGTGA GGCAGAAACA AGGCCAAGAT TGGTCAAGAT ACCGGACTTT
AGCCCAATCG TAAAAAAATA TGGCAACGCA ATCGTTAGAA TCAGCGATTC TGACACAAAA
ATAGTTCATG AACATCAGGG GTTTTTTAAT CCATTTCCGA AAGACTCGCC GTTTTTTGGT
TTCTTTCATG GGCTACCAAA TTACACGCCC CCGGAAAAGG AGGTGACCAA GGCTCTCGGG
TCTGGATTCA TCATCTCTCA CAACGGTTAC ATCGTCACGG CGGGACATGT CGTGAGAGGC
ATGCACCATA TTATGGTGAC CCTGAATAAT CATCACGCGT ATCGAGCCAA AGTGGTCGGA
TTATCCGTCC ATTATGATAC CGCGTTGCTG AAGATTCATG CCCACGATCT GCCTATCGTA
CAACTGGGGA ACTCAAAGAA CCTTCAGGTC GGTCAGTGGC TGCTGGCTAT TGGTATGCCG
TTTGGACTCT ATAACACCGT AACCCAAGGC GTGGTCAGCG CCATGAATAG ATCACTACCT
CATGATAATC AGTACATACC ATTCATTCAA AGCGATGTGC CCATCAATCC TGGAAATTCT
GGCGGTCCGC TTTTCAACAT GCGTGGACAG GTCGTCGGGA TCAATGATCA GATTTATACT
AACGATGGCG GCTACATGGG GTTATCCTTC AGCATCCCGA TTGATACCGC GATGCGTGCC
GTTCATGCAT TCGAGCGCCA TCAGAAAGTA AAATTTGGTT GGCTGGGTGT CGAAATTCAG
TCAATGACGC CACAAATGGC GCAGGCGATG CACCTTCCGG AACCAGTAGG CGCATTGATC
GCGCAGGTTA TGCCCTCGAG TCCTGCGGCA AAAGCGGGCA TTAAGTCCGG GGAAGTGATC
GTGGCTTATG ATCACCGTCC TATTTACAAC GTTAGCACGC TCCCTCCATT AGTAGGTGAC
ACACCACCAG GCAGGATTGT GCCCATCGGC ATCCTCGATC ACGGGAAGCC CAGGACATTG
CAGGTTCAGG TCGGCGAGAT GCCGCAAAAG ATGCTGGTGG CTGCCGATCA ACAGAGCATC
GACATTCGTC GTCTAGGGGT ACGCGTTGGC ACCTTGGGAC CAAAGGAGCA GCAGAAACTC
GGGGTGGATC ACGGCGTATT GATTCAGTCC GTCTATCCGG GGCCAGCATC TTTTATTGGC
CTACGGAGTG GAATGGCGAT TTTGTCCATC AACCAACTGC GGGTAACCAG CCCTGAACAA
TTGGCTCAAC TGGTGAAATC TCTCCCTGCG AATACACCCA TTTCCATGCG TATTCGCAAT
CACCATGGGA GTATTTTCGT CGTGATTACG CTGCCCACAC GATAG
 
Protein sequence
MRKIQWKRPR YIVSLAVAVA LGFGLGATGW VFGESGHTIT PLPSPSEAET RPRLVKIPDF 
SPIVKKYGNA IVRISDSDTK IVHEHQGFFN PFPKDSPFFG FFHGLPNYTP PEKEVTKALG
SGFIISHNGY IVTAGHVVRG MHHIMVTLNN HHAYRAKVVG LSVHYDTALL KIHAHDLPIV
QLGNSKNLQV GQWLLAIGMP FGLYNTVTQG VVSAMNRSLP HDNQYIPFIQ SDVPINPGNS
GGPLFNMRGQ VVGINDQIYT NDGGYMGLSF SIPIDTAMRA VHAFERHQKV KFGWLGVEIQ
SMTPQMAQAM HLPEPVGALI AQVMPSSPAA KAGIKSGEVI VAYDHRPIYN VSTLPPLVGD
TPPGRIVPIG ILDHGKPRTL QVQVGEMPQK MLVAADQQSI DIRRLGVRVG TLGPKEQQKL
GVDHGVLIQS VYPGPASFIG LRSGMAILSI NQLRVTSPEQ LAQLVKSLPA NTPISMRIRN
HHGSIFVVIT LPTR