Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AFE_1646 |
Symbol | |
ID | 7134150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 23270 |
Kingdom | Bacteria |
Replicon accession | NC_011761 |
Strand | - |
Start bp | 1416685 |
End bp | 1418169 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643530029 |
Product | serine protease, DO/DeqQ family |
Protein accession | YP_002426068 |
Protein GI | 218667609 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.17535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA TCCAATGGAA AAGACCACGC TATATAGTAT CCCTTGCTGT TGCCGTTGCG TTGGGTTTCG GACTGGGAGC AACGGGATGG GTTTTTGGGG AATCAGGTCA TACCATTACG CCTCTGCCCT CACCAAGTGA GGCAGAAACA AGGCCAAGAT TGGTCAAGAT ACCGGACTTT AGCCCAATCG TAAAAAAATA TGGCAACGCA ATCGTTAGAA TCAGCGATTC TGACACAAAA ATAGTTCATG AACATCAGGG GTTTTTTAAT CCATTTCCGA AAGACTCGCC GTTTTTTGGT TTCTTTCATG GGCTACCAAA TTACACGCCC CCGGAAAAGG AGGTGACCAA GGCTCTCGGG TCTGGATTCA TCATCTCTCA CAACGGTTAC ATCGTCACGG CGGGACATGT CGTGAGAGGC ATGCACCATA TTATGGTGAC CCTGAATAAT CATCACGCGT ATCGAGCCAA AGTGGTCGGA TTATCCGTCC ATTATGATAC CGCGTTGCTG AAGATTCATG CCCACGATCT GCCTATCGTA CAACTGGGGA ACTCAAAGAA CCTTCAGGTC GGTCAGTGGC TGCTGGCTAT TGGTATGCCG TTTGGACTCT ATAACACCGT AACCCAAGGC GTGGTCAGCG CCATGAATAG ATCACTACCT CATGATAATC AGTACATACC ATTCATTCAA AGCGATGTGC CCATCAATCC TGGAAATTCT GGCGGTCCGC TTTTCAACAT GCGTGGACAG GTCGTCGGGA TCAATGATCA GATTTATACT AACGATGGCG GCTACATGGG GTTATCCTTC AGCATCCCGA TTGATACCGC GATGCGTGCC GTTCATGCAT TCGAGCGCCA TCAGAAAGTA AAATTTGGTT GGCTGGGTGT CGAAATTCAG TCAATGACGC CACAAATGGC GCAGGCGATG CACCTTCCGG AACCAGTAGG CGCATTGATC GCGCAGGTTA TGCCCTCGAG TCCTGCGGCA AAAGCGGGCA TTAAGTCCGG GGAAGTGATC GTGGCTTATG ATCACCGTCC TATTTACAAC GTTAGCACGC TCCCTCCATT AGTAGGTGAC ACACCACCAG GCAGGATTGT GCCCATCGGC ATCCTCGATC ACGGGAAGCC CAGGACATTG CAGGTTCAGG TCGGCGAGAT GCCGCAAAAG ATGCTGGTGG CTGCCGATCA ACAGAGCATC GACATTCGTC GTCTAGGGGT ACGCGTTGGC ACCTTGGGAC CAAAGGAGCA GCAGAAACTC GGGGTGGATC ACGGCGTATT GATTCAGTCC GTCTATCCGG GGCCAGCATC TTTTATTGGC CTACGGAGTG GAATGGCGAT TTTGTCCATC AACCAACTGC GGGTAACCAG CCCTGAACAA TTGGCTCAAC TGGTGAAATC TCTCCCTGCG AATACACCCA TTTCCATGCG TATTCGCAAT CACCATGGGA GTATTTTCGT CGTGATTACG CTGCCCACAC GATAG
|
Protein sequence | MRKIQWKRPR YIVSLAVAVA LGFGLGATGW VFGESGHTIT PLPSPSEAET RPRLVKIPDF SPIVKKYGNA IVRISDSDTK IVHEHQGFFN PFPKDSPFFG FFHGLPNYTP PEKEVTKALG SGFIISHNGY IVTAGHVVRG MHHIMVTLNN HHAYRAKVVG LSVHYDTALL KIHAHDLPIV QLGNSKNLQV GQWLLAIGMP FGLYNTVTQG VVSAMNRSLP HDNQYIPFIQ SDVPINPGNS GGPLFNMRGQ VVGINDQIYT NDGGYMGLSF SIPIDTAMRA VHAFERHQKV KFGWLGVEIQ SMTPQMAQAM HLPEPVGALI AQVMPSSPAA KAGIKSGEVI VAYDHRPIYN VSTLPPLVGD TPPGRIVPIG ILDHGKPRTL QVQVGEMPQK MLVAADQQSI DIRRLGVRVG TLGPKEQQKL GVDHGVLIQS VYPGPASFIG LRSGMAILSI NQLRVTSPEQ LAQLVKSLPA NTPISMRIRN HHGSIFVVIT LPTR
|
| |