Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AFE_1396 |
Symbol | |
ID | 7134259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 23270 |
Kingdom | Bacteria |
Replicon accession | NC_011761 |
Strand | + |
Start bp | 1210286 |
End bp | 1211725 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643529788 |
Product | serine protease, DO/DeqQ family |
Protein accession | YP_002425828 |
Protein GI | 218668070 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.333333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTTT GTATGGGTTT GGGACTGAGC ATAGGGGCGA TGACGCCTGC TTGGGCGGAT TCCGGTGCCA GCGGTCAGGC GTTGGTGGGG CTGCCTGATT TTACGCCCAT CGTCAAGCAG TATGGACCTG CAGTAGTGAA CATCAGCACG ACGGAGACCA GAGTGGCGCG TGGTGTGACT TCCCCCTTCC CGCCGAACTC TCCCCTGAAT CAGTTTTTCG CCCCCTTCTT TGGCGCACCG GGGCAACCTG GAGCACCTGG TGGCGGAGCC GGACAGAAAT ATCAGGTGCA ATCCCTGGGT TCCGGTTTTG TCATCAGCTC CGACGGCTAT ATCGTGACTG CGGCGCATGT GGTGAAAGGG GCTCAGAAGA TCATTGTCAG TCTCACCAAT CATCATCAAT ATGCAGCTCA CCTGGTGGGC CTGTCGGCGC GTATGGATGT GGCGTTGCTC AAAATTGACG CGAAGAATCT GCCGGTGGTA CAGATTGGTG ACTCCAGCAA GCTGGAGGTC GGACAGTGGG TGCTGGCGGT GGGTTCGCCC TTCGGCTTTG AGAACAGCGT CACCCAGGGT GTGATCAGTG CGACCTCGCG GCCTTTGCCG GATGATCCCT ACATCCCGTT CGTTCAAACG GATGTGCCGA TCAACCCTGG TAACTCCGGT GGCCCGCTAT TCAATATGCG CGGTCAGGTC ATCGGCATCA ACGACCAGAT CTATACCAAT AGCGGTGGCT ACATGGGGTT GTCTTTCTCT ATCCCCATCA ATGTCGCCAT GGATGCGGTC AAACAGTTAA AGCTGCATCA GAAAGTGCAT TTTGGCTGGC TCGGGGTCAT GATTCAGGAT GTCAGCATGG ATCTCGCCAA GTCCTTCCAC ATGAAAGAGC CGGTGGGTGC CTTGGTGTCA CAGGTTGTGC CTGACGGTCC GGCTGCCAAG GCGGGGTTAC GTCCGGGAGA TGTCATTGTC TCCTTTGACG GTCAGGCCAT CTATAACTCT GGTCAATTAC CGCCGCTGGT GGGAGTATTG CCCGCCGGTT TCAAGGCGAA GCTGGGGGTT ATCCGTGATG GCAAGCCCAT GAGCCTCAAC ATCGTGGTGG AGAGTCTGCC CGGCAACCTG GAGAATACGG TGGAATCCGC CGCATCCGGC GGTCCGGCGC AGGAAGGTGA AGTCAAACGA CTGAATGTGC AGGTGGGTCC GCTGACTGCG GAGGCACGTA AGCAACTGCA CGTGAATACT GGTGTCCTGG TCCTCGGGGT TGGTGTGGGG CCGGCGGCAG AAGCCGGTAT TCGTCCCGGT GATGTGATCT TGCAGGTGGC ACAGCAGCAG ATTACCAATG CCGCCGACTT GCAGAAGCTG GTGGCTGCCT TGCCGGCGGG CCAGCCGATC CCGGTGCTGG TGCGACGTGG TGAGGGGAGT TTCTATCTGG TGCTTTCGCT GCCGCATTGA
|
Protein sequence | MALCMGLGLS IGAMTPAWAD SGASGQALVG LPDFTPIVKQ YGPAVVNIST TETRVARGVT SPFPPNSPLN QFFAPFFGAP GQPGAPGGGA GQKYQVQSLG SGFVISSDGY IVTAAHVVKG AQKIIVSLTN HHQYAAHLVG LSARMDVALL KIDAKNLPVV QIGDSSKLEV GQWVLAVGSP FGFENSVTQG VISATSRPLP DDPYIPFVQT DVPINPGNSG GPLFNMRGQV IGINDQIYTN SGGYMGLSFS IPINVAMDAV KQLKLHQKVH FGWLGVMIQD VSMDLAKSFH MKEPVGALVS QVVPDGPAAK AGLRPGDVIV SFDGQAIYNS GQLPPLVGVL PAGFKAKLGV IRDGKPMSLN IVVESLPGNL ENTVESAASG GPAQEGEVKR LNVQVGPLTA EARKQLHVNT GVLVLGVGVG PAAEAGIRPG DVILQVAQQQ ITNAADLQKL VAALPAGQPI PVLVRRGEGS FYLVLSLPH
|
| |