Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lferr_2470 |
Symbol | |
ID | 6878468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 53993 |
Kingdom | Bacteria |
Replicon accession | NC_011206 |
Strand | + |
Start bp | 2436464 |
End bp | 2437984 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642790327 |
Product | protease Do |
Protein accession | YP_002220872 |
Protein GI | 198284551 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.769337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAATG ATCGCCCGAG TGTGATGTTG AAGCGCCCCA AACTGGTAAT CACTGCCGTC GTGGCGGCAT GTTTTGGTTT TTCACTGGGT GCAGCGGAGT GGGCGCAAGC CGACGCCCCC GCGGCCCCGG CCCTTTCCAT CTCGACCAAC ACAGTTCCCC AGAAGGCGCT GGTCGCCTTG CCGGATTTTA CCCCGATTAT CGACCGTTAT GGTCCCGCGG TGGTCAACAT CAGCAGCACC ACCAACAAGG TCATTCACCA GCAGGCCAAT CCCTTTCCGC CAAACTCGCC TTTTTACCAG TTTTTCCATC ACTTCATGGC GCCCGGACAG GGCGGTGGTC CCGGACCACA GCAGCATGAA AAAATCCAAT CTCTCGGTTC CGGCTTCATC ATCAGTCCCG ACGGCTATAT CGTCACCGCC GGTCACGTCG TTCGCGGCGC TAACCATATC GTCGTTACCC TCACCACCCA CCACGCCTAT CCGGCCAAAC TGATCGGCCT GTCGGTGCGT TACGATACGG CACTGCTCAA AATCAACGCC AAGAACCTAC CCACTGTGCC CATCGGCAAC TCCGACGATT TGAAGGTTGG CCAGTGGCTG CTCGCCGTTG GCGCGCCGTT CGGTTTCTAT AACACAGTGA CCCAGGGTGT AGTCAGCGCC ATGAATCGCC CGTTACCCGA TGATGAATAC ATCCCCTTTA TCCAGAGCGA CGTACCCATC AATCCCGGCA ACTCGGGTGG GCCACTCTTC AACATGAATG GCCAGGTGAT CGGCATCAAC GACCAGATCT ATACCAACAG CGGTGGCTAT ATGGGTCTGT CGTTCTCCAT CCCCATCAAT ACCGTGATGC GGGTGGTGCA GGACTTCAAA AATCACAAGG CAATTCAGTT TGGCTATCTG GGGGTCGAGG TACAGGACGT CACGCCGCAA ATGGCGCAGG CGCTGCACCT CACGGAACCC GTCGGCGCCC TCATCGCGTC CGTGGAACCC GGCAGCCCGG CGGCCAAGGC GGGCATCAGG CCCGGTGACG TCATCGTCAC CTATGATAAC AAGCCGGTGT ACAATGTCGG ACAGTTGCCA CCTATGGTAG GCAACACCCT GCCAGGCACC CACGCCAAGG TCGGCATCCT CCACCGCGGA AAAGCGGAAA CCAAAGATGT CCTGATTGCC GCCCTGCCCA AAAACATGGA AGGGCCTTCC GGCAGCCAGG AACCTTCCAC GGCCGCCAAA GTGGGCAAAG TCTCCCGCAT GGGCATCCAT GTCCAGTCGC TGACCCCCAG CATTGAGAAG CAGTTGGATG TGCATCATGG CGTCGTGGTG GTCGGCGTTA GTGAAGGCGC GGCTGCGGAG GCAGGGATCA TGCCGGGTAT GATCATCCAG CAGATCGATC AGCAGGACGT CAACAGTCCC GCCGAGTTGG AGCATATCGT TGCCGGTCTG CCGGCCGGTC AGCCCATCCC GCTGCTGGTG CGGCAAGGCA AGGCCAGCAT CTACGTGGTG GTGACGCTAC CCAAGAAGTA G
|
Protein sequence | MANDRPSVML KRPKLVITAV VAACFGFSLG AAEWAQADAP AAPALSISTN TVPQKALVAL PDFTPIIDRY GPAVVNISST TNKVIHQQAN PFPPNSPFYQ FFHHFMAPGQ GGGPGPQQHE KIQSLGSGFI ISPDGYIVTA GHVVRGANHI VVTLTTHHAY PAKLIGLSVR YDTALLKINA KNLPTVPIGN SDDLKVGQWL LAVGAPFGFY NTVTQGVVSA MNRPLPDDEY IPFIQSDVPI NPGNSGGPLF NMNGQVIGIN DQIYTNSGGY MGLSFSIPIN TVMRVVQDFK NHKAIQFGYL GVEVQDVTPQ MAQALHLTEP VGALIASVEP GSPAAKAGIR PGDVIVTYDN KPVYNVGQLP PMVGNTLPGT HAKVGILHRG KAETKDVLIA ALPKNMEGPS GSQEPSTAAK VGKVSRMGIH VQSLTPSIEK QLDVHHGVVV VGVSEGAAAE AGIMPGMIIQ QIDQQDVNSP AELEHIVAGL PAGQPIPLLV RQGKASIYVV VTLPKK
|
| |