Gene Information |
Locus tag | Aasi_0618 |
Symbol | |
ID | 6376655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 797749 |
End bp | 799260 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642681771 |
Product | hypothetical protein |
Protein accession | YP_001957745 |
Protein GI | 189502028 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
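The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the terminating stop codon (the listed gene sequence ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Aasi_0618.
length = gene_length_bp(797749, 799260)
print(length)                     # 1512
print(protein_length_aa(length))  # 503
```

Both results agree with the Gene Length (1512 bp) and Protein Length (503 aa) fields.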
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTACA AAAAACAAAA AAAGAATCTT TACATTATTT CGTTGGTTGT GATCCTGTGT GCAATCACAT TAAGTTTATA CTCATTATTT AATTCTAATA GAAACCAACC AATTGGTAAG CGATCGGCTA CTAACCAACA GCATTCTATA CAGCTTACTC AGCTTACTGT TCCCACAACT CCAATAGCGC CAGAAACTCC TAACTTCGTT TCTGCAGCGC AAAAAGCTAC ACCTGCTGTA GTACATATCA CAGCTAAGTA TGAAGCTAAA ATTATGCGTA GAGGTACTAG CCCTATAGAG GAACTATTTA AAGATTTTCT TGGAGAAGGA TTTGAATTAG GCCCTAAAGA ATATAAAACA CAGCCTGGTG CAGCTTTTGG TTCTGGTGTT ATTATTAGTG AAGACGGATA TATTGTTACT AATAACCATG TTATAGATAA GGCTGACCAA ATTGAAGTAA CATTAGATGA TAACCGCAGG TATACTGCCA AAGTGGTAGG CACTGATCCT GACACTGACT TAGCGCTCTT AAAAATAGAA GAAAAAAAGT TATCCTTCTT AGTATTTGGA GATTCTGATA AGCTTCAAGT AGGTGAATGG GTGCTAGCTG TAGGTAACCC TTTTAACCTA ACTTCTACGG TTACCAAAGG GATCGTGAGT GCCAAAGCTA GAAGAGCTGA TATAGCTAGA TCAGGAGGTG GCATAAAAAT TGAAGCATTT ATTCAAACAG ATGCGGCTGT TAATAAAGGA AATAGTGGTG GTGCACTTGT TAACCTACAA GGTGAGCTAG TAGGCATCAA TACAGCTATT TCTACACCTA CCGGTGCGTT TGCTGGCTAT TCTTTCGCTA TCCCCAGCTC TATTGTACAA CGTATTATTA GCGATCTAAA AAAGTATGGA ACTGTACAAC GTGCTATATT AGGTATATTT CCTTTGGATG TAAATGCAGA CCTAGTAGAA GAAAAGAAAT TAAAACGGTT TGATGGTGTT TACGTAAATG GATTCTCTGA ACGTAGTGCT GCTGCTGAAG CAGGTTTAAA AGAAGGCGAT ATTATTATAG CTATTAATAA TACGAAAATT AAGAATTTAG CTCAGTTACA TGAACATCTT ACTCACTACC AGCCAGGCGA TAAAGTAAAT GTGTTGATAG ATAGAAAAGG AAAAGAAATT AATATTACAG TTACCCTTAA AAATGCTTTA AATGAAATTA AAATTGTACA TGGGCAAGGT AGTATAGAAG TAGAAGGCGC TACCTTTGAA CCCCTAGACC AAGCTACCAA GCAAAAGCTA GGCCTTAAAG CAGGTGTACA GATTAAGGGA ATTAAAACAG GTAAATGGCA GCAAGCAGGC ATCAAGAAAG GCTTTATTGT TGTCGCAATT GATAAAGAAC CTATTGAAAC CTTAGATCAG TTAGCAACCA TACTCAATAG TAAAAAAGGA GGTATCCTGA TAGAAGGTAT TTATCCTAAT GGCACAAAAG CTTACTATGG AATAGGTTGG GGCGATCTAT AA
|
Protein sequence | MPYKKQKKNL YIISLVVILC AITLSLYSLF NSNRNQPIGK RSATNQQHSI QLTQLTVPTT PIAPETPNFV SAAQKATPAV VHITAKYEAK IMRRGTSPIE ELFKDFLGEG FELGPKEYKT QPGAAFGSGV IISEDGYIVT NNHVIDKADQ IEVTLDDNRR YTAKVVGTDP DTDLALLKIE EKKLSFLVFG DSDKLQVGEW VLAVGNPFNL TSTVTKGIVS AKARRADIAR SGGGIKIEAF IQTDAAVNKG NSGGALVNLQ GELVGINTAI STPTGAFAGY SFAIPSSIVQ RIISDLKKYG TVQRAILGIF PLDVNADLVE EKKLKRFDGV YVNGFSERSA AAEAGLKEGD IIIAINNTKI KNLAQLHEHL THYQPGDKVN VLIDRKGKEI NITVTLKNAL NEIKIVHGQG SIEVEGATFE PLDQATKQKL GLKAGVQIKG IKTGKWQQAG IKKGFIVVAI DKEPIETLDQ LATILNSKKG GILIEGIYPN GTKAYYGIGW GDL
|
| |
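The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares for all 64 codons (table 11 differs only in its set of permitted start codons). The gene sequence is treated as already given in coding orientation, since it begins with ATG and ends with TAA even though the gene lies on the minus strand.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed in the record.
prefix = clean_sequence("ATGCCCTACA AAAAACAAAA AAAGAATCTT")
print(translate(prefix))  # MPYKKQKKNL
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field.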