Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0842 |
Symbol | |
ID | 8413708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 935340 |
End bp | 937274 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645022425 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_003179862 |
Protein GI | 257784645 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0456572 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAATA ACGATAACAA GCATCGTAGA TCTATGTCGA TGCTACTCTA TATTGCAGTA GCTATCTTTG TATATCTCCT GCTTAGCAAC ACTTTATTAC CAGGTCTTTT GCGCCAACAG ATACAAACTG TCTCCTACAG TGAGTTTCTT AATAAAATTG AAAGTAATGA GGTTACTAAA GTAGATCTCA ACACTGGTAA CAGAAACATT AGATTTACTA CAGGCTCTGG AGATTCCGAG AAAATCTTTG AGACTACGCA GTTCCCTAAT GATTCAACAC TGGTACAAAC GCTCAGAGAA CACAAAGTTG ACTTCTCAGC TTCCATTCCT GACAACTCTG CAAACATGCT GATGTATGCC CTTATTCAAT ACGGCATTCC TCTTATTATC TTCTTGGGTA TTGGTTTCTT TATTAACCGC TCCCTTAAAC GCGCTATGGG AGACGATGGT CCATCCATGA ACTTTGGCGG CGGTTTTGGT GGTCTCGGCG GCAATCTTGG TCGCTCAAGC GCTAAAGAAA TTAAGGGAGA AGATACTGGC ATTACGTTTA AAGACGTTGC TGGCCAAGAA GAGGCTAAAG AGTCTATGCA AGAGATTGTT AGCTTCCTTA AGACTCCCGA TAAGTACAAA GAAATTGGTG CTCGCTGTCC TCGTGGTGCT CTACTTGTAG GACCTCCAGG CACCGGTAAA ACTCTTATAG CTAAAGCAGT TGCTGGTGAA GCTGGCGTTC CTTTCTTCCA GATTGCTGGC TCTGAGTTTG TTGAGATGTT TGTTGGACGC GGTGCCGCAA AAGTCCGCGA TCTCTTCAAG CAGGCAAATG AGAAAGCTCC TTGCATTATC TTCATTGATG AGATTGATGC TGTTGGTAAG CGCCGCGACG CTTCCCTCAA CTCCAACGAT GAGCGTGAGC AGACCTTAAA CCAGCTGCTC TCAGAGATGG ATGGCTTTGA TAACCACAAG GGTATTGTTG TTCTGGCAGC AACTAACCGC CCAGAAACCT TGGACAAGGC ACTTTTGCGT CCTGGTCGCT TTGATCGTCG TATTCCTGTT GAGCTTCCAG ATCTTAAGGG TCGTGAGGCA GTTCTCCAGA TTCACGCCAA TGATGTAAAA ATGGAGCCAG GCGTTGACCT CTCTATCGTT GCTAAGTCCA CGCCAGGAGC ATCTGGTGCA GACCTTGCAA ACATCATCAA TGAGGCAGCT CTTCGTGCTG TTCGCTTTGG TCGCCGTCGT GTTACCACTG AAGACCTTAC AGAGTCTGTC GACGTCGTTA TTGCCGGAGC AAAAAAGAAA AATAGTGTTC TATCTGAGCA TGAGAAGGAT GTTGTTGCCT ATCACGAGAC CGGCCACGCA ATTGTTGGTG CCATCCAGAA AAACGATGCT CCTGTCACCA AGATTACTAT TGTTCCTCGT ACTAGCGGAG CCCTTGGCTT TACCATGCAG GTTGAGGACG ATGAGCGTTA TCTGATGAGT AAGAGTCAAG CCATGGATGA GATTGCTGTT CTCTGTGGTG GACGCGCTGC TGAAGAGCTT ATCTTTGGCG AGATGACCAA TGGTGCCTCC AATGATATTG AGCGCGCAAC TGCAATTGCA CGCGCAATGG TTACCCAGTA CGGCATGTCT GACAAGCTTG GTATGGTTAC CCTAAGCCAG CAGCAAAGCC GCTATCTTGG TGGTGGCTCT TCCCTCACCT GCTCTGAAGC AACTGCTGAA GAGATCGACG CTGAGGTTAG ACGTATTGTT GAAGAGGGTC ACCAGCGGGC ACTTCAAACG CTTAAAGAGA ATCGCTTTAA ACTGCATGAA ATTGCTCACT ATCTACAGAA GAAAGAAACT ATTACCGGCG AGGAGTTCAT GAATATCCTC AAGCGTGAGA ATACCTTTGC ACCTGTAGAT AAGAACATCA ACGATGAAGG CTCTTCTACT CCTTCAGAAG AGTAA
|
Protein sequence | MANNDNKHRR SMSMLLYIAV AIFVYLLLSN TLLPGLLRQQ IQTVSYSEFL NKIESNEVTK VDLNTGNRNI RFTTGSGDSE KIFETTQFPN DSTLVQTLRE HKVDFSASIP DNSANMLMYA LIQYGIPLII FLGIGFFINR SLKRAMGDDG PSMNFGGGFG GLGGNLGRSS AKEIKGEDTG ITFKDVAGQE EAKESMQEIV SFLKTPDKYK EIGARCPRGA LLVGPPGTGK TLIAKAVAGE AGVPFFQIAG SEFVEMFVGR GAAKVRDLFK QANEKAPCII FIDEIDAVGK RRDASLNSND EREQTLNQLL SEMDGFDNHK GIVVLAATNR PETLDKALLR PGRFDRRIPV ELPDLKGREA VLQIHANDVK MEPGVDLSIV AKSTPGASGA DLANIINEAA LRAVRFGRRR VTTEDLTESV DVVIAGAKKK NSVLSEHEKD VVAYHETGHA IVGAIQKNDA PVTKITIVPR TSGALGFTMQ VEDDERYLMS KSQAMDEIAV LCGGRAAEEL IFGEMTNGAS NDIERATAIA RAMVTQYGMS DKLGMVTLSQ QQSRYLGGGS SLTCSEATAE EIDAEVRRIV EEGHQRALQT LKENRFKLHE IAHYLQKKET ITGEEFMNIL KRENTFAPVD KNINDEGSST PSEE
|
| |