Gene Apar_0842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0842 
Symbol 
ID8413708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp935340 
End bp937274 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content46% 
IMG OID645022425 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_003179862 
Protein GI257784645 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0456572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATA ACGATAACAA GCATCGTAGA TCTATGTCGA TGCTACTCTA TATTGCAGTA 
GCTATCTTTG TATATCTCCT GCTTAGCAAC ACTTTATTAC CAGGTCTTTT GCGCCAACAG
ATACAAACTG TCTCCTACAG TGAGTTTCTT AATAAAATTG AAAGTAATGA GGTTACTAAA
GTAGATCTCA ACACTGGTAA CAGAAACATT AGATTTACTA CAGGCTCTGG AGATTCCGAG
AAAATCTTTG AGACTACGCA GTTCCCTAAT GATTCAACAC TGGTACAAAC GCTCAGAGAA
CACAAAGTTG ACTTCTCAGC TTCCATTCCT GACAACTCTG CAAACATGCT GATGTATGCC
CTTATTCAAT ACGGCATTCC TCTTATTATC TTCTTGGGTA TTGGTTTCTT TATTAACCGC
TCCCTTAAAC GCGCTATGGG AGACGATGGT CCATCCATGA ACTTTGGCGG CGGTTTTGGT
GGTCTCGGCG GCAATCTTGG TCGCTCAAGC GCTAAAGAAA TTAAGGGAGA AGATACTGGC
ATTACGTTTA AAGACGTTGC TGGCCAAGAA GAGGCTAAAG AGTCTATGCA AGAGATTGTT
AGCTTCCTTA AGACTCCCGA TAAGTACAAA GAAATTGGTG CTCGCTGTCC TCGTGGTGCT
CTACTTGTAG GACCTCCAGG CACCGGTAAA ACTCTTATAG CTAAAGCAGT TGCTGGTGAA
GCTGGCGTTC CTTTCTTCCA GATTGCTGGC TCTGAGTTTG TTGAGATGTT TGTTGGACGC
GGTGCCGCAA AAGTCCGCGA TCTCTTCAAG CAGGCAAATG AGAAAGCTCC TTGCATTATC
TTCATTGATG AGATTGATGC TGTTGGTAAG CGCCGCGACG CTTCCCTCAA CTCCAACGAT
GAGCGTGAGC AGACCTTAAA CCAGCTGCTC TCAGAGATGG ATGGCTTTGA TAACCACAAG
GGTATTGTTG TTCTGGCAGC AACTAACCGC CCAGAAACCT TGGACAAGGC ACTTTTGCGT
CCTGGTCGCT TTGATCGTCG TATTCCTGTT GAGCTTCCAG ATCTTAAGGG TCGTGAGGCA
GTTCTCCAGA TTCACGCCAA TGATGTAAAA ATGGAGCCAG GCGTTGACCT CTCTATCGTT
GCTAAGTCCA CGCCAGGAGC ATCTGGTGCA GACCTTGCAA ACATCATCAA TGAGGCAGCT
CTTCGTGCTG TTCGCTTTGG TCGCCGTCGT GTTACCACTG AAGACCTTAC AGAGTCTGTC
GACGTCGTTA TTGCCGGAGC AAAAAAGAAA AATAGTGTTC TATCTGAGCA TGAGAAGGAT
GTTGTTGCCT ATCACGAGAC CGGCCACGCA ATTGTTGGTG CCATCCAGAA AAACGATGCT
CCTGTCACCA AGATTACTAT TGTTCCTCGT ACTAGCGGAG CCCTTGGCTT TACCATGCAG
GTTGAGGACG ATGAGCGTTA TCTGATGAGT AAGAGTCAAG CCATGGATGA GATTGCTGTT
CTCTGTGGTG GACGCGCTGC TGAAGAGCTT ATCTTTGGCG AGATGACCAA TGGTGCCTCC
AATGATATTG AGCGCGCAAC TGCAATTGCA CGCGCAATGG TTACCCAGTA CGGCATGTCT
GACAAGCTTG GTATGGTTAC CCTAAGCCAG CAGCAAAGCC GCTATCTTGG TGGTGGCTCT
TCCCTCACCT GCTCTGAAGC AACTGCTGAA GAGATCGACG CTGAGGTTAG ACGTATTGTT
GAAGAGGGTC ACCAGCGGGC ACTTCAAACG CTTAAAGAGA ATCGCTTTAA ACTGCATGAA
ATTGCTCACT ATCTACAGAA GAAAGAAACT ATTACCGGCG AGGAGTTCAT GAATATCCTC
AAGCGTGAGA ATACCTTTGC ACCTGTAGAT AAGAACATCA ACGATGAAGG CTCTTCTACT
CCTTCAGAAG AGTAA
 
Protein sequence
MANNDNKHRR SMSMLLYIAV AIFVYLLLSN TLLPGLLRQQ IQTVSYSEFL NKIESNEVTK 
VDLNTGNRNI RFTTGSGDSE KIFETTQFPN DSTLVQTLRE HKVDFSASIP DNSANMLMYA
LIQYGIPLII FLGIGFFINR SLKRAMGDDG PSMNFGGGFG GLGGNLGRSS AKEIKGEDTG
ITFKDVAGQE EAKESMQEIV SFLKTPDKYK EIGARCPRGA LLVGPPGTGK TLIAKAVAGE
AGVPFFQIAG SEFVEMFVGR GAAKVRDLFK QANEKAPCII FIDEIDAVGK RRDASLNSND
EREQTLNQLL SEMDGFDNHK GIVVLAATNR PETLDKALLR PGRFDRRIPV ELPDLKGREA
VLQIHANDVK MEPGVDLSIV AKSTPGASGA DLANIINEAA LRAVRFGRRR VTTEDLTESV
DVVIAGAKKK NSVLSEHEKD VVAYHETGHA IVGAIQKNDA PVTKITIVPR TSGALGFTMQ
VEDDERYLMS KSQAMDEIAV LCGGRAAEEL IFGEMTNGAS NDIERATAIA RAMVTQYGMS
DKLGMVTLSQ QQSRYLGGGS SLTCSEATAE EIDAEVRRIV EEGHQRALQT LKENRFKLHE
IAHYLQKKET ITGEEFMNIL KRENTFAPVD KNINDEGSST PSEE