Gene Apar_0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0995 
Symbol 
ID8413867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1123190 
End bp1125820 
Gene Length2631 bp 
Protein Length876 aa 
Translation table11 
GC content51% 
IMG OID645022584 
Productpeptidase U32 
Protein accessionYP_003180015 
Protein GI257784798 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAC GTAATCGTCG TGAGCGAGAA TTTACAGCTC ACGAGTTAAA TCGCATGAAC 
GAGCTGGAGT GGACGGAGGA AGCCTCTGAC TTAGAGTTTT CATCTCAACC GCTTCCTCGT
GCAACACAGA TGGAACTTCT CGCACCAGCG GGAGGACCTG CACCGTTTGC TGCTGCTTTG
GCTGGCGGTG CGGACGCTAT TTATTGCGGT TTGGGTAACA ACTTTAACGC GCGCCGTGGC
GCAGACAACT TTGATGACGA GTCTTTTGCA CGTGCTTGTA GGCAAGCTCA TCTGGCTGGC
GCTCGCGTGT ACGTCACTGT TAATGTTGTT GTTAAGTGGG ATGAAATGCA GCGCGTACTG
CGCCTGATTC GTCGTGCATG GATTCTTGGA GCAGATGCTT TTATCATCCA AGACTGGGGT
CTTATGGCTC AGGTTAGAAA GACTTGGCCA GAGATTGAAT GTCACGTATC AACGCAGGCA
AACATTCACG ATACGCGTGG TGTTTCTGCC TGTAAAAAGC TTGGTGTAGG ACGCGTTACC
CTGTCCAGAG AGCTTACTAA AGAAGAGATT TCTACTATTT CCAAGCTTGG TGTTGAACTA
GAGTGCTTTG GTCACGGCGC CTTGTGTTTC TGTTATTCGG GTATTTGCCA TATGTCATCC
ATGCGTGGAG ACCGTTCTGC TAACCGCGGA GCTTGTGCTC AGCCGTGTCG TCTTCCGTAT
GAGCTGCTCA ATTCTAAGCA CGAAGTTGTC TCGATGGGCG GCATTGATAG GCTGCTGTGT
CCTAAGGATT ATTGCACTAT TGATGACGTG CCGGACATGA TTGAGGCGGG CGTGGGCTCG
CTTAAGATTG AAGGTCGTAT GAAGGCGCCT GAGTACGTGT ACTCTGTTGT GTCTTCGTAT
CGCCAGGCAA TTGATGCTGC CGAGAAAGGC GTTGATAACC AGGCCGATGT GGCACGTCGC
CATCGTTTGC TTAAACGTTC GTTTAACCGC GGCTTGACTA ATGCATATCT GCACGGCACC
GCCGGCAACA AGATGATGAG CTACGAGCGC TCCAATAACC GCGGCGAGGT TGTAGGTGAG
GTTACTGGAG GACGTTCGCT TGAAGATGCT CTTGAGCGCA AGAGTGGCTT GAATGGTGGC
CGCGTTAAGC TGCGTCGCTA CAAGCAGGCT GACGTTGACC TTATTGCACA TGCGCCTATT
GGCAAGAATG ACCTGCTTGA GATTAGACCT TTAGATGAGC CTGACAAGTT CCTGACCGCT
CTTTGTCCTA CAGATGTTGA ACCTGGTCAG AAGGTCACCG TCCGCACTTC TCGCGTTATG
CAGACTGGTT CTACGGTGCG CATTATTCGC TCTGAGGCAG CACGTGTGGC AGCTGAGCAG
ATTTCTTCAC TGGAATATCC TCGCAAACGA GCTGTTGATG TGACTATTAT TGCTCGCATC
GGCCAGCCTT TTACCGTGGT GCTTACTACA ACAGATGGAG CGGCAAGTGC GTCTGCTGAA
GGCTTTGTGG TTGAGGAAGC TCGCACTAAG GCGGTAACTT CTGACGAGCT TATTGAGCAC
GTAGGACGTA TGGGGACCTC TCCATTTGAG GCAGTGAGTT TTGACGTACA GATGGATGAC
GCGTGTGGCA TGAGTTTTAG TGCTGTTCAC AAGGTTCGCG CCGCAGCATG TGAGCAGCTT
GAGGCTGCGC TTCTGGAGGA GTACCAGGAT CGTGAGTATA AGATTGCTCC GCTTTCACGC
CTTGCCTATC AAAAGGAGCG AGAAGCTCAG GATCAAGAGA AACTCTTTGT TTTTGACAAG
GCAGCTGCAA AAACAAATGC ATCGCAGGCA GAGATATGTG TTCTGGTTGA GACTCCTGAG
CAGGCACGCG TTGCACTCAA GGCGGGAGCA GATCGTCTGT ACGCAACAAC AGATGTACTT
ACAGACGCTT CGTGGCCAGA AGATCTTCTC GCCAAGATAG TGCCATGGCT TGACGAGGTG
TGCCGCGAGA TTGACCACAA TCGTTTGGAT CCGTACGTCG TATCTGGTAA GCCGATTGCC
GTAGGCAACA TCTCTGAGCT AGCGCTGGCC GTTGAGCGTG GAGCTGTCCC AGAGGTGCGT
GAGTGTATTC CAATCCACAA CGATTACGCT TTGCAGGCTC TTGCTGACAT GGATGCAGAA
GGCGTCTGGC TCAACTCAGA ACTTACCCTC CAAGAGATTT GTCACATGGC GAGAAACGCC
TCGATTCCTG TGGGCTATAT GGTTAGCGGC CGTATTCGTA CCATGACAAC TGAGCATTGT
ATTTTGATGT CTACGGGTAA GTGCATTCAC GATTGTGATG CCTGTCAGTT GCGCCTTGAG
GAGCATACGC TTAGGGGTAT TGATAATGAT TACATGCCCG TAAGAACTGA CAGACACGGT
CGCTCAAAGA TTTGGAGTCC TAAACTTTTT GATGGTGTGC CAGAAATCTC TGAGATGCTT
TCAGCCGGCG TGAAGCGTTT TATGGTGGAC GCAACACTTT TGAGTGTGGA GCAGACAAGA
GAGGCCACTT CTCGCGTAGC TGCAGCGATT GAGGCAACAG CGTCGAGTGC ATCATTGCCT
CCACGTCTCA AAGATGCTTC AGTTGGACAT CTTTTCTCGC CAATTGGATA G
 
Protein sequence
MNERNRRERE FTAHELNRMN ELEWTEEASD LEFSSQPLPR ATQMELLAPA GGPAPFAAAL 
AGGADAIYCG LGNNFNARRG ADNFDDESFA RACRQAHLAG ARVYVTVNVV VKWDEMQRVL
RLIRRAWILG ADAFIIQDWG LMAQVRKTWP EIECHVSTQA NIHDTRGVSA CKKLGVGRVT
LSRELTKEEI STISKLGVEL ECFGHGALCF CYSGICHMSS MRGDRSANRG ACAQPCRLPY
ELLNSKHEVV SMGGIDRLLC PKDYCTIDDV PDMIEAGVGS LKIEGRMKAP EYVYSVVSSY
RQAIDAAEKG VDNQADVARR HRLLKRSFNR GLTNAYLHGT AGNKMMSYER SNNRGEVVGE
VTGGRSLEDA LERKSGLNGG RVKLRRYKQA DVDLIAHAPI GKNDLLEIRP LDEPDKFLTA
LCPTDVEPGQ KVTVRTSRVM QTGSTVRIIR SEAARVAAEQ ISSLEYPRKR AVDVTIIARI
GQPFTVVLTT TDGAASASAE GFVVEEARTK AVTSDELIEH VGRMGTSPFE AVSFDVQMDD
ACGMSFSAVH KVRAAACEQL EAALLEEYQD REYKIAPLSR LAYQKEREAQ DQEKLFVFDK
AAAKTNASQA EICVLVETPE QARVALKAGA DRLYATTDVL TDASWPEDLL AKIVPWLDEV
CREIDHNRLD PYVVSGKPIA VGNISELALA VERGAVPEVR ECIPIHNDYA LQALADMDAE
GVWLNSELTL QEICHMARNA SIPVGYMVSG RIRTMTTEHC ILMSTGKCIH DCDACQLRLE
EHTLRGIDND YMPVRTDRHG RSKIWSPKLF DGVPEISEML SAGVKRFMVD ATLLSVEQTR
EATSRVAAAI EATASSASLP PRLKDASVGH LFSPIG