Gene Apar_0463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0463 
Symbol 
ID8413312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp532459 
End bp533583 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content49% 
IMG OID645022031 
Productpeptidase M24 
Protein accessionYP_003179485 
Protein GI257784268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00525602 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATT TACAGGCGGC CGCAAAGCGC GTTGCGCGTT TTCGCGAGGT AATGGCTCAG 
AGGGGTTACG ACGCTGTTGT CTTGCGTCAC AATCCAGACC TTCGTTGGTT GACTGATGCT
GAGCGTACCT TTGACTTTGA ACAGGCACAT ACTGCATTTA TTACGCAAGA TGCACTCTTC
TTACACACTG ACTCTCGTTA CTACAACACA TTCCTGGAGC GCCTTGGCAC TGACTCTCCA
TGGAAGTTTG ACCAGGAAGC TACTACACCT ACCGAGTGGG TTGCTGCACA TGTTGCTGAG
GCTCGTGCTC GTGTTGTTGC TATTGAGGAT ACCGTTGACC TTGCTTTCTT TGATGGGCTA
GAGCAGGCGC TGCGCAATCG TTCAATTGCT GCTCTGCTTC CACGCATGCA TGGTGATATT
GCTGAGTTGC GTATTGTCAA AGACCCAGCA GAGATTGAGC TTATGAAGCA TGCTCAGTCA
ATCACTGATA AGGCATTTCT TCACATCTGT GAGTACATCA AGCCAGGCCT CACTGAGCAG
CAGATTCGTG CAGAACTTGA GAATTACATG CTCTCTAATG GCGCAGATGC TCTGTCCTTT
GATTCCATCA TTGCTTCTGG CCCTAACGGT GCTAATCCTC ACGCACAGCC AGGCGAGCGT
GTGGTTCAGA CTGGCGACAT GATTGTTATG GACTACGGTG CGGGCTACTT GGATTACCAC
TCAGACATGA CCCGTACGGT TGTTGTTGGT GCACCTTCTG AGGAGCAGCA GCATGTCTAC
GATGTTGTTC GCAAGGCAAA TGAGACTTGC GCTGCAGCTA TTCATGCAGG CGTAACCGGT
TCTGATATTC ATAATCTTGC AGTTAAGGTT ATCTCTGAGG CTGGTTACGG TGAGTATTTT
GGACATGGCC TTGGTCATGG TGTTGGTGTT GAGATCCATG AGCGTCCATT CTTTAACCCT
CGTTGGAATA AGGTTATTGC AGCAGGTTCT GTTGTTACCG ATGAGCCTGG TATCTATCTA
CCTGGTAAGT TTGGTATCCG TCTTGAAGAT TTTGGTGTTG TTACCGAGGA CGGCTACGAT
GTCTTTACTC AGTCCACACA CGACCTTGTG TCTGTTGGTT GCTAA
 
Protein sequence
MADLQAAAKR VARFREVMAQ RGYDAVVLRH NPDLRWLTDA ERTFDFEQAH TAFITQDALF 
LHTDSRYYNT FLERLGTDSP WKFDQEATTP TEWVAAHVAE ARARVVAIED TVDLAFFDGL
EQALRNRSIA ALLPRMHGDI AELRIVKDPA EIELMKHAQS ITDKAFLHIC EYIKPGLTEQ
QIRAELENYM LSNGADALSF DSIIASGPNG ANPHAQPGER VVQTGDMIVM DYGAGYLDYH
SDMTRTVVVG APSEEQQHVY DVVRKANETC AAAIHAGVTG SDIHNLAVKV ISEAGYGEYF
GHGLGHGVGV EIHERPFFNP RWNKVIAAGS VVTDEPGIYL PGKFGIRLED FGVVTEDGYD
VFTQSTHDLV SVGC