Gene Apar_0953 details

Gene Information

Locus tag: Apar_0953
Symbol:
ID: 8413824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1072779
End bp: 1074029
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 49%
IMG OID: 645022541
Product: peptidase T
Protein accession: YP_003179973
Protein GI: 257784756
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01882] peptidase T
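
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1072779
end_bp = 1074029
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
coding_codons = codon_count - 1

print(span_bp)        # 1251
print(codon_count)    # 417
print(coding_codons)  # 416
```

Under these assumptions the span equals the stated gene length, and the number of coding codons equals the stated protein length.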


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000027828
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACG TTTTAGATAG ATTTATTGCG TATTGCAAGG TTTCTTCTCA GTCCAATCCG 
CTGACTGCAG ATACGGTTCC TTCAACTGAG TCTCAGCATC AGATGGCTGA GGTTGTTGCG
GCTGATCTTC GTGAGCTTGG TGCAGAAAAT GTAACAGTAG ATGAGCATGC TTATGTGGTT
GCTCACTGGC CTGCAAGTAA GGGTTTGGAA GATCTTCCTA CACTTGGTTT CTGTTGCCAT
ATTGATACTG CTTGGCAGTC TTGGGGCAAT CCCGTTCATC CTCAGGTAGT TACCTACGAG
GGCGGCAAGC TGGTGGTTGG TTCAGATCGC GAGGGTAGAG AGGTTTATAT TAGCCCCGAG
ACAAACCCTC AGCTTGAGCA TATGACTGGT TGGCAGCTTG TTACTACTGA CGGTACATCA
TTGCTTGGCG GTGATGACAA GGCTGGTATT GCCATGCTTG TTAGCTTGCT TGCTCGCTAC
AAGGAACATC CGGAGCTTAA GCATCCGCGC ATTGCACTTG CGTTTGTTCC TGATGAGGAG
ATTGGTCACG GTGCGGCCCT TCTGGATCTT GACGCCTTTG GTGCTGTTTA TGGCTACACC
ATTGACGGCG GTCCGTTTGG CGAGTTCTGT TACGAGACCT TCAATGCTGC TGAGGTGTTT
GTTTGCGCTC ATGGACTTTC GGTTCACACG GGAACCGCAA AAGGACAGAT GATTAATGCG
TCTGAGGCAA TCATGCGTTT TCATGAGTTG CTTCCACCTG CGGAGCGTCC TGAGTTCACA
GAAGGTTATG ACGGTTTTTT CTATCTGGAG CGCGTCAATG GTGATTGTGA GTCCGCACGT
GCTGATTACA TTATTCGTGA TCATGACCAG GCAAAAGTTG AGCGTCGTAA GCAGCTGATG
GTTGATGCTG CAGCATATGT AAACAAGCAG ATTGGCTCTG AGGTTCTCTC TGTTGAGATT
CATGATCAGT ATCACAACTT GGCTGATATT GTCTTGAAAC CTGAGTATGC GCACTTAATT
GAGAATGCGC GCATTGCATA CGAGAAGGCT GGTGTACAGA TGACCTGCAT TCCAATGCGC
GGTGGCACCG ATGGGTCTCA GCTCTCCTTT AGAGGATTTC CTTGCGCTAA TCTTTCGGCC
TGCTACTACA ACGCCCACGG CGTTAGAGAG TTTGTTCCTG TCCCAGAGCT TGAGGGTATG
GTTGACATGC TTGAGCACCT GGTAGAGCTT TACACTTATC CACAAAACTA G
 
Protein sequence
MSDVLDRFIA YCKVSSQSNP LTADTVPSTE SQHQMAEVVA ADLRELGAEN VTVDEHAYVV 
AHWPASKGLE DLPTLGFCCH IDTAWQSWGN PVHPQVVTYE GGKLVVGSDR EGREVYISPE
TNPQLEHMTG WQLVTTDGTS LLGGDDKAGI AMLVSLLARY KEHPELKHPR IALAFVPDEE
IGHGAALLDL DAFGAVYGYT IDGGPFGEFC YETFNAAEVF VCAHGLSVHT GTAKGQMINA
SEAIMRFHEL LPPAERPEFT EGYDGFFYLE RVNGDCESAR ADYIIRDHDQ AKVERRKQLM
VDAAAYVNKQ IGSEVLSVEI HDQYHNLADI VLKPEYAHLI ENARIAYEKA GVQMTCIPMR
GGTDGSQLSF RGFPCANLSA CYYNAHGVRE FVPVPELEGM VDMLEHLVEL YTYPQN
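
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal Python sketch of how such a block could be cleaned, translated, and checked for GC content follows. It uses only the first line of the gene sequence as input. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard sense-codon assignment, which translation table 11 shares with the standard code; table 11's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGAGTGACG TTTTAGATAG ATTTATTGCG TATTGCAAGG TTTCTTCTCA GTCCAATCCG"
dna = clean(first_line)

print(translate(dna))  # MSDVLDRFIAYCKVSSQSNP
print(f"{gc_fraction(dna):.2%}")
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Running the same functions on the full gene sequence would give the complete protein and a GC value to compare against the GC content field.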