Gene Apar_0317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0317 
Symbol 
ID8413165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp362543 
End bp364099 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content47% 
IMG OID645021884 
ProductCarboxypeptidase Taq 
Protein accessionYP_003179339 
Protein GI257784122 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00666453 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.655151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGCG CAAATAACAC TGTTGAATTA AACGCAGCGC TTTCAAATCC CCGCGATGAT 
ATCGCTAAAC TTGATGAGCT TGAGAAAAAG CTTTTTGCTC TCAACTACGC AGCTAATGAG
ATTGGCTCTT TTGGACCATG TATTGACCCA AAAAAAGCTG CTGAAGAGCG TGGAGAAGCT
CTGGCAATCT TAGGCGAACA GATCCAGGAG ACCCTCTGTG ATCCCGCTGT TGGTGCACTT
CTTGATAGAC TTCACGGGAA CCGCGCTCTG CTTGATGAGA CTCATCGTGC ACAAGTAAAA
ATTCTACGCC GTGACAGAAG CCAACTGGTA GATGTACCTG TTGAACTTCA AAGTAACTTT
GTTCGTCTAA CTACAGAAGC TAACGAGGTC TGGGAGCAGG CAAAGAACAA TGATGACTGG
TCACTCTTTG AACCAAAACT GGATAGCTTA ATTGAGCTCC GCAAAGAAAT GTGCCAGGCA
CGCGATGCTT CAAAAGATCC ATACGATCTC TTGCTCAGCG ATTTTGAGCA CGATACCAAC
AGAGAGTTTT ACAACACTTT CTTCAATAAT GTTAAAGAAG TAGTAGTACC ACTTGTTGCC
GATTGCATGG CTTCTAAGCG TCAGCCAAGC ACCAAACCTC TTGAGGGCAA GTTTGACGTC
AATCGTCAGT GGGACCTTGC AAAAGACCTG GTAAAACTCC AGGGTCTTGA CGAGGATGCC
TACTGGCTTG GCAAAACCGA GCATCCTTAT ACAGGAGGTC CTGGTATTGG CTTTGTTATG
GTTGCAAGTC ATGCTTACGA GAATAACGTT CTCTCCAATG TCTATTCCAT GCTGCACGAG
AACGGTCATG CCCTTTACGA ACAAGGTATT AACTGGGAGT ACCGTTTTAC CTCTCTGAGC
ACCGGCACTT CCATGGGCAT GCACGAGTCT CAATCTAGAT TCTTTGAGAA CTACGTCGGC
CTTTCTGAGG CATTTGCCGA ACCACTGATT CAGCTTATGC GTAAGCACTT CCCTGGTCAG
CTTAATCGTG TCACTGCCTT CCAGCTTTTC TCGGCAGCTA ACAAGGTGCA GCCAAGTCTC
ATTCGTACTG AAGCAGACGA ACTCACCTAT CCTCTGCACA TCCTTATCCG TTATGAGATG
GAGCAAGCAC TCCTATCTGG CGAAATTACC GCAAAAGATG TCCCAACGCT TTGGGCCGAG
AAGTACAAGC AGTACCTGGG TGTTACGGTT TCAAACAACA CCGAAGGTGC TCTTCAGGAT
GTTCACTGGT CATGGGGTGA GTTTGGATAC TTCCCAACCT ATGCACTTGG CAGCGCTTAT
GGTGCACAAT ACAAACACGC TATGATTGCT GAAGGCATGG ATTTTGATGC CGTCTGCGCA
TCTGGAGACC TTACTCCTAT CAGAGAGTGG TTAGGTAGTC GTATTTGGAC CTGGGGTCGC
GCAAAGGACT CCAAGGAACT CATCTTAGAC GCATGCGGTG AACCTTTTGA CGTCCACTAC
TACACAGAAT ATCTGACTGA CAAATATTCT CGCATTTATG GTCTAACCAG TGCATAA
 
Protein sequence
MDSANNTVEL NAALSNPRDD IAKLDELEKK LFALNYAANE IGSFGPCIDP KKAAEERGEA 
LAILGEQIQE TLCDPAVGAL LDRLHGNRAL LDETHRAQVK ILRRDRSQLV DVPVELQSNF
VRLTTEANEV WEQAKNNDDW SLFEPKLDSL IELRKEMCQA RDASKDPYDL LLSDFEHDTN
REFYNTFFNN VKEVVVPLVA DCMASKRQPS TKPLEGKFDV NRQWDLAKDL VKLQGLDEDA
YWLGKTEHPY TGGPGIGFVM VASHAYENNV LSNVYSMLHE NGHALYEQGI NWEYRFTSLS
TGTSMGMHES QSRFFENYVG LSEAFAEPLI QLMRKHFPGQ LNRVTAFQLF SAANKVQPSL
IRTEADELTY PLHILIRYEM EQALLSGEIT AKDVPTLWAE KYKQYLGVTV SNNTEGALQD
VHWSWGEFGY FPTYALGSAY GAQYKHAMIA EGMDFDAVCA SGDLTPIREW LGSRIWTWGR
AKDSKELILD ACGEPFDVHY YTEYLTDKYS RIYGLTSA