Gene Apre_1251 details

Gene Information

Locus tag: Apre_1251
Symbol:
ID: 8398040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1341665
End bp: 1343581
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 41%
IMG OID: 644995596
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_003152996
Protein GI: 257066740
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
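
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1341665, 1343581)
print(length)                  # 1917
print(protein_length(length))  # 638
```

Both values agree with the Gene Length and Protein Length fields of the record.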


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGATA CAAAATTAAG AGATATCTTG GACTATGAAT TCTTATCTAG TCTTGATATT 
TCAAGTGACT ATAGAAAAAT TTCCTACAAA AAAACAATAG GAAATTACAA GGAAAACAGA
TACGACAGCA ATTTATGGAT TTATGACACC GAAACTTGCG AAAACTATCA AATTACTGAT
GATAAGAAGG CGACAATTTC TGCCTTTAAC AAGGATTCTA ATCTAGTTTA CAAGAAAGAA
TCCACAGATG AGGCTGATAT TTTCTATGTC AAAGACGGTA CAGGTCTAGG CCATGAGTAC
TTTTCTATAG ACAAAGATGT AGATATGATA AAGCACTTAG GAGGCGACCT CTTTCTAGTG
AAAGCAAAAG AAAAGAAAAG CAAGGAAGAT AAGGAAAAGG ACAAGGAAAA CTCCTACTCT
AAGGAAATCG ACAAGCTTCC TTTCTACCTA AATGGCGCAG GCTTTATCAA GGATGAGGAC
TCTGCCCTAT ATTTTTATGA CGCATCAGAA GATAGGCTCG AACTTATCAA GGACTTTAAG
GCGGAAGATA AGCTAAGCTT TGTCGATATC AGTAAGGATT CTAGCAAAAT CCTCCTCCTT
AGGGGTAATT TCACAGATAA TTCTGTAATG GAGCTTAAGG AAGACCTCCT CCTTCTCGAT
ACAAAATCAG GGGAAATGAC CCTCCTAATC GAGAACGAAT TCTCCTACTA TACTGCAAGA
TTTATCGAAG ATAGGATAAT CTTTGTAGCG ACCGATATGA AAAAGGGCGG GGTCAATGAA
GATTGTTTTA TTTATTCATG TGACTTTGAG GGAGCTTATA AGAAAATTAG CCCAGACGAT
TTCGATATGG CCTTTGGTAA TTCCATAGGT ACTGATGCAA GATTCGGATC TTCTAGGACC
TTCGATGTAA AGGGCGACAG GCTATATTTC GTCGTAACCG ACTATGAAAA GTCCAAGCTC
TTATCCATAA GTCTTGCTGG AGATATCAGA GAAGAAATCT CAGAAGGCGT TGAAGACTTC
GTCCTAGGAG ATGATGATAT CTACTACCTT GCAATGGGAG TTGATACTCT TTCTGAGCTT
AAGAAAAAGT CTACAGGCGA AACTCTTATA GCAAACAAGG TTCCTTCTGA AGTCCACCCT
ATCGAAACTT TTGACTTCGT ATCAAATGGC GATGAGCTTA CCGGCTACGT CCTCCTTCCA
AAGGACTTCG ATAAGAAGAA GAAATACCCA ACCCTTCTTT CCGTCCATGG TGGACCAAAG
ACAGAGTTTT CTGACATCTT CCACCACGAG CACCAGATGT TTGCATCAGC AGGTTACATT
GTAATTTACA CCAACCCACA CGGTTCAAGT GGTAGAGGAG TCAAGTTCTC CGACATCCGT
GGCAGATACG GAGATATTGA CTACGATGAC CTTATGACCT TTACCGACCT TGCCATAGAA
AAATACCCAC AAATCGATAC AGAAAAAATG GGAGTCTATG GCGGAAGTTA CGGTGGTTTT
ATGACAAATT GGACCATAGG CCACACCGAC CGTTTCGCGG CAGCTTGTAG CCAAAGATCT
ATCTCAAACT GGACAAGCTT TTATGGAGTA TCAGACATAG GCTACTACTT CGCTCCTGAC
CAAACAGCAA GCGATATGTG GGATAATCTC GACAAAATGT GGGACCAATC TCCAATCAAA
TACGCCCCAA AGGTCACGAC CCCAACCCTC TTCATCCACT CTGATGAAGA CTATAGGTGT
CCACTAGAGC AGGGGCTTCA AATGTATACG AGAATCAAGG AAAATGGCAC AGATACTAAG
ATGTACATCT TCCATGGGGA AAATCACGAA CTATCTCGAT CTGGAAAACC AAAGGGCAGG
ATCAAGAGAC TAGAAGCAAT CAAAGAATGG TTTGATAAGT ATCTCAAAGA TGAATAA
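
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in blocks of ten bases. The sketch below is one possible way to do it in Python, assuming GC content means the fraction of G and C bases over the total length. The helper names are hypothetical, and only the first line of the gene sequence is used as a short example, so its value need not match the figure for the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short example.
first_line = "ATGAAAGATA CAAAATTAAG AGATATCTTG GACTATGAAT TCTTATCTAG TCTTGATATT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # 25% for this 60-base fragment only
```

Passing the whole gene sequence to `clean_sequence` and then to `gc_fraction` gives the value for the full gene.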
 
Protein sequence
MKDTKLRDIL DYEFLSSLDI SSDYRKISYK KTIGNYKENR YDSNLWIYDT ETCENYQITD 
DKKATISAFN KDSNLVYKKE STDEADIFYV KDGTGLGHEY FSIDKDVDMI KHLGGDLFLV
KAKEKKSKED KEKDKENSYS KEIDKLPFYL NGAGFIKDED SALYFYDASE DRLELIKDFK
AEDKLSFVDI SKDSSKILLL RGNFTDNSVM ELKEDLLLLD TKSGEMTLLI ENEFSYYTAR
FIEDRIIFVA TDMKKGGVNE DCFIYSCDFE GAYKKISPDD FDMAFGNSIG TDARFGSSRT
FDVKGDRLYF VVTDYEKSKL LSISLAGDIR EEISEGVEDF VLGDDDIYYL AMGVDTLSEL
KKKSTGETLI ANKVPSEVHP IETFDFVSNG DELTGYVLLP KDFDKKKKYP TLLSVHGGPK
TEFSDIFHHE HQMFASAGYI VIYTNPHGSS GRGVKFSDIR GRYGDIDYDD LMTFTDLAIE
KYPQIDTEKM GVYGGSYGGF MTNWTIGHTD RFAAACSQRS ISNWTSFYGV SDIGYYFAPD
QTASDMWDNL DKMWDQSPIK YAPKVTTPTL FIHSDEDYRC PLEQGLQMYT RIKENGTDTK
MYIFHGENHE LSRSGKPKGR IKRLEAIKEW FDKYLKDE
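
The protein sequence can be derived from the gene sequence with the codon assignments of translation table 11, the table named in the gene information. The following Python sketch is a simplified illustration, not the database's own method: it assumes the sequence is given in the coding orientation, stops at the first stop codon, and does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# following the codon assignments of NCBI translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGAAAGATA CAAAATTAAG AGATATCTTG"))  # MKDTKLRDIL
print(translate("GATGAATAA"))                         # DE
```

The two fragments reproduce the first ten residues and the last two residues of the protein sequence listed in this record.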