Gene Apre_0015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0015 
Symbol 
ID8396762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp17737 
End bp19923 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content39% 
IMG OID644994352 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003151791 
Protein GI257065535 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000973096 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGA AAAGAGTTTT TTCTTCATTA ATGGCACTTG GACTAGCTAG TACTCTAGTT 
GCTTGTGGCG GTGGAGAAAA TAAAGACAAC AAAGCTGCTA ATAATGACGC AGACACAAAA
GCTGAAGATA AAAACAAAGA CGATAAAGAT GCAAAAGAAA GTGAAGGAAC CGGCGAAGAA
GGTGCGGAAA ACTTCGAATC ACAAACATCT GACGATACTC TAGTAGTAGG TCTTGGTGAG
CTTAACGGTG ACTTTCTACA AGGATGGACC AACAACTCTG GAGATGTTAA AGTTAGAAAA
TATCTTGGTA TCGAAGGAAA CAACGGCTAC CAAACAGTTG TTCAAGATGA ATCAGGAGCT
TGGGTAAACA ACACTGCAGT TCTAGATGGA GAGCCAGTAT CTCAAGATAA CGAAGATGGT
TCAAAGACTG TTACATTCAA GATCAAGAAA GACCTTAAAT GGTCTGATGG CGAACCAATC
AAGGCAGATG ATTACTTATT CATGTCTCTT CTACACACTC ACCCAGAATA TACCAAACTA
ACAGGCTCTA CAAGCCCAGG ATCTGACTCA GTTAAGGGAT ATGAAGCCTA TAAGAAGGGC
GATAGCGATG TATTTGAAGG TCTAGAGAAG GTAGATGATT ACACATTTAA GATTACAATA
GATGCATCAT TCCTTCCATA CTTCGAAGAA GCGTCTCTAC TAGCTATTCA ACCACACCCA
ATGCACTATC TAAACGAGAA CCTAGCTCTT TCTAAAGAGG GTAACAAGCT TGTTGCTAAA
GAAGGCTATA AAGTTAGTGA CGAAGAGAAA GAAAACTATG TAAAGAACCT GGATGAACAA
ATCAAAAAAC AAAATGAAGA CTTTGAGGAA AACAATCCAG CTCCAGCAGA TGATGCAGCA
GAAGAAGATA AGAAGGCTTA CGAAGAAGCT AAAAAAGAAC ACGAAGATGC AATCAAAGAT
CTAGAAGAAC GTAAAGCAGG AGATGTAGAT CCTACTCAAC AACTTATCGA TGAAGCTATG
CTTAAGGAAG TTAATGAATA CAGACTTAAT CCAGCAGTTG TATCAGGACC TTACAAGTTT
GAATCATTTG AAAACAACAT GGTTAAACTA AGCCTAAACG AAAACTATGT TGGAAACTTC
AAGGGTGATA AGGCTACAAT TCCTAACGTA ATCCTTCAAA CTGTAAACAA AAATATTGCT
GTAGACTTAC TTGAAAATGG AGATATCGAC CTTTGGGAAG AAGAATCTGA AGGTGGTCCA
ATCGACAGAA TGAGAGAAGC TGCTGATAGT GGAAAGATCG GTGGATATAA CACATTCGAA
AGAAACGGTT ATGGTAACGT AACATTCCTA ACAGACAGAG GAAGTACAAA ATACAAAGAA
GTTAGACAAG CTATAGCTCA CCTAATGGAT AGAAACAGCT TCGTACAATC CTTCGCGGGT
GGATACGGTG TTGTAACTAA CGGTATGTAC GGTAGCAGCC AATGGATGTA TAAGGAAAGA
GGAGCAGATC TTGAAGGTAA GTTAATCAAC TACCAAATGA ACTTAGATCA AGCTAATGCC
CTTCTCGACA AAACACCTTA CAAGTTCGAG TCTGATGGAA CTACACCTTG GGATAAGACT
AAGGCTGATG AAGCTTTCGC ATCTAACCCA GATGGATTTG ACTATTATAG ATACGATGAA
AATGGTAAGA AGCTTGTAGT TAACCAATAC GGTTCTGATG AATCACCAAT TACAACATTA
ATCTCTAACC AAGTACCAAA CAATGCTAAG CAAGTTGGTA TGGAATACAA TGTTACAGCT
GGTTCATTCG CAACTCTTCT AAACTACTAC TACTATCCAG AAGAAGACCC AGAATATACA
GTATTCAATA TGGGTACAAA CTTCGGTACA CCATTTGACC CATGGTACGC TTATAACTCT
GAAGGACCTT ACAACTATAC TAAGACTAAT GATCCAAAGG CTGATGAGTT GACAGTTAAA
CTACGTAAGA CACCTGCTGA TAAGAAGGAT GAATACCTAG ATAACTGGGA AGAATTCCAA
ATCTGGTACA ATGATTACCT ACCAGAAATC CCACTTTACG CTAACCAATA TCACACAGGT
TACACAAAGA GAGTTAAGGG ATTCGATGTT AATACACCAG TATGGCAATC AGAAGATCAA
ATAAACGCTC TTAGTCTAGA AAACTAA
 
Protein sequence
MKMKRVFSSL MALGLASTLV ACGGGENKDN KAANNDADTK AEDKNKDDKD AKESEGTGEE 
GAENFESQTS DDTLVVGLGE LNGDFLQGWT NNSGDVKVRK YLGIEGNNGY QTVVQDESGA
WVNNTAVLDG EPVSQDNEDG SKTVTFKIKK DLKWSDGEPI KADDYLFMSL LHTHPEYTKL
TGSTSPGSDS VKGYEAYKKG DSDVFEGLEK VDDYTFKITI DASFLPYFEE ASLLAIQPHP
MHYLNENLAL SKEGNKLVAK EGYKVSDEEK ENYVKNLDEQ IKKQNEDFEE NNPAPADDAA
EEDKKAYEEA KKEHEDAIKD LEERKAGDVD PTQQLIDEAM LKEVNEYRLN PAVVSGPYKF
ESFENNMVKL SLNENYVGNF KGDKATIPNV ILQTVNKNIA VDLLENGDID LWEEESEGGP
IDRMREAADS GKIGGYNTFE RNGYGNVTFL TDRGSTKYKE VRQAIAHLMD RNSFVQSFAG
GYGVVTNGMY GSSQWMYKER GADLEGKLIN YQMNLDQANA LLDKTPYKFE SDGTTPWDKT
KADEAFASNP DGFDYYRYDE NGKKLVVNQY GSDESPITTL ISNQVPNNAK QVGMEYNVTA
GSFATLLNYY YYPEEDPEYT VFNMGTNFGT PFDPWYAYNS EGPYNYTKTN DPKADELTVK
LRKTPADKKD EYLDNWEEFQ IWYNDYLPEI PLYANQYHTG YTKRVKGFDV NTPVWQSEDQ
INALSLEN