Gene Apre_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1681 
Symbol 
ID8398493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1827869 
End bp1829749 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content41% 
IMG OID644996044 
Productoligopeptide transporter, OPT family 
Protein accessionYP_003153422 
Protein GI257067166 
COG category[S] Function unknown 
COG ID[COG1297] Predicted membrane protein 
TIGRFAM ID[TIGR00728] oligopeptide transporters, OPT superfamily
[TIGR00733] putative oligopeptide transporter, OPT family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0712144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAAAG AAAAAGAAAT TGAGAAACTT AGAGAATTTA CTCCCTTGGC TGTAGTCTTG 
GGAGTTATAA TAGCCATAGT ATTTGGTGCT GCCAATGCTT ATTTGGGACT TCGTGTAGGT
CTTACCATAT CTGCATCAAT ACCAGCGGCT GTTATTTCTA TGGGTATAGT TAGAAAAATC
TTAAAGCGTG ATTCTATACT CGAAAACAAC CTCGTTCAAA CAATAGGATC TGCGGGAGAG
TCTCTTGCAG CAGGAGCCAT CTTTACCCTT CCAGCAGCCT TCTTGTGGGA AGCAGAGTGG
GGAGATAGTC ACTATATATC GTATTTAAAT ATTTTGATGT TGACCTTGAT TGGGGGAGTA
TTGGGTATTA TCTTTATGAT TCCTCTAAGA AGGGCACTTA TCGTTAAGGA AGATGGAATC
TTACCATATC CTGAAGGAAG AGCCTGCGCA GAAGTACTTA AGGCAGGAGA GGCAGGAGGA
CAAGACTCAA GCGTAGTATT TAAGGGCTTG GGCCTTGCTT CTATCTATAA GTTTTTGGCA
AATGGACTTA AGGTCTTCCC AGAAGGGGTA AGCTATGAAA TATCTACTAA AAACTTTGGA
GGAACTGCCC TTGGTTTCGA TGCCCTTCCA GCCCTTATGG GTGTAGGCTA TATCGTAGGG
CCTAAAATCG ACGCTATAAT GCTTTCAGGA GGAATCCTTG CCTGGCTTGT ACTTATGCCA
CTGCTTCACG CCTTTGGCCC AGCAGAGATA GCAAGTCTTA GCCCATCTGA CCTATGGTCA
AACTACATCA GATATATAGG AGCAGGAGCT GTTGCTACTG GAGGAATTAT ATCTTTGATC
AAGTCCCTTC CTATGATTAT TAAATCCTTT AAGGATTCTA TTAAGGACCT TAAGGGCAGA
GATTCATCCC AGAGTAAGGA TAGATCTGAT GCTGATATTT CTATGAAGAC ATCCATAATT
CTTGTAATCA TTGCAATAGT ATTGATGTTT ATGATGCCTT CTTCACCACT GAACTTCTTT
GGTGCCTTGA TTATAGTAAT ATTTGGTTTC TTCTTTGCTA CAGTTTCATC AAGGATGGTA
GGAATAATAG GATCTTCAAA TAACCCTGTA TCAGGAATGT CCATAGCGAC TCTCTTAATT
GCAACCCTTC TTCTAAGACT AACAGGCTTT GTAGGCCATG ACGGAATGAT AGCGGCGATA
TCTATAGGAA CTATAATTTG TGTAATAGCT GCTATAGCAG GAGACTGCTC ACAAGATTTA
AAGACAGGTT ATATAGTAGG AGCAAGCCCA AGATACCAAC AAATCGGCGA GCTTATAGGA
GTTCTCGCCT CATCCCTTGC CATAGGTGGA GTTTTGTGGA TACTAAACAA ATCTATAGGT
TTTGGAACAA AGGACCTTCC AGCTCCTCAA GCCATGCTTA TGAAGATGAT AGTAGAAGGA
GTTATGAACA ATGACCTTCC TTGGAACTTG GTATTTGTAG GAAGCTTTAT AGCTATTATG
GTAGAGCTTT TGGGAGTAAC AGTCCTACCT TTTGCTATAG GCCTTTACCT ACCTATCAAC
ACATCACTTG GAATAATGTT CGGTGGTCTT GTAAGAATCG CTGTAGATAA GATCAAGGCA
AGTAAGGAAG AGAAAAAGGA TGCAGAGACA AGAGGAACCC TCTACTCAGC AGGTCTTATT
GCTGGAGAAG GAATCATGGG AATAATTCTA GCAGTCTTTG CCCTAATTCC TGTCAAAGGC
AAGACTCTGG CAGATCTTAT CAATATCTCT GATAAATTTT CCTTAAGCCA AGAAGCCTCT
GTAGTGATAT TTGTCCTACT AGGCATACTA ATCTATAGCA AGGCAAGAGG AGCCTTAAAG
AAGGGCAAAA ATGAAGCTTG A
 
Protein sequence
MKKEKEIEKL REFTPLAVVL GVIIAIVFGA ANAYLGLRVG LTISASIPAA VISMGIVRKI 
LKRDSILENN LVQTIGSAGE SLAAGAIFTL PAAFLWEAEW GDSHYISYLN ILMLTLIGGV
LGIIFMIPLR RALIVKEDGI LPYPEGRACA EVLKAGEAGG QDSSVVFKGL GLASIYKFLA
NGLKVFPEGV SYEISTKNFG GTALGFDALP ALMGVGYIVG PKIDAIMLSG GILAWLVLMP
LLHAFGPAEI ASLSPSDLWS NYIRYIGAGA VATGGIISLI KSLPMIIKSF KDSIKDLKGR
DSSQSKDRSD ADISMKTSII LVIIAIVLMF MMPSSPLNFF GALIIVIFGF FFATVSSRMV
GIIGSSNNPV SGMSIATLLI ATLLLRLTGF VGHDGMIAAI SIGTIICVIA AIAGDCSQDL
KTGYIVGASP RYQQIGELIG VLASSLAIGG VLWILNKSIG FGTKDLPAPQ AMLMKMIVEG
VMNNDLPWNL VFVGSFIAIM VELLGVTVLP FAIGLYLPIN TSLGIMFGGL VRIAVDKIKA
SKEEKKDAET RGTLYSAGLI AGEGIMGIIL AVFALIPVKG KTLADLINIS DKFSLSQEAS
VVIFVLLGIL IYSKARGALK KGKNEA