Gene Apre_1213 details


Gene Information

Locus tag: Apre_1213
Symbol:
ID: 8398002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1292196
End bp: 1295153
Gene Length: 2958 bp
Protein Length: 985 aa
Translation table: 11
GC content: 39%
IMG OID: 644995558
Product: glycosyl transferase family 51
Protein accession: YP_003152958
Protein GI: 257066702
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5009] Membrane carboxypeptidase/penicillin-binding protein
TIGRFAM ID:
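
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported 2958 bp) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1292196, 1295153)
print(length)                  # 2958
print(protein_length(length))  # 985
```

Both values agree with the Gene Length and Protein Length fields of the record.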


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATAATA AACTTGCCGC TCTTGTTGGA AAAATAATAC TAGTTGTAAT ATTATTTGTA 
GCTGGCCTTG GAGTTATTTT TGCTGGCATG ACTGGTGGAG CAATACTAGA AGTAATGAAG
ACTGCTCCCA AGATTGACGC TAACAGTATA AAATACGAAA TGAGCCAAAA CTCAACAATA
GTGGATGAAA ACGGAAACGA AGTCGATTCG ATTGCTACAA GCGAATACAG ACAGATAGTA
GATTATAAAG ACATACCAGA GAATTTAAAA AATGCATTTG TCGCAGTTGA AGATGAAAGA
TTTTATAAAC ACAATGGTAT AGATCCCCTA TCAATCATAG GTTCTGCTTT TGAAAATATG
AAGGCAGGGT CAATCGTAAG GGGTGGATCT ACTATAACCC AACAGCTTGC AAGAAACACC
TACCTATCTA ACGATCAAAC CTACGAAAGA AAGATAAGGG AGATTTATCT TGCCCTAGAG
ATTGAGAAAT ATCTCGAAAA AGACGAAATC CTAGGAGCCT ATCTAAATAG GGTCTTCATG
GGACAAAACT CTTACGGAGT CCAAGCTGCA GCTAAAACTT ACTTCAATGA GGACGTGTCC
GAGCTAAATC TTGCCCAATG TGCATCCCTA GCTGGAATCG TCCAATCACC TTCAGAAAAC
TCTCTTTATA AGTCAATTAA GACGAGTGAG GTAACAGATC AAAGAGTCCT AGGTGAATTT
AGTATAGACG GAAATAAATA CTCAGCAGTA TATAACGAAG CTCCTTACAA AAGAGAAGAA
TACGTCCTAG ATAAGATGCT AGAAAACGGC TACATCAACG AGACTCAATA CAAAGAAGCC
AGAGACTTTG ATGTGGCAAG CACAGTAGAG CCTGCGGAAA GGTCTAATAC AGAAATCGCT
AGCTACTTTA ACTCTCTTCT AGAACGTCAA GTAGTTAATA AGCTTATGGG TGTATTAAAT
ATTTCAGAAA ACCAAGCCTG GGATAAGCTC TACTACGGAG GGCTTAAGAT TACGACGACC
CTAGATAAGG GCTTGCAAGA AAAACTTGAA GATATCTATG CCAATTTTTC AGAGCACTTG
ATAGGAAATA GCGAAGGCCT AGGCTACGCT CCCCTACTTG ATTTATCTTA TGATAATTGG
GGAAATATAG TAAATTCTAG TGGGGCCTTA CTCTATTATA AGAGAGCAAA TCTCCTAGAT
GAAAATAACG ACCTTCACTT AAGTTCTGAT GAGGCTTGGA ATGATGAGGC AGGAGACCTA
ATACTTGCGA CAAATAAGGC CTACCTTGAC CAAACCAAGC TTATCTTTAA GGATTTCTAT
TCCTTGGATG AAACAAACTC CAACCTTAGA ACTCACAGGA CTGGACGAAT AGAGTTCGAG
TCCAACGAAG ATATCTATCA GGATGAGGAT AACAATATAG TAATTAGTAA GAAATATCTA
GATAATAACC CAAATCTTTT TACAGCCTAT GACGATGGAT CTATTAGCCT TAACAAGGAC
TATTACGATA TAGACCTAAA CGGGGTAATC CAACCTCAAT CATCATCTGT AATAATCGAT
CAAAAGACTG GCCATATCAA GGCCTTGATG GGAGGACGTG ACCAAGAAGG AATCAATATC
TTAGACAGGG CATCAAGCGT TCCAAGACAA CCTGGTTCAT CAATCAAGCC TATAGCAACT
TATACAGCTG CCCTAGATCA TGGCTTTAAC CTCGCAACTG GTGTAGATGA TGTTCCATTC
GAGATGAACG AAAATGGCGA AGCTTGGCCT GTCAATGTAT ATGGCTACTA CATGGGTTAC
ACCCCTATAA GAGATGCCAT CAAGATGAGT ATAAATACAA TTGCAGTTTC TACCCTCAAT
AAGGTAGGAA TCGACACTAG TCTTGAATAC CTCAAAAACT TCGGTCTAAT CAAAGAAAAT
GGCAGAGATT ATTTCGTGAC AAAAGACGAA AATCCTGACA CAAACGACGA AAACCTCGCA
GCCCTAGGAG TGGGTGCTAT GAGTCATGGT CTTACCACAC TAGATATGAC TGCAGCCTAC
GCAGCCCTTG CCAATAAGGG TGAATACACT GAACCTTTGA CCTTCTCAAA GATAACCGAC
TCCCAAGGAG AAGTAATCTT TGATGAGGAT AATCTAATAA AACACACAGT AACAAGCCCA
GAGACAGCCT ATCAAGTGAC CTCTGCCCTA GAAAGTGCAG GAGAATATTA TGGAAATATC
CACCTAAACG GAACAGATTA CGCAACAAAG ACCGGAACAA CTGATGATAA GACAGACTTC
TGGTGCGTTG GATATACTCC TTACTACACA GTTGGAGTTT GGATGGGAGC AGATAATCAA
AATATCCACC TAAACAGCAA CAGTGTAGAT AGGGCAGCCC TCATGTGGAA TGTTATAAAT
ACAGAAATCC TTGCTGACAC AGAGCCTGTT AGTTTTGAAG AACCTGAAGG CATAAGGCAT
ATGGAAGTAG ATACCATAAG CGGTAAGCTT CCAACAGACG CCTCAAGGGC AGATCCACGT
GGCACTGTAA AAGAAGAAAT CTTCGGTCCA GAAAACTATC CAAAAGAAGA AGACGACATG
CACAAGTGGG CTTATATAGA TTCTAGAAAT AACCTATTAG CTAGTGATGT CACACCAAAA
TTCTTGATCC AAACTAGGTC CTTAATAGTT GAAAAGAACG AATATGATCC AAACAAGTTT
AACGGAATTA TTCCAAGAGA CTGGGATTAC AGGATGCCAA CAGTCTACTC AAACTTAATC
TATACTCCAC CAAAAGTGGA AAACAATAAA GATAAAGACA AAGATAAAGA TAAAGACAAA
GACAAAGATA AAGATAAAAA TAAAAATGGC GAGAAAGAAA ATGAAAGTAA TAGTGGAACT
AATAATTCCA CAAGTGACGC GCTTGATAAT TTATTAAATA AGGAATCTAC AGAAAATAAT
TCTGGATTCG GTTACTAG
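
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any length or GC calculation. A minimal sketch of that clean-up, assuming the text contains only A, C, G, T and whitespace; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGCATAATA AACTTGCCGC TCTTGTTGGA AAAATAATAC TAGTTGTAAT ATTATTTGTA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_percent(seq))   # GC percentage of this first line only
```

Applied to the full sequence, the same two calls could be compared with the Gene Length and GC content fields of the record.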
 
Protein sequence
MHNKLAALVG KIILVVILFV AGLGVIFAGM TGGAILEVMK TAPKIDANSI KYEMSQNSTI 
VDENGNEVDS IATSEYRQIV DYKDIPENLK NAFVAVEDER FYKHNGIDPL SIIGSAFENM
KAGSIVRGGS TITQQLARNT YLSNDQTYER KIREIYLALE IEKYLEKDEI LGAYLNRVFM
GQNSYGVQAA AKTYFNEDVS ELNLAQCASL AGIVQSPSEN SLYKSIKTSE VTDQRVLGEF
SIDGNKYSAV YNEAPYKREE YVLDKMLENG YINETQYKEA RDFDVASTVE PAERSNTEIA
SYFNSLLERQ VVNKLMGVLN ISENQAWDKL YYGGLKITTT LDKGLQEKLE DIYANFSEHL
IGNSEGLGYA PLLDLSYDNW GNIVNSSGAL LYYKRANLLD ENNDLHLSSD EAWNDEAGDL
ILATNKAYLD QTKLIFKDFY SLDETNSNLR THRTGRIEFE SNEDIYQDED NNIVISKKYL
DNNPNLFTAY DDGSISLNKD YYDIDLNGVI QPQSSSVIID QKTGHIKALM GGRDQEGINI
LDRASSVPRQ PGSSIKPIAT YTAALDHGFN LATGVDDVPF EMNENGEAWP VNVYGYYMGY
TPIRDAIKMS INTIAVSTLN KVGIDTSLEY LKNFGLIKEN GRDYFVTKDE NPDTNDENLA
ALGVGAMSHG LTTLDMTAAY AALANKGEYT EPLTFSKITD SQGEVIFDED NLIKHTVTSP
ETAYQVTSAL ESAGEYYGNI HLNGTDYATK TGTTDDKTDF WCVGYTPYYT VGVWMGADNQ
NIHLNSNSVD RAALMWNVIN TEILADTEPV SFEEPEGIRH MEVDTISGKL PTDASRADPR
GTVKEEIFGP ENYPKEEDDM HKWAYIDSRN NLLASDVTPK FLIQTRSLIV EKNEYDPNKF
NGIIPRDWDY RMPTVYSNLI YTPPKVENNK DKDKDKDKDK DKDKDKNKNG EKENESNSGT
NNSTSDALDN LLNKESTENN SGFGY
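
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). The table is built from the conventional TCAG-ordered amino-acid string; `translate` is an illustrative helper, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGCATAATA AACTTGCCGC TCTTGTTGGA"))  # MHNKLAALVG
print(translate("TCTGGATTCG GTTACTAG"))               # SGFGY
```

The two outputs match the first ten and last five residues of the protein sequence listed above.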