Gene Apre_1648 details

Gene Information

Locus tag: Apre_1648
Symbol:
ID: 8398460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1790308
End bp: 1792107
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 39%
IMG OID: 644996012
Product: oligoendopeptidase F
Protein accession: YP_003153390
Protein GI: 257067134
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID: [TIGR00181] oligoendopeptidase F
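
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1790308
end_bp = 1792107
gene_length_bp = 1800
protein_length_aa = 599

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear in the protein.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the span equals the stated gene length, and 600 codons minus one stop codon gives the stated 599 amino acids.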


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GAAGTGAAGT TGATGTAAAA GAAACTTGGA AGATCGAGGA TATGTATGCC 
GACGATGATG CCTTCTACGA AGAAATAGGA GAGCTAAAGG CTAAGGCAAG CGAATTTTCT
AAGGAATTTA AGGATATAGA AACAGTAGAA GAAGTATCAA AATCCTTGAG AGAGTATTCA
GAAATTATTG CTATAGTAGA TAAGCTAGGA ACCTTTGCGG GAATTTCTAT GGAAGTTGAT
ACTACAGATG ATCATATGGC CAAAAGATAT GCAAGGCTTG GCAATGAGCT TTCTAAGGTA
TTTTCTGACC TATCTTTCTA CGAGCCAGCT CTTTCAAGAC TCGATAAGAA TATCCTAAGC
CAAGTAGTTG AAAAGAATAA GGATTTTGCC TACTTTATAG AAAGAATAGA AGACAAACAA
GCCCACATGC TATCAGATGA GACAGAAGCT GTCCTTGCAA GTCTCGAGCC TACTTTCGAT
GCCCCATATA CTAATTACAA CGACATGCGT TATGGAGATA TGGACTTTGA AAACTTCGAA
TATGAGGGAG AAGAGATCGT CCTAAACCAC AACACATTCG AAGAATTCTT AGAAACGGAT
CCAAGGACAG ATTTAAGACG TGAATCCTTT AAGAGATACC ACGACGTTCT TAGAAAATAC
CAAAACGCAG ATGCTTCAGT TTATAATACC CAAGTGACAA ATGAGAAAAG ACTATCTGAC
ATTCGTGGCT ATGAATCAGT TTTCCATTAT CTCTTGGCCC GTCAAGATGT CGATTTCGAT
ATTTACGAAA ATCAATTAGA CACAATCATG GAAAAACTTG CTCCTCATAT GAGAAAGTAC
GCAGAAATCA TCAAGAAACA CTACGGTCTA GATGAGATGA CCTACGCAGA CCTAAAGCTT
GCAATAGACC CGGATTATGA ACTTGAAGTT GATATAGACA AGGCTCGTGA ATACATCCTA
GACGGACTTT CTCCCCTAGG AGAGGAATAT GTATCATATC TTGATAAGGC CTTTTCTGAT
AGGTGGATTG ACTATGCCCA AAACACAGGC AAGAGAACAG GAGCCTTTTG TGCAAGTCCT
TATAAGTCCC ATCCATTTAT AATGACGACC TACAACAATT CCATGAGCCA AGTTATGACC
CTAGCCCACG AGTTAGGCCA TGCCTGCCAG GGGATTTACT CAAACGATAA TCAAGAGGCT
CTGCTTTCTG GTATGAGCAT GTATTTTGTA GAATCTCCAT CAACAGCCAA CGAGATAACT
ATGGAAAGAT ATCTCCTAAA CAAGGCAGAG GACGATAGGG AGAAATTGTG GGTACTTTCT
ACAATGATTG GAAAGACTTA TTATCACAAT TTCGTTACCC ACTTCCTAGA AGCAAGTTTC
CAAAGAGATG TTTATCGTGC AGTAGAAAAA GGAGAGAGCC TTTCATCTTC TGACTTTAAT
AGAATCTTTA AGGAAAATCT AGAGAAATTC TGGGGAGATT CTGTAAAACT TGATGAGGGA
AGCGAGCTAA CTTGGATGAG GCAACCTCAC TATTATATGG GACTTTACCC TTACACCTAC
TCAGCAGGCC TTACTATCGG AACAATAATT TCAGATAAGA TTGTAAATGG TACAGACGAA
GATAGGAAAA GATGGATAGA TGTCTTAAAG ATGGGTGGAT CAAAAGGACC AATCGACCTT
GCCAAGGAAG CTGGAGTAGA TATGACTACT ACAAGGCCTC TTGAGGAAGC CATAGAATTT
ATAGGGTCAA TAATCGATCA AATCGACGAG CTTCTAGCAA AGCTTGACAT GTATAAGTAG
 
Protein sequence
MKKRSEVDVK ETWKIEDMYA DDDAFYEEIG ELKAKASEFS KEFKDIETVE EVSKSLREYS 
EIIAIVDKLG TFAGISMEVD TTDDHMAKRY ARLGNELSKV FSDLSFYEPA LSRLDKNILS
QVVEKNKDFA YFIERIEDKQ AHMLSDETEA VLASLEPTFD APYTNYNDMR YGDMDFENFE
YEGEEIVLNH NTFEEFLETD PRTDLRRESF KRYHDVLRKY QNADASVYNT QVTNEKRLSD
IRGYESVFHY LLARQDVDFD IYENQLDTIM EKLAPHMRKY AEIIKKHYGL DEMTYADLKL
AIDPDYELEV DIDKAREYIL DGLSPLGEEY VSYLDKAFSD RWIDYAQNTG KRTGAFCASP
YKSHPFIMTT YNNSMSQVMT LAHELGHACQ GIYSNDNQEA LLSGMSMYFV ESPSTANEIT
MERYLLNKAE DDREKLWVLS TMIGKTYYHN FVTHFLEASF QRDVYRAVEK GESLSSSDFN
RIFKENLEKF WGDSVKLDEG SELTWMRQPH YYMGLYPYTY SAGLTIGTII SDKIVNGTDE
DRKRWIDVLK MGGSKGPIDL AKEAGVDMTT TRPLEEAIEF IGSIIDQIDE LLAKLDMYK
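
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons, and it uses only the first and last 30 nucleotides of the gene sequence listed above. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Assumption: table 11 uses the
# same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 nucleotides of the gene sequence, copied from the record.
first_30 = "ATGAAAAAAA GAAGTGAAGT TGATGTAAAA"
last_30 = "CTTCTAGCAA AGCTTGACAT GTATAAGTAG"

print(translate(first_30))
print(translate(last_30))
```

Passing the full gene sequence to `translate` in the same way should reproduce the protein sequence listed above, provided the assumption about the codon table holds.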