Gene Apre_1132 details


Gene Information

Locus tag: Apre_1132
Symbol:
ID: 8397919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1215503
End bp: 1217173
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 40%
IMG OID: 644995478
Product: NLP/P60 protein
Protein accession: YP_003152879
Protein GI: 257066623
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0791] Cell wall-associated hydrolases (invasion-associated proteins)
TIGRFAM ID:
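
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1215503
end_bp = 1217173
gene_length_bp = 1671
protein_length_aa = 556


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(start_bp, end_bp))            # 1671
print(expected_protein_length(gene_length_bp))  # 556
```

Under these assumptions both results agree with the listed gene length (1671 bp) and protein length (556 aa).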


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000155845
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA AAAATATAGG AGCTATGCTA TCAGTTATAG CTGCAGTTAC AGCAGGTGCG 
AGTGCATATG CTACTACAAT TGACAATTTA GACCAAGATC AACATACAAA TTCGCTAATT
ACAGAAGCGA ACTATACAGA AAGCTATTCT GATTATGATC TAGTGAAATC TGAATATACT
GAAGATGTAA AAAACACAAA AATTGCTAAA GAAGACAAAC CAGCCAAAGA AGAAAAAACA
GATAGTAAAT CTGATTTAGA TAAATTAGTA GAAAATGCTG CAAGAGATGC CCTAGTTGAA
ATAACAGCAA CTAAGGAAGT TGAAGCAAGC GAAGAAGCTC CAGCTGAAAA AGAAAAAGTT
GAAGCTAAAG AAGAAGCTAG CGAAATAGCG GACAAAGAAG AAACTCACGA AGAAAAAGTT
GAAGCTGCTA AGGGAAAAGA CAAAGACCTT ACCCTTGTAA AATACGTTAA TACAGAAATC
CTTAACGTAA GAAGTAGCAA GGATATGGAT GAGAATAATA TCGTATCTTC CCTCAAGGCA
GGAGATGAAA TAGAAGGAGT CCTAGAAGAA GGATTCCTAA AGACTGAATT AGGATATGTA
AATGATGAAT TTCTTTCAGA TGTTTATCCT GTAGATTTAG TAAATGAATT AAATAATAAA
GGAGAAGAAA AAGCCCAAGA AGTTGAAAAG CAAGAAGAAG CGAAAAAGGC TGAAGATGCA
GAAAAAGCTC AAGAGGCTGA AGAAGCTAAG AAAGCTCAAG AAGCAAAGAA AGCTGAAGAA
GCTAAGAAAG CTCAAGAAGC AAAGAAAGCT GAAGAAGCTA AGAAAGCTCA AGAAGCAAAA
GAGGCTGAGG AAGCTAAGAA GGCTCAAGAA GCAAAAGAAG CCGAAGAAGC AAGAAAAGCC
GAAGAAGCTA AGAAGGCTGA AGAAGCAAAA AAAGCTGAAG AGGCTAAGAA GGCCGAAGAA
GCAAAAAGAG CTGAAGAAGA AAGACAAGCC CAAGAGGCTC AATCATACTA CTATACAGGA
TGGGTTAACA CATCAGTCCT CAATGTTAGA AGTAAGGCAG GAGACGGCAG TATCATCGGA
TCTGTTAGAA AGGGTGACTG GCTAGAAGGC GAGGCTAGTA ATGGTTGGCT AGCAATTGAC
TATAATGGTC AAAAGGGATA TGTAGCAGCA GACTTCCTAT CTGACACAGA AGTAGCTAAG
GAAGAAGTGA AAGAAGAAGC AGCTGAGGCT AACGAACAAG TCCAAGAAGT TGAAGAAGTT
CAAGAAGTAG AACAAGCTTC AGCACCAGCC TATAATGGTT CTGGACTAGC AGCAGCAGAT
CTTGCAACAC AATTCGTAGG AAGCCCATAC GTTTGGGGTT CTGCTAACCC AGGAGTAGGC
TTTGACTGTT CAGGTCTTAC ATCTTATGTA TATGGCCAAA TGGGCATATC TATCCCACAC
CAATCAGCAG CCCAATACTC AAGCGGATAC GCTGTAGATT CATCTAACCT TCAAGCAGGA
GATCTTGTGT TCTTCTCTTA TGGTGGAGGT GGAATCGACC ACGTAGGAAT TGTAGTTAAT
TCTGACGGTA CCTTCGTTCA CGCATCTACA CCTGCAACAG GTGTTAGATA TGACAATGTA
TACAACGGTA GCTTCCAAAA CGCATTCGTT GGAGCTAGAA GGATATATTA G
 
Protein sequence
MKKKNIGAML SVIAAVTAGA SAYATTIDNL DQDQHTNSLI TEANYTESYS DYDLVKSEYT 
EDVKNTKIAK EDKPAKEEKT DSKSDLDKLV ENAARDALVE ITATKEVEAS EEAPAEKEKV
EAKEEASEIA DKEETHEEKV EAAKGKDKDL TLVKYVNTEI LNVRSSKDMD ENNIVSSLKA
GDEIEGVLEE GFLKTELGYV NDEFLSDVYP VDLVNELNNK GEEKAQEVEK QEEAKKAEDA
EKAQEAEEAK KAQEAKKAEE AKKAQEAKKA EEAKKAQEAK EAEEAKKAQE AKEAEEARKA
EEAKKAEEAK KAEEAKKAEE AKRAEEERQA QEAQSYYYTG WVNTSVLNVR SKAGDGSIIG
SVRKGDWLEG EASNGWLAID YNGQKGYVAA DFLSDTEVAK EEVKEEAAEA NEQVQEVEEV
QEVEQASAPA YNGSGLAAAD LATQFVGSPY VWGSANPGVG FDCSGLTSYV YGQMGISIPH
QSAAQYSSGY AVDSSNLQAG DLVFFSYGGG GIDHVGIVVN SDGTFVHAST PATGVRYDNV
YNGSFQNAFV GARRIY
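
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example rather than the method used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence listed above.
first_bases = "ATGAAGAAAA AAAATATAGG AGCTATGCTA"
last_bases = "GGAGCTAGAA GGATATATTA G"

print(translate(first_bases))  # MKKKNIGAML
print(translate(last_bases))   # GARRIY
```

The two fragments translate to the first ten and last six residues of the protein sequence shown above. Passing the full gene sequence to `gc_content` would give a value to compare against the listed GC content.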