Gene Apre_1186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1186 
Symbol 
ID8397975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1264544 
End bp1266712 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content35% 
IMG OID644995532 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_003152932 
Protein GI257066676 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.41344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAAA AGATTCTTTT ATTTACCTTA GGTCTCCTTT TGGGCCTTTT GATATTTATT 
AATTATGAAA ATTTGAATAT TTTATATATA CTTACAGGAT GTATTCTGAT AAGCCTTCTA
TCAGTCTATA GGAAAAGCAA TTTGATCTAT ATGGCTATTG GGATCTTTCT GATCTTTTCC
CTATCTTCTA TAAAGTTTAA AAGTCAGATG GGAAGTATTG AGGATAAGGG GAAGTTTAAT
CTCACAGTCC TAGAAAAAAG GAAGGAAGAT TACGGCTTTC GATACTTCTT ACAGACAAAA
GATGGCATAA GAGAGAGCAA AGTCCTTGCC TTTATGGACG AAGACTTAGC TATTGGAGAA
TCCTTTGTAG GGTATGGGAA AGTTAAACTT CCTTCTACAA ACACCAACCC CAATCTATTT
TCTTACAGGA AGTATCTGGC TAGCAAGGGG ATTTTTAAGG AAATTAAGAT AGAAAAGATC
GATCAAAGAA GAAGGACTCG AAATTTTCCT CTGGATATGA GAAATTCATT TTATAATTAT
ATCCATAAGA CTTTCGATAA TAATTTAAGT AAAAGATCAG CTGACTTTGT CGTTTCTGTT
ATCCTTGGGG AAAATCTTAT AGAAAATGAT TCGATAAGGG ACCTAGGTCT AAGCCACATC
CTTGCTGTAA GTGGACTTCA TATGGATATG CTATTTTTCT TTATCCTTAT TTTGTTTGGA
AGATTTAACT ATAGGTACGC CTATGGCTTT GGACTTTTCC TTGCCCTTAT CTATGGATAC
CTAATAGGCT TTCCCTTTTC AGTAATTAGG GTGATAGGCC TAAATGTGAT TTCTTTTTTG
GCCTTTTTGT ATAATAAGCC CATGGATAAA ATAAAAGCTT TATTAATAAT AGGATCTGGG
ATATTACTAA TTAATCCCTT CGCTGGTCTA AATGCTGGTT TTATTTTGAC CTTTGCGGCA
AGCTTTGCAG TCTATTTAGT CTATCCAAAG ATTAAAAATC ACTTTAGCAA ATCCTACATA
GGAGAAAGCC TCGCCTTTAC TAGCTCCATC CAGCTTGCTC TTTTTCCCTT TATGATTTAT
TATTATGGGA GCTTTAACCT GGTAAGCATA CTGGCAAACT TTCTAATCCT GCCTGTCTTT
AGCCTTTCTA TGTATATAAT ATTTATAATA ATATTTGCCT ATCCCCTATT AGGAAGTTTC
CTTAAGCTAT TATTTATTGG ACTTAACTTT CTTGTTGAAA GCATATTAAA TATGACAGAA
CTTTTAAATA AGATCAAATT CCTTGCCCTA GATTTCAGAA AGCCTCATAT CTTACTTGCA
TTTTATGGAT TTATCCTTAT CATAATAATG TTAAATTTAG GAAGAAATAG GGCCAAGGCT
CATGCAAACA TCATCCCCCT ATCGATTATG GTCTTGATAT TTTCACTAGG AAAAGAGAGG
GGAGAGATTA GCTATCAGAT GATAGATATT GGCCAAGGAG ACGCCTTTTT ATTAAATGAT
AAGGGATCTT ACTATATGAT AGATGTAGGA GGCCCCAAAT ACAAAAATTA TGATTCAGGA
GAGAAAATCC TCCTTCCCTA CCTTAAGTCA CTTGGAATTA GGGAGATTGA AGGGATATTT
ATATCCCATG AGGATAAGGA TCATATGGGA AATCTGGACT TAGTTTGGGA TAATTTCCAG
GTTAAAAATG TCTACACTAA TAAGCTTAAT GAGGATTCTC TTAGAAAATA TAAGCCCCAT
ATCCTAAAAA AGGGAGATAG GATTAAGCTT AAATCAGGCT ACATTACTGT TATAGATGAG
GGCTACGATT CTAATGAAAA TGCAAATTCT ATGGGGCTGA TCCTTGATAT AAGGGGAGTT
AAGATAATGA CCTTGGGAGA TTTGCCGAGT GAATTTGAAA AAAATATAAA AGATACAGCT
CATATATTAA AGCTCTCCCA CCATGGATCA AAGACTTCTA CTAGGAGGGA CTTTGTGGAG
CAAGTAAATC CTAAAATCGT CCTAATTTCT GCAGGAAGAA ATAATGTTTA CGGCCATCCC
CACAGGGAAG TTTTGGATAA TGTATATGAC AGAAAGATTT ATAATAGCCA GACAGATGGG
ATGGTCGAGA TGAGATTTAA TAAGGACTTT GAGATAGAGA GATTTCTTAA GGGAGGATAT
TTTAGATGA
 
Protein sequence
MRKKILLFTL GLLLGLLIFI NYENLNILYI LTGCILISLL SVYRKSNLIY MAIGIFLIFS 
LSSIKFKSQM GSIEDKGKFN LTVLEKRKED YGFRYFLQTK DGIRESKVLA FMDEDLAIGE
SFVGYGKVKL PSTNTNPNLF SYRKYLASKG IFKEIKIEKI DQRRRTRNFP LDMRNSFYNY
IHKTFDNNLS KRSADFVVSV ILGENLIEND SIRDLGLSHI LAVSGLHMDM LFFFILILFG
RFNYRYAYGF GLFLALIYGY LIGFPFSVIR VIGLNVISFL AFLYNKPMDK IKALLIIGSG
ILLINPFAGL NAGFILTFAA SFAVYLVYPK IKNHFSKSYI GESLAFTSSI QLALFPFMIY
YYGSFNLVSI LANFLILPVF SLSMYIIFII IFAYPLLGSF LKLLFIGLNF LVESILNMTE
LLNKIKFLAL DFRKPHILLA FYGFILIIIM LNLGRNRAKA HANIIPLSIM VLIFSLGKER
GEISYQMIDI GQGDAFLLND KGSYYMIDVG GPKYKNYDSG EKILLPYLKS LGIREIEGIF
ISHEDKDHMG NLDLVWDNFQ VKNVYTNKLN EDSLRKYKPH ILKKGDRIKL KSGYITVIDE
GYDSNENANS MGLILDIRGV KIMTLGDLPS EFEKNIKDTA HILKLSHHGS KTSTRRDFVE
QVNPKIVLIS AGRNNVYGHP HREVLDNVYD RKIYNSQTDG MVEMRFNKDF EIERFLKGGY
FR