Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_1186 |
Symbol | |
ID | 8397975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1264544 |
End bp | 1266712 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644995532 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_003152932 |
Protein GI | 257066676 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.41344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAAAA AGATTCTTTT ATTTACCTTA GGTCTCCTTT TGGGCCTTTT GATATTTATT AATTATGAAA ATTTGAATAT TTTATATATA CTTACAGGAT GTATTCTGAT AAGCCTTCTA TCAGTCTATA GGAAAAGCAA TTTGATCTAT ATGGCTATTG GGATCTTTCT GATCTTTTCC CTATCTTCTA TAAAGTTTAA AAGTCAGATG GGAAGTATTG AGGATAAGGG GAAGTTTAAT CTCACAGTCC TAGAAAAAAG GAAGGAAGAT TACGGCTTTC GATACTTCTT ACAGACAAAA GATGGCATAA GAGAGAGCAA AGTCCTTGCC TTTATGGACG AAGACTTAGC TATTGGAGAA TCCTTTGTAG GGTATGGGAA AGTTAAACTT CCTTCTACAA ACACCAACCC CAATCTATTT TCTTACAGGA AGTATCTGGC TAGCAAGGGG ATTTTTAAGG AAATTAAGAT AGAAAAGATC GATCAAAGAA GAAGGACTCG AAATTTTCCT CTGGATATGA GAAATTCATT TTATAATTAT ATCCATAAGA CTTTCGATAA TAATTTAAGT AAAAGATCAG CTGACTTTGT CGTTTCTGTT ATCCTTGGGG AAAATCTTAT AGAAAATGAT TCGATAAGGG ACCTAGGTCT AAGCCACATC CTTGCTGTAA GTGGACTTCA TATGGATATG CTATTTTTCT TTATCCTTAT TTTGTTTGGA AGATTTAACT ATAGGTACGC CTATGGCTTT GGACTTTTCC TTGCCCTTAT CTATGGATAC CTAATAGGCT TTCCCTTTTC AGTAATTAGG GTGATAGGCC TAAATGTGAT TTCTTTTTTG GCCTTTTTGT ATAATAAGCC CATGGATAAA ATAAAAGCTT TATTAATAAT AGGATCTGGG ATATTACTAA TTAATCCCTT CGCTGGTCTA AATGCTGGTT TTATTTTGAC CTTTGCGGCA AGCTTTGCAG TCTATTTAGT CTATCCAAAG ATTAAAAATC ACTTTAGCAA ATCCTACATA GGAGAAAGCC TCGCCTTTAC TAGCTCCATC CAGCTTGCTC TTTTTCCCTT TATGATTTAT TATTATGGGA GCTTTAACCT GGTAAGCATA CTGGCAAACT TTCTAATCCT GCCTGTCTTT AGCCTTTCTA TGTATATAAT ATTTATAATA ATATTTGCCT ATCCCCTATT AGGAAGTTTC CTTAAGCTAT TATTTATTGG ACTTAACTTT CTTGTTGAAA GCATATTAAA TATGACAGAA CTTTTAAATA AGATCAAATT CCTTGCCCTA GATTTCAGAA AGCCTCATAT CTTACTTGCA TTTTATGGAT TTATCCTTAT CATAATAATG TTAAATTTAG GAAGAAATAG GGCCAAGGCT CATGCAAACA TCATCCCCCT ATCGATTATG GTCTTGATAT TTTCACTAGG AAAAGAGAGG GGAGAGATTA GCTATCAGAT GATAGATATT GGCCAAGGAG ACGCCTTTTT ATTAAATGAT AAGGGATCTT ACTATATGAT AGATGTAGGA GGCCCCAAAT ACAAAAATTA TGATTCAGGA GAGAAAATCC TCCTTCCCTA CCTTAAGTCA CTTGGAATTA GGGAGATTGA AGGGATATTT ATATCCCATG AGGATAAGGA TCATATGGGA AATCTGGACT TAGTTTGGGA TAATTTCCAG GTTAAAAATG TCTACACTAA TAAGCTTAAT GAGGATTCTC TTAGAAAATA TAAGCCCCAT ATCCTAAAAA AGGGAGATAG GATTAAGCTT AAATCAGGCT ACATTACTGT TATAGATGAG GGCTACGATT CTAATGAAAA TGCAAATTCT ATGGGGCTGA TCCTTGATAT AAGGGGAGTT AAGATAATGA CCTTGGGAGA TTTGCCGAGT GAATTTGAAA AAAATATAAA AGATACAGCT CATATATTAA AGCTCTCCCA CCATGGATCA AAGACTTCTA CTAGGAGGGA CTTTGTGGAG CAAGTAAATC CTAAAATCGT CCTAATTTCT GCAGGAAGAA ATAATGTTTA CGGCCATCCC CACAGGGAAG TTTTGGATAA TGTATATGAC AGAAAGATTT ATAATAGCCA GACAGATGGG ATGGTCGAGA TGAGATTTAA TAAGGACTTT GAGATAGAGA GATTTCTTAA GGGAGGATAT TTTAGATGA
|
Protein sequence | MRKKILLFTL GLLLGLLIFI NYENLNILYI LTGCILISLL SVYRKSNLIY MAIGIFLIFS LSSIKFKSQM GSIEDKGKFN LTVLEKRKED YGFRYFLQTK DGIRESKVLA FMDEDLAIGE SFVGYGKVKL PSTNTNPNLF SYRKYLASKG IFKEIKIEKI DQRRRTRNFP LDMRNSFYNY IHKTFDNNLS KRSADFVVSV ILGENLIEND SIRDLGLSHI LAVSGLHMDM LFFFILILFG RFNYRYAYGF GLFLALIYGY LIGFPFSVIR VIGLNVISFL AFLYNKPMDK IKALLIIGSG ILLINPFAGL NAGFILTFAA SFAVYLVYPK IKNHFSKSYI GESLAFTSSI QLALFPFMIY YYGSFNLVSI LANFLILPVF SLSMYIIFII IFAYPLLGSF LKLLFIGLNF LVESILNMTE LLNKIKFLAL DFRKPHILLA FYGFILIIIM LNLGRNRAKA HANIIPLSIM VLIFSLGKER GEISYQMIDI GQGDAFLLND KGSYYMIDVG GPKYKNYDSG EKILLPYLKS LGIREIEGIF ISHEDKDHMG NLDLVWDNFQ VKNVYTNKLN EDSLRKYKPH ILKKGDRIKL KSGYITVIDE GYDSNENANS MGLILDIRGV KIMTLGDLPS EFEKNIKDTA HILKLSHHGS KTSTRRDFVE QVNPKIVLIS AGRNNVYGHP HREVLDNVYD RKIYNSQTDG MVEMRFNKDF EIERFLKGGY FR
|
| |