Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_1682 |
Symbol | |
ID | 8398494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1829949 |
End bp | 1831280 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644996045 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003153423 |
Protein GI | 257067167 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000471578 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACTG GACCCTACCA TAGCTTGATC ACAACGCTTA GTATAATATT GTTGCTTTTT ATAAATGGAG TATTTACAGC AATTCATACA GGACTTATTA GCCTTAACAC ATCAAAGTTA GAAGAGGATT CTCTAAATGG AGATGAGAAG GCAGGATCTG TCCTTAAGAT TTTATCAAAT CAGGATAGGC TCAATCAATC CTTTGAAATA GCAAATATAA TATTTTCTCT TTTAACCATA GCTTACTTCG CAAAAAGGCT TAGAGAAGTG AGCTATGATG GGTATTTTTG GGGAGGAATC TTATCAGAAA GATGGGTGAC TATAGTAGCT ATAGTGGTGT ATACCATTCT TAAGCTGATA TTTGTAGACA AAATTCCCCA AAGAATTGGT GTAAGATTTC CAATGGAGAT TACCAAAATG GCTACGGGAA TCACTAAGCT TATCATGGTT CTTACAAGAC CTATTGTGGC CTTTACTACA GCTGTGACTA ATTTATTTAT GAATATATTT GGCATAGAGG CTAAAAATAT ACAAAAGCAA GTTACAGGCG AGCAGATTAA ATCAATTGTC CAGATAGGTG AAGATCAGGG AATCCTAAGA CCCATGGAGT CTAAGATGAT TCATTCCATA ATGGCTTTCG ACGACCTACT TGCAGAAGAG ATTATGACAG CAAGGACTGA TGTCTTTATG ATCGATATAA ATGATAAGGA CAAGGAGTAT CTCGAAGAGT TTGTCAAGAT AAAGCATGCG AGAATTCCAG TCTATGACGG GGAAGTAGAT AATATCTTGG GCATTGTTTA TACCAAAGAC TTCCTCCTTG AGGCGACAAA AGTAGGGCTT AAAAATGTAG AGATCAAGGA AATCATAAGG CCTGCCTACT TTGCACCAGA TAAGATAGAA ACCGACAAGC TTTTTTCCGA TATGCAGAAA AAACATATCC ATATGGCGAT TTTGATAGAC GAATACGGTG GATTTTCTGG GGTTGTCACA ATGGAAGATT TGATAGAAGA AATTGTAGGG GATATAGATG ATTCCTACGA CTATGATATT CCAGAAATCA AGGAAAATGG CAGAGATGTC TTCGTAGTCA AGGCTTCTGT AGGTATCAAG GACCTAAATG AGAAGATAAA TATAGGTATC GATGAAGATA ACGAAAACTA CGATTCCTTG GGAGGTTTTA TTATAGACAG GCTTGGCTAT ATACCGGAAG AAGACTCCAA GCTGTCCTTT GATTACAATG GTTACGAGAT CAAAATCCTC TATATAGAAG ATAATAGAAT CAAGGCTGTT AGGATTAGAA AATTAAAAAA TAAAGAAGAT AAAGAAGACT AG
|
Protein sequence | METGPYHSLI TTLSIILLLF INGVFTAIHT GLISLNTSKL EEDSLNGDEK AGSVLKILSN QDRLNQSFEI ANIIFSLLTI AYFAKRLREV SYDGYFWGGI LSERWVTIVA IVVYTILKLI FVDKIPQRIG VRFPMEITKM ATGITKLIMV LTRPIVAFTT AVTNLFMNIF GIEAKNIQKQ VTGEQIKSIV QIGEDQGILR PMESKMIHSI MAFDDLLAEE IMTARTDVFM IDINDKDKEY LEEFVKIKHA RIPVYDGEVD NILGIVYTKD FLLEATKVGL KNVEIKEIIR PAYFAPDKIE TDKLFSDMQK KHIHMAILID EYGGFSGVVT MEDLIEEIVG DIDDSYDYDI PEIKENGRDV FVVKASVGIK DLNEKINIGI DEDNENYDSL GGFIIDRLGY IPEEDSKLSF DYNGYEIKIL YIEDNRIKAV RIRKLKNKED KED
|
| |