Gene Information |
Locus tag | Apre_0072 |
Symbol | |
ID | 8396823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 90619 |
End bp | 92838 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 644994412 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_003151847 |
Protein GI | 257065591 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
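
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 90619
end_bp = 92838
gene_length_bp = 2220
protein_length_aa = 739


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one terminal stop codon.

    Assumes an unspliced, complete CDS, as the record states
    (Is gene spliced: No; Is pseudo gene: No).
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed above.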
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.723397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAGT ATAATGAAAA ATCTAGAACA TTTTATTTAG GGAATGAGTA TGTGAGTTAT ATCTTTAAAA TATTGGAAAA CGAACAGCTA GGACAGTTAT ACTACGGTAA AGCTATAAAA GATTCTAAAA ACTTTGATCA CTTATTTGAA AGTGAGCCAA GACCTATGAC TGTTTGTACA TTTGACGGAG ATATGAAGTT TTCTCTTGAA TATATAAAGC AAGAATATCC ATCATACGGT ACAGGAGATA TGCGTCATCC AGCCATAGAT ATATTACAAG AGAATGGAAG TCGAATCATA GACTTCAAAT ACCAAAGTCA TGAAATAATA AAAGGAAAGC CAGAATTAAA AGATCTACCA GCTACGTATG TTGAACATGA TGATGAAGCA GAAACACTAT CAGTAAGCTT ATACGATGAT TTAATAGATG CTAAGTTAAT ACTAACTTAT ACAATTTTTA AAGATAGACC TGTGATAACA AGAAATGCTT ATATTGAAAA TTGTGGCGAT ACTGAGTTTA GACTTAATAG AGCTATGAGC CTATCTTTAG ACTTGCCAGA TAAAGATTAT GATATGATTG AATTAACAGG AGCTTGGTCA AGGGAAAGAC ATATAAAGTC TAGAAAACTA GAGCATGGAA TTCAATCAAT ATACTCGCTT AGAGGAATAT CTAGTGCTAA TTTTAATCCT TTTATAGCAT TAAAAAGATA CGATTGTAAC GAGAATAGCG GAGAAGTACT AGGATTTAGC TTTGTATATA GTGGTAACTT TTTGGCTCAA GTTGAAGTTG ACACATACGA TATATCAAGA GTGAGCATGG GTATACATCC ACATAATTTT TCATGGAAGT TAAGAAAGGG AGAATCTTTC CAAACTCCTG AAGTGGTAAT GGTATATAGC GATAAAGGTC TTAATGGTAT GAGCCAAACA TTCCATAAAT TATATCAATC AAGACTTGCG AGAGGAAAAT TCCGTGATGA AGCAAGACCA ATCCTCGTAA ACAATTGGGA AGGAACTTAT TTTGATTTTG ATGAAGAAAA AATACTTAGC ATGGCAAAAC AATCTAAGGA ATTAGGAGTT GAGTTATTTG TATTAGATGA TGGGTGGTTT GGAGTTAGAA ATGATGATAC ATCTGGATTA GGAGATTGGT ATCCAAATCT AGATAAACTT CCAAATGGGA TATCAGGGTT ATCTAAAAAA GTTACAGAAA TGGGAATAAA ATTTGGATTA TGGATAGAGC CAGAAATGGT TAATAAAGAT TCAGAATTAT ACAGAAAACA TCCTGAATGG ACTTTAGAAA CACCTAATAG AAAATCAAGC CATGGTAGAC ATCAACATGT TTTAGATTTT TCTAATCCAG ATGTTATAGA TTATATATAT AAAATGATAT CTAAGGTAAT TAGAGAATCA GATATCTCTT ATATCAAATG GGACATGAAT AGATCTCTTA GTGAAGTTTA TTCTAATGTA CATGATAGCG AAAGTCAAGG TAAAGTAATG CACAAGTATG TTTTAGGAGT GTATAGATTG TATGAAATGC TCATAAATGA ATTCCCAGAC ATACTATTTG AATCATGTTC GAGTGGAGGA TCAAGATTTG ATCCAGGAAT GTTATATTAT GCTCCACAAT GTTGGACAAG TGATGATACT GATGCTATAG AAAGACTTAA AATTCAGTAC GGAACATCCC TAGTTTATCC ATTATCATCA ATAGGCGCTC ACGTATCCGC TATACCAAAT GCCCAAGTTT TCAGAAATGT ACCTATAGAA ACAAGGGCTA ATGTTGCTTG CTTTGGAACT 
TTCGGATATG AACTTGACGT AAACAAGTTG AGCGAAGAAG ATAAAAAAGT AATAGTTGAG CAAATAAAAT TTATGAAAGA TAATAGGAAG CTTTTACAGT TTGGAACTTT CTATAGACTA AAGAGCCCGT TTGAAGGTAA TGAGACTGTA TGGATGGTTG TATCCGAAGA CAAAGATAAG GCTATCGTAG GTTATTACAA AACACTACAA AAGGTAAATT GCCCATATAA TAGGGTAAAA CTTCAAGGAT TAGATCCAGA AAAGAAGTAT GAAGTATCAA TCAATGATTA TGAAGCGTAT GGCGATGAAT TAATGAATGT AGGAATGATA ACTACTGATA GATCGTCTGG AGAGCAAAAA GATATAAATA AGGCCGAAGG AGACTATTCT TCAAGACTTT ATATACTTAC AGCAAAATAA
|
Protein sequence | MIKYNEKSRT FYLGNEYVSY IFKILENEQL GQLYYGKAIK DSKNFDHLFE SEPRPMTVCT FDGDMKFSLE YIKQEYPSYG TGDMRHPAID ILQENGSRII DFKYQSHEII KGKPELKDLP ATYVEHDDEA ETLSVSLYDD LIDAKLILTY TIFKDRPVIT RNAYIENCGD TEFRLNRAMS LSLDLPDKDY DMIELTGAWS RERHIKSRKL EHGIQSIYSL RGISSANFNP FIALKRYDCN ENSGEVLGFS FVYSGNFLAQ VEVDTYDISR VSMGIHPHNF SWKLRKGESF QTPEVVMVYS DKGLNGMSQT FHKLYQSRLA RGKFRDEARP ILVNNWEGTY FDFDEEKILS MAKQSKELGV ELFVLDDGWF GVRNDDTSGL GDWYPNLDKL PNGISGLSKK VTEMGIKFGL WIEPEMVNKD SELYRKHPEW TLETPNRKSS HGRHQHVLDF SNPDVIDYIY KMISKVIRES DISYIKWDMN RSLSEVYSNV HDSESQGKVM HKYVLGVYRL YEMLINEFPD ILFESCSSGG SRFDPGMLYY APQCWTSDDT DAIERLKIQY GTSLVYPLSS IGAHVSAIPN AQVFRNVPIE TRANVACFGT FGYELDVNKL SEEDKKVIVE QIKFMKDNRK LLQFGTFYRL KSPFEGNETV WMVVSEDKDK AIVGYYKTLQ KVNCPYNRVK LQGLDPEKKY EVSINDYEAY GDELMNVGMI TTDRSSGEQK DINKAEGDYS SRLYILTAK
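
A GC content figure such as the one listed under Gene Information can be derived from a sequence written in the space-separated 10-base blocks used above. The following is a minimal sketch, not the method used to produce this record. The function name `gc_percent` is hypothetical, and the sample is only the first two blocks of the gene sequence, so its result is not the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used only as a short sample.
sample = "ATGATAAAGT ATAATGAAAA"
sample_gc = gc_percent(sample)

print(f"{sample_gc:.1f}%")
```

Passing the full gene sequence string to the same function would give the whole-gene percentage for comparison with the GC content field.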