Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_0454 |
Symbol | |
ID | 8397229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 517859 |
End bp | 519187 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644994811 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003152222 |
Protein GI | 257065966 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000262925 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAAA AATATTCAAT CGTTGTAGCT GGTGGTGGTT CAACATTTAC ACCAGGAATT ATCGGAATGC TTCTAGATAA CCTTGATAGA TTCCCAATTA GATCTATCAA ACTTTACGAT AACGACGCAG ATAGACAAGG CGTTATCGGT AAGGCACTAG AAATCTTACT AAAAGAAAGA CACCCAGAAA TCAACTTTGT CTACACTACA GATCCTGAAG AAGGATTTAC AGATGTTGAC TTCGTTCTAG CTCAATTAAG AGTTGGTAAG TATGAAATGC GTGATAGGGA CGAGAAAGTT CCATTAAAAC ACGGATGTAT CGGACAAGAA ACATGTGGAC CAGGAGGACT ATCTTATGGT ATGCGTTCAA TCGGTGGAGT ACTTGAAATC CTTGACTACA TGGAAAAATA CTCACCAGAT GCATGGATGT TAAACTACTC AAACCCTGCA GCAATCGTAG CAGAAGCTAC TAGAAGACTA AGACCAGATT CTAAGATTAT AAACATCTGT GACATGCCAA TTGACTTAAT GTACAAGATG GCAGACATGG TTGGCTTAAA AGAATGGCAA GAGCTTGACT TCTCTTATTA CGGTCTAAAC CACTTCGGTT GGTTTACAGC AATCTCTGAT AAGGAAGGAA ATGACCTAAT GCCACAAATC AAAGAGCACG TTTCAGTTAA CGGCTTTGCT GATGGTATAG GAACAGCCCA ACACCTTGAT CCATCATGGG TTGAGACATT CACAAAGGCA AAAGACGTTT ATGCCCTAGA TCCAGCAACA ATTCCAAACA CTTATCTAAA ATACTACTTC TACCCAGACT TCGTAATGGA GCATACTGAT CCAAACCACA CAAGAGTTGA CGAAGTTCGT GAAGGAAGAG AAAAAGATGT ATTTGGTTTC TGCCAAAATA TCATTGATAA GGGAACAGCA GAAGGAGTTG AAATCGAGCT TGATGCACAC GCAACCTACA TCGTAGACCT TGCTATTGCA CTTGCAGAAA ACACAAAGGA AAGATTCCTT CTTATAGTTG AAAACAACGG AGCTATTCCA AACTTTGACC CAACAGCAAT GGTAGAAATC CCATGTCTAG TTGGTAAAAA CGGAATCGAA AGAATCAACC AAGGAGCAAT TCCACAATTC CAAAAAGGCC TAATGGAACA ACAAGTATCT GTAGAAAAAC TTGTAGTTGA AGCTTGGATT GAAGGAAGCT ACCTAAAGAT GTGGCAAGCT CTAACACTTT CTGCAACAGT ACCAAGTGCG AAAGTTGCCA AAGAACTTCT TGACGATCTT ATAGAAGCAA ACAAGGACTT CTGGCCAGAA CTTAACTAA
|
Protein sequence | MTQKYSIVVA GGGSTFTPGI IGMLLDNLDR FPIRSIKLYD NDADRQGVIG KALEILLKER HPEINFVYTT DPEEGFTDVD FVLAQLRVGK YEMRDRDEKV PLKHGCIGQE TCGPGGLSYG MRSIGGVLEI LDYMEKYSPD AWMLNYSNPA AIVAEATRRL RPDSKIINIC DMPIDLMYKM ADMVGLKEWQ ELDFSYYGLN HFGWFTAISD KEGNDLMPQI KEHVSVNGFA DGIGTAQHLD PSWVETFTKA KDVYALDPAT IPNTYLKYYF YPDFVMEHTD PNHTRVDEVR EGREKDVFGF CQNIIDKGTA EGVEIELDAH ATYIVDLAIA LAENTKERFL LIVENNGAIP NFDPTAMVEI PCLVGKNGIE RINQGAIPQF QKGLMEQQVS VEKLVVEAWI EGSYLKMWQA LTLSATVPSA KVAKELLDDL IEANKDFWPE LN
|
| |