Gene Apre_0414 details


Gene Information

Locus tag: Apre_0414
Symbol:
ID: 8397188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 470198
End bp: 472579
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 40%
IMG OID: 644994772
Product: glycogen/starch/alpha-glucan phosphorylase
Protein accession: YP_003152184
Protein GI: 257065928
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0058] Glucan phosphorylase
TIGRFAM ID: [TIGR02093] glycogen/starch/alpha-glucan phosphorylases
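
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(470198, 472579)
print(length, protein_length(length))  # prints: 2382 793
```

Both results match the Gene Length and Protein Length fields listed above.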


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAAC TGAAAAAGAA TAAATTTATT GAAAATTACG TAGAGAACTT ACAAAGAATT 
ACCCTTAAAT CCTTCGATGA GACAAGCGAT AAGGATAGAT ACAATGCTCT ATGCGATTCA
ATCATGGAAT TAATCAACGA AGAGTGGAGA GCTTGTAAGA GAAATACAAG AAACGAGAGA
AAGGCTTATT ACCTATCTGC AGAATTTCTA ATAGGAAGAT CTTTGGGAAA TAACCTCATT
AACCTAGGTA TCTATGACGA GGTCAAAGAG CTTCTAGATG AGATTGGAAT CGACTTTGAA
GCCATAGAAA ACTACGAGGA TGACGCAGCA CTTGGTAATG GAGGTCTCGG AAGACTTGCA
GCCTGCTTTA TGGACTCTGC CGCAACCCAA GGGATCGATC TTGTAGGCTA CGGTGTAAGA
TATAGAGAAG GAATCTTTAA ACAAAAAATC GAAGAAGGCT TCCAAGTAGA AAGCGGAGAC
AGCTGGATCA AGGACGGAGA TGGCTGGTCA ATCAGAGTAG ACTCCGATGC TAAAATCGTA
AAATTTAGAG ACCAACAAGT AAAGGCAGTT CCATTCGACA TGCCTGTAGT AGGTTTCGAA
AACGGCAGGG TAAACACCCT AAGACTATGG CAATCTGAGC CATTTGAAGA GTTTGACTTC
GCTAAGTTTA ATAACTACGA ATACGACGAT GCAGTAGCAG AAAAAAACCG TGCGGAAGAT
ATCACAAGAG TACTCTATCC AAACGACATG CAAAGAGCTG GTAAGGTCCT AAGACTTAAA
CAACAATATT TCTTCTGCTC AGCCTCTATC CAAGATATGA TAGAAAAATA CAAGAGAGAC
TTCCCAGAAG ACCTACAGTT TAAGAACTTC TCCAAATACC ACGTTATCCA ACTTAACGAT
ACCCACCCAA TTATGGCTAT TCCTGAACTA ATCAGAGTCT TGGTAGATGA AAATGGAATC
TTCTTTGAAG ACGCCCTAAA GATTGCTAGA AAGGTATTCG CCTTCACCAA CCACACTGTC
CTTCAAGAAG CCCTAGAAAG ATGGGATAAG GATATAGTTC TTGAAGTAAG TCCAAGATGT
CTTGAAATAA TCGAAAAGAT TAACGAAGAA CTAGTAAAGG AATTTAAGGC AAAAGGCTAT
TCAGAAGAGC AAATCGACCC ATACAGAATC GAAAGATTTG AGCAAATCCA CATGGCAAAT
CTTGCAATCT ATGTAGGATT TTCTGTAAAT GGTGTAGCAG CCCTCCACAC AGAAATTCTA
AAGGCTGATA CTTTCAAGCA CTGGTATAAA CTAAGACCTG AAATGTTTAA CAACAAGACC
AACGGTATCA CCCCAAGAAG ATGGCTAGTA TACTCTAACA GAGAGCTATC AAGCTTCATT
ACAGAAAAGC TTGGAACAGA CGAGTGGAAA TACCAACTAG ATCTACTCAA GGGACTAGAA
AAATACAAGG ATGACGAAAA GGTCCTAGAA GAACTTTGGG ATATTAAACA AACAAAGAAA
AACGAGCTTG CCAAATATAT TTTGGACACA GAGGGAGTAA AAATTGACCC AGAATCAATC
TTTGATATCC AAATTAAGAG AATCCACGAA TACAAGAGAC AACACCTTAA CGTCTTACAC
ATTATCTATC TCTACCACAA GCTTAAGAAA AATCCTGACA TGGAATTTAC TCCAACAACC
TTTATCTTCG GAGGAAAAGC AGCTCCAGGA TACTTTAGAG CCAAGGGAAT GATCAAACTT
GCTAACGAAG TTGCAAGAGT AGTAAACGCA GATCCTGACG TAAATGACAA GATTAAGGTA
GTATTTGTAG AAAACTACAG GGTAAGCTAC GCAGAAAAAC TCTTCCCTGC AGCAGACATC
TCAGAACAAA TCTCAACAGC AGGTAAGGAA GCAAGTGGTA CAGGAAACAT GAAATTCATG
CTAAACGGAG CCCTCACACT TGGAACACTC GATGGAGCTA ATATCGAAAT CTTCGAACAC
GCTGGAGAAG AAAACAACTT CAGATTCGGT GCTACAGTTG AAGAGCTCAA CGAGATAATG
GATAGCTACA ACCCAGTAGA ATACTACTCA AAAGACCCAG ATATCAAGGA TGTAGTAGAC
AGCCTCGTAA GCGGAGAATT TAAAGATAAC GAATCCTACA TGTTCCTAGA TATCTACAAC
GAACTAATCA AGCCACAAGA AGGCCAAAGA GGAGACAACT ACTTCCTACT AAAAGACTTC
AAGTCTTACG CTAAGGCTCA CGAAAGAGTA AACGAAGCCT ACAAGAATAA GCTAGACTGG
TCTAGAAAAT GCTTGATTAA CATTGCAAAT GCTGGATTCT TCTCATCAGA TAGAACAATC
CTAGATTATG CGGCTGACAT CTGGAAAATA GATCAAGAGT AG
 
Protein sequence
MDKLKKNKFI ENYVENLQRI TLKSFDETSD KDRYNALCDS IMELINEEWR ACKRNTRNER 
KAYYLSAEFL IGRSLGNNLI NLGIYDEVKE LLDEIGIDFE AIENYEDDAA LGNGGLGRLA
ACFMDSAATQ GIDLVGYGVR YREGIFKQKI EEGFQVESGD SWIKDGDGWS IRVDSDAKIV
KFRDQQVKAV PFDMPVVGFE NGRVNTLRLW QSEPFEEFDF AKFNNYEYDD AVAEKNRAED
ITRVLYPNDM QRAGKVLRLK QQYFFCSASI QDMIEKYKRD FPEDLQFKNF SKYHVIQLND
THPIMAIPEL IRVLVDENGI FFEDALKIAR KVFAFTNHTV LQEALERWDK DIVLEVSPRC
LEIIEKINEE LVKEFKAKGY SEEQIDPYRI ERFEQIHMAN LAIYVGFSVN GVAALHTEIL
KADTFKHWYK LRPEMFNNKT NGITPRRWLV YSNRELSSFI TEKLGTDEWK YQLDLLKGLE
KYKDDEKVLE ELWDIKQTKK NELAKYILDT EGVKIDPESI FDIQIKRIHE YKRQHLNVLH
IIYLYHKLKK NPDMEFTPTT FIFGGKAAPG YFRAKGMIKL ANEVARVVNA DPDVNDKIKV
VFVENYRVSY AEKLFPAADI SEQISTAGKE ASGTGNMKFM LNGALTLGTL DGANIEIFEH
AGEENNFRFG ATVEELNEIM DSYNPVEYYS KDPDIKDVVD SLVSGEFKDN ESYMFLDIYN
ELIKPQEGQR GDNYFLLKDF KSYAKAHERV NEAYKNKLDW SRKCLINIAN AGFFSSDRTI
LDYAADIWKI DQE
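
The gene and protein sequences are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks might be cleaned, translated, and summarised follows. It assumes the listed sequence is the coding strand read 5' to 3', and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in this record.
first_line = clean(
    "ATGGACAAAC TGAAAAAGAA TAAATTTATT GAAAATTACG TAGAGAACTT ACAAAGAATT"
)
print(translate(first_line))  # prints: MDKLKKNKFIENYVENLQRI
print(round(gc_content(first_line), 2))
```

The translated residues agree with the start of the protein sequence above. Applying `gc_content` to the full cleaned gene sequence would give the value to compare against the GC content field.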