Gene Apre_0243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0243 
Symbol 
ID8397017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp276172 
End bp277782 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content39% 
IMG OID644994604 
Productalpha amylase catalytic region 
Protein accessionYP_003152016 
Protein GI257065760 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC ATTGGTGGCA TAAGGCGACT ATTTATCAGA TTTATCCAAG GTCTTTTATG 
GACTCAAACA ATGATGGAAT AGGAGATCTT AGAGGAATTA TTTCTAAGCT CGATTATCTG
GAAAAGCTCG GGATCAATGC GATTTGGCTT TCTCCAGTTT ACCAGAGTCC AATGGATGAT
AATGGCTATG ATATATCTGA TTATAGGGCT ATTGCAGATA TTTTTGGCAA TATGGATGAT
ATGGAGGAGC TCCTAGATGA AGCTAAGAAG AGAGATATTA GAATCATCAT GGATCTTGTT
GTAAACCATA CCTCAGATGA GCATGCTTGG TTTATAGAGG CGAGAGATAA TCAAGCTAGC
CCTAAGCGTG ACTATTATAT CTGGAGGAAG GAGAAAAACG GCCTAGAATC TACCTTCTCT
GGCTCTGCTT GGGAGTATGA TGAGGATTCT GGCGAATATT ATCTCCACTT ATTCAGCAAG
AAGCAACCAG ACCTTAACTG GGAGAATGAA GACTTGCGTC ATGAAATTTA CGACATGATG
AACTTCTGGA TTGATAAGGG AATCGGGGGC TTCCGTATGG ATGTAATAGA CCTACTAGGC
AAAGTTCCTG ATAAAGAAAT CAAGGAAAAT GGACCAATGC TTCATACCTA CCTTAAAGAG
ATGAACAAAA ATACCTTTGG TAAGCATGAT TTATTGACAG TTGGTGAGAC TTGGGGAGCA
AGTCCTGAAA TTGCCAAGAA ATATTCAAAT CCAGATAACG AAGAGCTTTC CATGGTATTT
CAATTTGAGC ATATTGGCCT CCAACACAAG GAGGGTATGG CTAAATGGTT CTATGAAAAG
GACCTTGATG TAAGCAAGCT TAAGGAAATT TTCGCCAAAT GGCAAACTGA ACTAGAGCTT
GGCAAGGGTT GGAACTCGCT ATTTTGGGAA AACCACGACC TTCCTAGAGT CCTTTCACTC
TGGGCAGATG TCGACGAATA TAGGGAAAAA TCAGCCAAGG CTCTCGCCAT TCTCCTTCAT
CTTATGAGAG GAACTCCTTA TATCTATCAG GGAGAAGAAA TCGGCATGAC CAATTATCCT
TTCAAGGACT TAGCAGAATT TGAAGATATT GAGTCAATAA ATTATGCCAA GGAATGTCTA
GAAAAGGGAG AAGACGAGGA AGAGATCCTA GATAGGATAT CTGTTATAGG TCGTGACAAC
GCTAGGACTC CTATGCAATG GGACGACTCC AAGAATTCGG GCTTTTCTAA GGCGGATAAA
ACCTGGCTTC CTGTAAATCC AAATTACAAA GAAATAAATG TAGAAGAAGC TCTAAAAGAT
CCTGATTCAA TATTTTACAC CTACCAAAAA CTAGTTGACC TAAGGAAAAA GAAGGATTGG
CTAGTAGACG CTGACTTTAA GCTTTTAGAA ACAGATGAGA AAGTCTTCGC CTACACAAGA
GAGACTGACT TAGAAAAATA TCTCATTGTG GTTAATTTTT CTGGGGAAAG CCAAGACTTT
GACTTAGAAG AAGATTATAC TGATATTGTA ATTTCTAATA CAGATGTCAA AGAAGTTAAG
AATTCAGGCA AGCTTAAGGC CTGGGACGCG TTTTGTGTGA AAATTAAATA A
 
Protein sequence
MKKHWWHKAT IYQIYPRSFM DSNNDGIGDL RGIISKLDYL EKLGINAIWL SPVYQSPMDD 
NGYDISDYRA IADIFGNMDD MEELLDEAKK RDIRIIMDLV VNHTSDEHAW FIEARDNQAS
PKRDYYIWRK EKNGLESTFS GSAWEYDEDS GEYYLHLFSK KQPDLNWENE DLRHEIYDMM
NFWIDKGIGG FRMDVIDLLG KVPDKEIKEN GPMLHTYLKE MNKNTFGKHD LLTVGETWGA
SPEIAKKYSN PDNEELSMVF QFEHIGLQHK EGMAKWFYEK DLDVSKLKEI FAKWQTELEL
GKGWNSLFWE NHDLPRVLSL WADVDEYREK SAKALAILLH LMRGTPYIYQ GEEIGMTNYP
FKDLAEFEDI ESINYAKECL EKGEDEEEIL DRISVIGRDN ARTPMQWDDS KNSGFSKADK
TWLPVNPNYK EINVEEALKD PDSIFYTYQK LVDLRKKKDW LVDADFKLLE TDEKVFAYTR
ETDLEKYLIV VNFSGESQDF DLEEDYTDIV ISNTDVKEVK NSGKLKAWDA FCVKIK