Gene Apre_1603 details


Gene Information

Locus tag: Apre_1603
Symbol:
ID: 8398415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1744733
End bp: 1746397
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 39%
IMG OID: 644995967
Product: alpha amylase catalytic region
Protein accession: YP_003153345
Protein GI: 257067089
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
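
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
start_bp = 1744733
end_bp = 1746397

def gene_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1

def expected_protein_length(gene_len):
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumption)."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1665
print(expected_protein_length(length_bp))  # 554
```

Under these assumptions both results agree with the reported gene length (1665 bp) and protein length (554 aa).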


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000968128
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAA AGTGGTGGCA GAAGGAAATT GTTTATCAGA TTTATCCTAG GTCTTTCAAG 
GACTCGAACA ATGACGGGAT AGGAGATATT AGGGGAATTG TTGAAAAACT TGATTACTTG
AAGGACTTGG GCATTACTAT GATTTGGCTT TGCCCGATTT ATAAGTCCCC TATGGCCGAC
AATGGTTATG ACATTTCTGA TTATTTCGAT ATCAATGAAG AGTTTGGCAA CATGGAAGAC
TTTGATCTTT TGGTTGAAGA AGCGAAAAAA AGAGATATCA AGGTCATGAT GGACCTTGTC
CTAAACCACA CTTCCAACGA ACATGAGTGG TTTAAGGAGG CGATATCAGA TAAGGATAGT
CCTTACAGGA ATTATTATAT AATTAGAGAA GGGAAAGAGC CTCCAAACAA CTGGAGATCA
ATCTTTGGTG GATCTACCTG GACTAAGATT GATGGAGAAG ATGCCTATTA TCTCCACTCC
TTTGCCAAGG AGCAGCCAGA TCTCAATTGG GAAAACCCTA AGCTTAGAGA AGAAGTGATT
AATATCGTCA ATTTCTGGAT TGATAAGGGA ATTACAGCCT TTAGAATGGA TGCAATCAAC
CACATAAAGA AAGATCCTTC ATATAAAAGT GGAGATCCAG ACGGGGCTGA TGGCAGAGTT
TCTGTCGTAA AATTCGGTAG AAATCAAGAT GGAGTCGAAG AACTCATAAG GATCCTTTCA
GATAATACTT TCAAGATCCA CGATTCGATG ACTGTGGGCG AGACTGCTGG TCTTTCTTAT
GACAAGTATG CAAACTACAT CGGTGATGAT GGGGTATTTT CCATGGTATT TGACTTTATC
CCAGCAAACT TCGACGTGGT CGAAGAAACT TGGTACAAGA GACTTGACTG GAAGGTAAGT
GACTTTAGAA AGTCAATTTT CGATAGTCAA GAGTCAATCC AAAAATACGG CTGGTCAGCA
AATTTCATAG AAAACCACGA CCAACCAAGG GCTACTACCA AGATTTTAAG GGAAAAGGAC
GAGGATATTG ATGCTATAAA GATGCTTGGA GGAATTTATT TCTTCTTTAG GGGAACTCCT
TATATCTACC AAGGCCAAGA GCTCGGTATG AAAAACTTCG TAAGAGAATC ACCAGACGAC
TTCCAAGACA TCCAATCCAT AGACTCTTAT AAGAGATCGC TTGAAGAAGG ATTTAGCGAG
AAGGAAGCCC TCTACTTCAC CAACCTCAGA AGCAGGGACA ACCCAAGAGT TCCTTTCGCC
TGGACTAATG AAAAGTACGG AGGCTTCTCA GAAACTAAAC CTTGGCTTGC CATGGCCTAC
GATAATCCTA AAGTAAATGC TGAGGATGAA GAAAAGGATA AGGATTCTGT CCTAAACTTC
TACAAAAAAA TGATAGACTT TAGGCAAAAT AGCCAGTATT CTGATATCCT AATCTATGGA
GACTTCAAGC CTTTGGAAGG TTTTGATGAT GAAATAATAG CCTACGAAAG AATCCTAGAT
GGTAAAAAAC TAGAAATTAT CGCCAACTTC TCAGATGAAG AAAAGAAAAT AGAAGCAAGG
GGTAAGGATA TAATCTTTTC CAACTCAAAG GGAGAGATTG AAGGAGATAT CTTAAGCTTA
AATCCATATA GTTTTGTAAT ATTAAAGAAT AAAAATAATA AATAA
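
The gene sequence is printed in space-separated groups of 10 bases, so the whitespace has to be removed before any computation. A minimal sketch of how a GC percentage could be computed from such text follows; the helper names are hypothetical, and the rounding used by the database for its reported GC content is not known.

```python
def clean_sequence(text):
    """Drop the whitespace used for grouping and line wrapping; uppercase the rest."""
    return "".join(text.split()).upper()

def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGCAAAAAA AGTGGTGGCA GAAGGAAATT GTTTATCAGA TTTATCCTAG GTCTTTCAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field in the record.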
 
Protein sequence
MQKKWWQKEI VYQIYPRSFK DSNNDGIGDI RGIVEKLDYL KDLGITMIWL CPIYKSPMAD 
NGYDISDYFD INEEFGNMED FDLLVEEAKK RDIKVMMDLV LNHTSNEHEW FKEAISDKDS
PYRNYYIIRE GKEPPNNWRS IFGGSTWTKI DGEDAYYLHS FAKEQPDLNW ENPKLREEVI
NIVNFWIDKG ITAFRMDAIN HIKKDPSYKS GDPDGADGRV SVVKFGRNQD GVEELIRILS
DNTFKIHDSM TVGETAGLSY DKYANYIGDD GVFSMVFDFI PANFDVVEET WYKRLDWKVS
DFRKSIFDSQ ESIQKYGWSA NFIENHDQPR ATTKILREKD EDIDAIKMLG GIYFFFRGTP
YIYQGQELGM KNFVRESPDD FQDIQSIDSY KRSLEEGFSE KEALYFTNLR SRDNPRVPFA
WTNEKYGGFS ETKPWLAMAY DNPKVNAEDE EKDKDSVLNF YKKMIDFRQN SQYSDILIYG
DFKPLEGFDD EIIAYERILD GKKLEIIANF SDEEKKIEAR GKDIIFSNSK GEIEGDILSL
NPYSFVILKN KNNK