Gene Apre_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0072 
Symbol 
ID8396823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp90619 
End bp92838 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content33% 
IMG OID644994412 
Productglycoside hydrolase clan GH-D 
Protein accessionYP_003151847 
Protein GI257065591 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.723397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGT ATAATGAAAA ATCTAGAACA TTTTATTTAG GGAATGAGTA TGTGAGTTAT 
ATCTTTAAAA TATTGGAAAA CGAACAGCTA GGACAGTTAT ACTACGGTAA AGCTATAAAA
GATTCTAAAA ACTTTGATCA CTTATTTGAA AGTGAGCCAA GACCTATGAC TGTTTGTACA
TTTGACGGAG ATATGAAGTT TTCTCTTGAA TATATAAAGC AAGAATATCC ATCATACGGT
ACAGGAGATA TGCGTCATCC AGCCATAGAT ATATTACAAG AGAATGGAAG TCGAATCATA
GACTTCAAAT ACCAAAGTCA TGAAATAATA AAAGGAAAGC CAGAATTAAA AGATCTACCA
GCTACGTATG TTGAACATGA TGATGAAGCA GAAACACTAT CAGTAAGCTT ATACGATGAT
TTAATAGATG CTAAGTTAAT ACTAACTTAT ACAATTTTTA AAGATAGACC TGTGATAACA
AGAAATGCTT ATATTGAAAA TTGTGGCGAT ACTGAGTTTA GACTTAATAG AGCTATGAGC
CTATCTTTAG ACTTGCCAGA TAAAGATTAT GATATGATTG AATTAACAGG AGCTTGGTCA
AGGGAAAGAC ATATAAAGTC TAGAAAACTA GAGCATGGAA TTCAATCAAT ATACTCGCTT
AGAGGAATAT CTAGTGCTAA TTTTAATCCT TTTATAGCAT TAAAAAGATA CGATTGTAAC
GAGAATAGCG GAGAAGTACT AGGATTTAGC TTTGTATATA GTGGTAACTT TTTGGCTCAA
GTTGAAGTTG ACACATACGA TATATCAAGA GTGAGCATGG GTATACATCC ACATAATTTT
TCATGGAAGT TAAGAAAGGG AGAATCTTTC CAAACTCCTG AAGTGGTAAT GGTATATAGC
GATAAAGGTC TTAATGGTAT GAGCCAAACA TTCCATAAAT TATATCAATC AAGACTTGCG
AGAGGAAAAT TCCGTGATGA AGCAAGACCA ATCCTCGTAA ACAATTGGGA AGGAACTTAT
TTTGATTTTG ATGAAGAAAA AATACTTAGC ATGGCAAAAC AATCTAAGGA ATTAGGAGTT
GAGTTATTTG TATTAGATGA TGGGTGGTTT GGAGTTAGAA ATGATGATAC ATCTGGATTA
GGAGATTGGT ATCCAAATCT AGATAAACTT CCAAATGGGA TATCAGGGTT ATCTAAAAAA
GTTACAGAAA TGGGAATAAA ATTTGGATTA TGGATAGAGC CAGAAATGGT TAATAAAGAT
TCAGAATTAT ACAGAAAACA TCCTGAATGG ACTTTAGAAA CACCTAATAG AAAATCAAGC
CATGGTAGAC ATCAACATGT TTTAGATTTT TCTAATCCAG ATGTTATAGA TTATATATAT
AAAATGATAT CTAAGGTAAT TAGAGAATCA GATATCTCTT ATATCAAATG GGACATGAAT
AGATCTCTTA GTGAAGTTTA TTCTAATGTA CATGATAGCG AAAGTCAAGG TAAAGTAATG
CACAAGTATG TTTTAGGAGT GTATAGATTG TATGAAATGC TCATAAATGA ATTCCCAGAC
ATACTATTTG AATCATGTTC GAGTGGAGGA TCAAGATTTG ATCCAGGAAT GTTATATTAT
GCTCCACAAT GTTGGACAAG TGATGATACT GATGCTATAG AAAGACTTAA AATTCAGTAC
GGAACATCCC TAGTTTATCC ATTATCATCA ATAGGCGCTC ACGTATCCGC TATACCAAAT
GCCCAAGTTT TCAGAAATGT ACCTATAGAA ACAAGGGCTA ATGTTGCTTG CTTTGGAACT
TTCGGATATG AACTTGACGT AAACAAGTTG AGCGAAGAAG ATAAAAAAGT AATAGTTGAG
CAAATAAAAT TTATGAAAGA TAATAGGAAG CTTTTACAGT TTGGAACTTT CTATAGACTA
AAGAGCCCGT TTGAAGGTAA TGAGACTGTA TGGATGGTTG TATCCGAAGA CAAAGATAAG
GCTATCGTAG GTTATTACAA AACACTACAA AAGGTAAATT GCCCATATAA TAGGGTAAAA
CTTCAAGGAT TAGATCCAGA AAAGAAGTAT GAAGTATCAA TCAATGATTA TGAAGCGTAT
GGCGATGAAT TAATGAATGT AGGAATGATA ACTACTGATA GATCGTCTGG AGAGCAAAAA
GATATAAATA AGGCCGAAGG AGACTATTCT TCAAGACTTT ATATACTTAC AGCAAAATAA
 
Protein sequence
MIKYNEKSRT FYLGNEYVSY IFKILENEQL GQLYYGKAIK DSKNFDHLFE SEPRPMTVCT 
FDGDMKFSLE YIKQEYPSYG TGDMRHPAID ILQENGSRII DFKYQSHEII KGKPELKDLP
ATYVEHDDEA ETLSVSLYDD LIDAKLILTY TIFKDRPVIT RNAYIENCGD TEFRLNRAMS
LSLDLPDKDY DMIELTGAWS RERHIKSRKL EHGIQSIYSL RGISSANFNP FIALKRYDCN
ENSGEVLGFS FVYSGNFLAQ VEVDTYDISR VSMGIHPHNF SWKLRKGESF QTPEVVMVYS
DKGLNGMSQT FHKLYQSRLA RGKFRDEARP ILVNNWEGTY FDFDEEKILS MAKQSKELGV
ELFVLDDGWF GVRNDDTSGL GDWYPNLDKL PNGISGLSKK VTEMGIKFGL WIEPEMVNKD
SELYRKHPEW TLETPNRKSS HGRHQHVLDF SNPDVIDYIY KMISKVIRES DISYIKWDMN
RSLSEVYSNV HDSESQGKVM HKYVLGVYRL YEMLINEFPD ILFESCSSGG SRFDPGMLYY
APQCWTSDDT DAIERLKIQY GTSLVYPLSS IGAHVSAIPN AQVFRNVPIE TRANVACFGT
FGYELDVNKL SEEDKKVIVE QIKFMKDNRK LLQFGTFYRL KSPFEGNETV WMVVSEDKDK
AIVGYYKTLQ KVNCPYNRVK LQGLDPEKKY EVSINDYEAY GDELMNVGMI TTDRSSGEQK
DINKAEGDYS SRLYILTAK