Gene Apre_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0454 
Symbol 
ID8397229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp517859 
End bp519187 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content40% 
IMG OID644994811 
Productglycoside hydrolase family 4 
Protein accessionYP_003152222 
Protein GI257065966 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000262925 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAAA AATATTCAAT CGTTGTAGCT GGTGGTGGTT CAACATTTAC ACCAGGAATT 
ATCGGAATGC TTCTAGATAA CCTTGATAGA TTCCCAATTA GATCTATCAA ACTTTACGAT
AACGACGCAG ATAGACAAGG CGTTATCGGT AAGGCACTAG AAATCTTACT AAAAGAAAGA
CACCCAGAAA TCAACTTTGT CTACACTACA GATCCTGAAG AAGGATTTAC AGATGTTGAC
TTCGTTCTAG CTCAATTAAG AGTTGGTAAG TATGAAATGC GTGATAGGGA CGAGAAAGTT
CCATTAAAAC ACGGATGTAT CGGACAAGAA ACATGTGGAC CAGGAGGACT ATCTTATGGT
ATGCGTTCAA TCGGTGGAGT ACTTGAAATC CTTGACTACA TGGAAAAATA CTCACCAGAT
GCATGGATGT TAAACTACTC AAACCCTGCA GCAATCGTAG CAGAAGCTAC TAGAAGACTA
AGACCAGATT CTAAGATTAT AAACATCTGT GACATGCCAA TTGACTTAAT GTACAAGATG
GCAGACATGG TTGGCTTAAA AGAATGGCAA GAGCTTGACT TCTCTTATTA CGGTCTAAAC
CACTTCGGTT GGTTTACAGC AATCTCTGAT AAGGAAGGAA ATGACCTAAT GCCACAAATC
AAAGAGCACG TTTCAGTTAA CGGCTTTGCT GATGGTATAG GAACAGCCCA ACACCTTGAT
CCATCATGGG TTGAGACATT CACAAAGGCA AAAGACGTTT ATGCCCTAGA TCCAGCAACA
ATTCCAAACA CTTATCTAAA ATACTACTTC TACCCAGACT TCGTAATGGA GCATACTGAT
CCAAACCACA CAAGAGTTGA CGAAGTTCGT GAAGGAAGAG AAAAAGATGT ATTTGGTTTC
TGCCAAAATA TCATTGATAA GGGAACAGCA GAAGGAGTTG AAATCGAGCT TGATGCACAC
GCAACCTACA TCGTAGACCT TGCTATTGCA CTTGCAGAAA ACACAAAGGA AAGATTCCTT
CTTATAGTTG AAAACAACGG AGCTATTCCA AACTTTGACC CAACAGCAAT GGTAGAAATC
CCATGTCTAG TTGGTAAAAA CGGAATCGAA AGAATCAACC AAGGAGCAAT TCCACAATTC
CAAAAAGGCC TAATGGAACA ACAAGTATCT GTAGAAAAAC TTGTAGTTGA AGCTTGGATT
GAAGGAAGCT ACCTAAAGAT GTGGCAAGCT CTAACACTTT CTGCAACAGT ACCAAGTGCG
AAAGTTGCCA AAGAACTTCT TGACGATCTT ATAGAAGCAA ACAAGGACTT CTGGCCAGAA
CTTAACTAA
 
Protein sequence
MTQKYSIVVA GGGSTFTPGI IGMLLDNLDR FPIRSIKLYD NDADRQGVIG KALEILLKER 
HPEINFVYTT DPEEGFTDVD FVLAQLRVGK YEMRDRDEKV PLKHGCIGQE TCGPGGLSYG
MRSIGGVLEI LDYMEKYSPD AWMLNYSNPA AIVAEATRRL RPDSKIINIC DMPIDLMYKM
ADMVGLKEWQ ELDFSYYGLN HFGWFTAISD KEGNDLMPQI KEHVSVNGFA DGIGTAQHLD
PSWVETFTKA KDVYALDPAT IPNTYLKYYF YPDFVMEHTD PNHTRVDEVR EGREKDVFGF
CQNIIDKGTA EGVEIELDAH ATYIVDLAIA LAENTKERFL LIVENNGAIP NFDPTAMVEI
PCLVGKNGIE RINQGAIPQF QKGLMEQQVS VEKLVVEAWI EGSYLKMWQA LTLSATVPSA
KVAKELLDDL IEANKDFWPE LN