Gene Apre_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0794 
Symbol 
ID8397578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp895110 
End bp896504 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content32% 
IMG OID644995140 
Productsucrose-6-phosphate hydrolase 
Protein accessionYP_003152543 
Protein GI257066287 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID[TIGR01322] sucrose-6-phosphate hydrolase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.656645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGATA AGAAAATATT TGAAGAAAAT TTTGCTAGAT TTTGCAAGAT GAAAGATAGG 
GCCAAAGAAG ATCCTAATAT GTTAAGATTT CACATATATC CAGAAGCAGG TTGGCTCAAT
GACCCGAATG GTCTGTGTGA ATACAAGGGG ATATATCACA TATACTACCA ATATTCTCCC
TTAGATGAAA ATAAATCCGA TACCTGTTGG GGGCATGTTA GTACAGTAGA TTTCATAAAT
TATAAAAGAG AGGATATCTT TATTTATCCT GATGCAAAAT TCGATAAAGA CGGAGCTTAT
TCGGGTTCTG CTTTTATTAA AGATGATAGG ATAAACTTCT TTTATACAGG TAATGTTAAG
CATGATGGTG AATATGATTT CATAAAAGAA GGAAGAGAGC ATAATACCAT CAAGCTAGCA
TCAGATGCCT ATACTTACTC TGAAAAAAAA TTAATCTTAG ATAATGATGA TTATCCAAAA
GATATGACAA AACATGTAAG AGATCCAAAA ATATATGAAA AAGATGGATA TAATTATATG
TTTATTGGAG CAAGAAGTCT TGAGGATAAG GGGTTGGTAC TAGTTTATAA GTCAAAAGAC
TTAGATAATT TCACCTATCA TATGAGAATA GAAACTGATT ACGATTTCGG TTACATGCGG
GAATGTCCAG ATTTTTTTAC GGTTGATTCT AAAGATTTTC TAATACTCTG CCCTCAAGGT
GTAGAAAGCG AAACCTATAG ATATCAGAAT ATTTACCAAG CTGGATACTT CCCGATAGAA
ATTGACCTTT CAGAAAAGAC TTATAAGCTA GGAAACTTTA AAGAATTAGA TTATGGATTT
GATTTCTACG CGCCTCAAAC ATTCGAAGAT TCAAAATCTA GAAGAATATT AATAGGTTGG
ATGGGTATGC CTGACGCAAA CTACACAAAT CCTACGATTG ATTATAAATG GCAACATTGT
CTTACTGTAC CAAGAAGCCT ATCTGTAAAA AGCGGAAAGC TATGTCAAAA TCCAATTTCA
GAGATGGAAA ATCTTAGGAA AGATAATATC ACCCTAAAAA GGGGAGAAAA CTATGAAGAT
TTCGTATTTG AAGCCTTGGC TAAAAATATA AAAGATTCTT TTAAGATTAA CCTAAGAAGT
GACGTTAGTC TATCCTATGA TGATGGAATA TTGAGATTAA ATATGGGTTA TTCATCGTTT
GGGAGAGATA CTAGACAGGT AAAAATTAAT GTAATTAGTG ATTTAAGAAT ATATTCTGAT
AAGTCTTCTC TAGAAATTTT TATAAATGAA GGAGAATATG TATTAACCAG TAGAGTATAC
TCTGAAAAAG CAGGTTTTTA TTCTAAGGAT TTGGATTTTG ACCTATACAC ATTAGATTCA
ATAGGTTTCA TATAA
 
Protein sequence
MLDKKIFEEN FARFCKMKDR AKEDPNMLRF HIYPEAGWLN DPNGLCEYKG IYHIYYQYSP 
LDENKSDTCW GHVSTVDFIN YKREDIFIYP DAKFDKDGAY SGSAFIKDDR INFFYTGNVK
HDGEYDFIKE GREHNTIKLA SDAYTYSEKK LILDNDDYPK DMTKHVRDPK IYEKDGYNYM
FIGARSLEDK GLVLVYKSKD LDNFTYHMRI ETDYDFGYMR ECPDFFTVDS KDFLILCPQG
VESETYRYQN IYQAGYFPIE IDLSEKTYKL GNFKELDYGF DFYAPQTFED SKSRRILIGW
MGMPDANYTN PTIDYKWQHC LTVPRSLSVK SGKLCQNPIS EMENLRKDNI TLKRGENYED
FVFEALAKNI KDSFKINLRS DVSLSYDDGI LRLNMGYSSF GRDTRQVKIN VISDLRIYSD
KSSLEIFINE GEYVLTSRVY SEKAGFYSKD LDFDLYTLDS IGFI