Gene Apre_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0474 
Symbol 
ID8397249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp541427 
End bp543088 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content38% 
IMG OID644994831 
Productalpha,alpha-phosphotrehalase 
Protein accessionYP_003152242 
Protein GI257065986 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02403] alpha,alpha-phosphotrehalase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0201427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTAG GAAAACAAGT AATCTACCAG GCCTATCCTA GAAGCTTTAA GGATACGTCA 
GGAAATGGTA TAGGAGACCT TAAGGGAATC TGCCAAAAGG TAGATTATCT AAAAGAGCTT
GGAGTAGATA TGGTTTGGCT CAATCCTTTC TTTATCTCTC CACAAAACGA TAATGGCTAC
GACATAGCAG ATTATTACCA TGTAGACCCT TCCTTTGGGA CTGATGAGGA CCTTGATAAT
CTCATTGCAG AGTTTGACAA GGCAAATATT AAGTTAATGT TTGATATGGT ACTAAACCAC
ACATCGACAG AGCATGTTTG GTTCAAGAAG GCCCTAGCAG GTGATGAAAA ATATCAAAGT
TTCTACTATA TAAGAGATGG AAAGGACGGG TCATATCCTA CAAATTGGCA GTCCAAATTT
GGAGGACCTG CTTGGAATGA GTTTGGAGAT AGTGGAAAAT ACTATCTCTG CCTCTACGAT
AAGACCCAGG CCGACCTTAA TTGGCACAAT CCTGACCTTA GAGAAGAGCT TTACAAGATC
GTAAACTATT GGATTGATAA GGGAATCGGA GGATTTAGAT TTGACGTCTT AAATGTAATA
GGGAAAAGCC AAATCTTAGA AGATTCTTCA GGAGATATAA GCGAGGAGAA AAAGCTCTAC
ACAGACACAC CTATAGTTCA CAGGTGGATA AGGGAGCTTA ACAAACATAC TTTCGGGAAA
AACGCCGAGA TAATTACAGT AGGAGAGATG AGCTCGACTG ATATAGAAAA TTCTATCATG
TACTCAGCAG CAGAAGGCGA TGAGCTTTCC ATGATCTTTA GCTTCCACCA TCTTAAGGTA
GACTACAAGG ATGGGGACAA GTGGACTGAC ATGGACTTTG ACTTTATGAA ATTAAAGGAA
ATCCTAAACA AATGGCAGAA GGGACTCCAA GATGGGGGCG GTTGGAATGC TCTTTTCTGG
AACAATCACG ACCAGCCAAG GGCCAACAAT AGATTTGGAG ATGTTAAAAA TTATCCGAAA
GAAACAGCTA CTATGCTTGC CCAGACCATC CATATGATGA GAGGAACCCC TTATATCTAC
CAGGGAGAAG AAATCGGAAT GACAGACCCT GACTTTTCTT CAATTGATGA CTACAAGGAT
ATTGAAAGCA TCAATGCCTA CAAGGATTTA TTAGAAAAAG GTAAAAGTTC CGATGAAGCT
TTAAATATTA TCAAGAAAAA ATCTAGAGAC AATTCTAGAA CTCCTATGCA ATGGGACGGC
TCAGAAAATG CAGGCTTTAC GACAGGAACT CCTTGGATAG GGGTAAATGA TAATTATGAA
GAAATAAATG CAGAAAAGGC CCTAGAGGAT AAGGATTCAA TATTCTATTA TTACAAGAAA
CTAATAGAAT TAAGAAAAGA AGAGTCTATA ATCTCCGATG GTCTCTACTT CCTAATCCTT
GAAGATGATC CTCATATCTT CGCCTACATC AGAGAATATG AGGGAGAAGT TTTAATCAAT
ATGAATAACT TCTCAGGAGA GGAAGTTACA GTCGATCTTG AAAAAATCCT AGAAAATTAT
GATAAATTCG AATATCTCTT AGGAAATTAT GGAGAAAGAA AAGTAGACAA AAAAATAAGG
CTAAGGCCTT ACGAAAGCCT TGCCTTTATA AAAAGAAGCT AG
 
Protein sequence
MNLGKQVIYQ AYPRSFKDTS GNGIGDLKGI CQKVDYLKEL GVDMVWLNPF FISPQNDNGY 
DIADYYHVDP SFGTDEDLDN LIAEFDKANI KLMFDMVLNH TSTEHVWFKK ALAGDEKYQS
FYYIRDGKDG SYPTNWQSKF GGPAWNEFGD SGKYYLCLYD KTQADLNWHN PDLREELYKI
VNYWIDKGIG GFRFDVLNVI GKSQILEDSS GDISEEKKLY TDTPIVHRWI RELNKHTFGK
NAEIITVGEM SSTDIENSIM YSAAEGDELS MIFSFHHLKV DYKDGDKWTD MDFDFMKLKE
ILNKWQKGLQ DGGGWNALFW NNHDQPRANN RFGDVKNYPK ETATMLAQTI HMMRGTPYIY
QGEEIGMTDP DFSSIDDYKD IESINAYKDL LEKGKSSDEA LNIIKKKSRD NSRTPMQWDG
SENAGFTTGT PWIGVNDNYE EINAEKALED KDSIFYYYKK LIELRKEESI ISDGLYFLIL
EDDPHIFAYI REYEGEVLIN MNNFSGEEVT VDLEKILENY DKFEYLLGNY GERKVDKKIR
LRPYESLAFI KRS