Gene Apre_1040 details


Gene Information

Locus tag: Apre_1040
Symbol:
ID: 8397827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1109367
End bp: 1110935
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 34%
IMG OID: 644995388
Product: Dak phosphatase
Protein accession: YP_003152789
Protein GI: 257066533
COG category: [R] General function prediction only
COG ID: [COG1461] Predicted kinase related to dihydroxyacetone kinase
TIGRFAM ID: [TIGR03599] DAK2 domain fusion protein YloV
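
The coordinate and length fields above are consistent with each other, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1109367, 1110935)
print(length)                     # 1569
print(protein_length_aa(length))  # 522
```

Both values match the Gene Length and Protein Length fields of this record.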


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGTA TTGATGCAAA TAAACTTCAA CAAATGATTA TTGGAGCTTA TGAATATTTA 
AATGAAAATA AGGACTTGGT TAATGAGCTA AACGTCTTTC CTGTACCAGA TGGAGATACA
GGTACGAATA TGTTTATGAC AATAAAATCT GGTCTTGATA AAGTAAATAA ATCTGGATTG
ACTTCTGTTG AAGAAGTTTC TAAGGCACTA AGTCAAGGGA CTCTTATGGG AGCTAGGGGA
AATTCAGGTG TTATTCTATC TCAGCTCGTT AGGGGTATGT CAAAGACTCT TAAGGGTAAG
GATACAATCT ATGCTACTGA TGTAAGGGAT ATTTTCGCAA ATGCTAGTAA AACAGCCTAC
AAAGCTGTCA TGCAGCCAAC AGAGGGAACA ATCCTTACTG TAGCCAATAA AATGGCTGAC
AAGGCTAAGG AATCCTTCAA TGAAGATATA GAATTAGACG ATTACCTTAT AGAAATTATT
GGAGCGGGTC AAGTAGCATT AGATAATACC CCTAACCAAC TTCCAGTCCT CAAAGAAGCA
GGTGTTGTTG ACTCAGGTGG ACAAGGATTA ATTTTCTTAC TAAGAGGTGC ACTAAATGCA
TTAAATTCAA ATATCGACAG AGATATCGAT TTGTCTGAAG AAAAAAGCGA TGATGACTTT
ACTTATAAGG TGGAATTTGA GCTTGCTGGA AATGAAGAAA AGCTTGCTTC TTTAAATGAA
AACCTTGAAA GAATCACGAA GAATTATGAA TCTGACTTAA ATAAAGATTT ATTGAAATCA
ACATTTAAGA CTGACAGCCC TCAAAATATA GTTCAAATGA TTTTGATGGA GGGAGTGATA
TTAAAAATCA CGGTAGAAAA CTTAAAGCCT GAAGTAGAGA ACATCCCTGA AAATAAGGAA
GCTATCAATA AGAAATACGG ATTTATTGCA GTATCTAGGG GCGAAGGCTA CAACGCTATA
ATGGAAAGCA TGAATATAGA TAAGGTTATT GAAGGTGGAC AAACTATGAA CCCATCTACT
GAAGATCTAT ATAAGGCCGT TGAAGAAATA AAGGCTGAAA ATATTTTTAT TTTCCCGAAC
AACAAGAATA TTATAATGTC TGCGAAGCAA GCTGCTGAAG TTTCAGATAA GAATGTTTTC
GTTGTTGAGA CAAGATCAAT TCCTGAAGCC TTCAGTGCAA TTCTTGAATT TGATGAAGGG
ATGAGTCCAG AAGAAAATTT AGAGAATATG AATGAAATCA TTGAAGATAT TCATATTGCT
GAAGTTTCTA TATCTATAAG AGATACAAGC GTCAATGATA TTAAGATAAG AAAAGATGAT
TATATAGGAA TACTCGATGG TAAAATTGTA GCTACAGATT CATCTATAGA AAAAACTTGT
GAGGAAACCA TAGCTGATAT TATAGAAGAA GATGATATTT CTTTGATTAC TATTTATTAT
GGTGAGGATA TAGAAAAAAG AAGAGCTAAG GATTTTTCTA AGAAACTTTC TAAAAAATTC
AAGGACGTGG ACTGCGAATT AGTCTACGGT GGACAGCCAG TCTACTACTA TACAATCACA
CTTGAATAA
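
A GC content figure such as the one reported in the Gene Information section could be derived from the gene sequence roughly as follows. This is a minimal sketch: the function name is hypothetical, and how the database rounds the percentage or treats ambiguous bases is an assumption (here only A, C, G and T are counted).

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases of a sequence.

    Whitespace (as in the 10-base blocks above) and any non-ACGT
    characters are ignored; this handling is an assumption.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 5 of 20 bases are G or C.
print(gc_percent("ATGAAAAGTA TTGATGCAAA"))  # 25.0
```

Passing the full gene sequence to the function gives the gene-wide value.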
 
Protein sequence
MKSIDANKLQ QMIIGAYEYL NENKDLVNEL NVFPVPDGDT GTNMFMTIKS GLDKVNKSGL 
TSVEEVSKAL SQGTLMGARG NSGVILSQLV RGMSKTLKGK DTIYATDVRD IFANASKTAY
KAVMQPTEGT ILTVANKMAD KAKESFNEDI ELDDYLIEII GAGQVALDNT PNQLPVLKEA
GVVDSGGQGL IFLLRGALNA LNSNIDRDID LSEEKSDDDF TYKVEFELAG NEEKLASLNE
NLERITKNYE SDLNKDLLKS TFKTDSPQNI VQMILMEGVI LKITVENLKP EVENIPENKE
AINKKYGFIA VSRGEGYNAI MESMNIDKVI EGGQTMNPST EDLYKAVEEI KAENIFIFPN
NKNIIMSAKQ AAEVSDKNVF VVETRSIPEA FSAILEFDEG MSPEENLENM NEIIEDIHIA
EVSISIRDTS VNDIKIRKDD YIGILDGKIV ATDSSIEKTC EETIADIIEE DDISLITIYY
GEDIEKRRAK DFSKKLSKKF KDVDCELVYG GQPVYYYTIT LE
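
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand instead of relying on a library; it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, and that difference is not modelled here). The function name and the whitespace handling are assumptions made for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAAAGTA TTGATGCAAA TAAACTTCAA CAAATGATTA TTGGAGCTTA TGAATATTTA"
print(translate(first_line))  # MKSIDANKLQQMIIGAYEYL
```

The output matches the first 20 residues of the protein sequence, and the final three codons of the gene (`CTT GAA TAA`) translate to `LE` followed by a stop, matching its last two residues.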