Gene Apre_0039 details

Gene Information

Locus tag: Apre_0039
Symbol:
ID: 8396786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 47341
End bp: 48642
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 40%
IMG OID: 644994376
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_003151815
Protein GI: 257065559
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
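
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 47341
end_bp = 48642
gene_length_bp = 1302
protein_length_aa = 433


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```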


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0183104
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTTA TTGATATAAT TGAAAAGAAA AAACTTAAAG AAGAACTAAC AGATGAAGAA 
ATCCAATTTT TTATAGATGG AGTAACTGAC GATTCAATCG AAGATTATCA AATAGCAGCC
CTCCTCATGG CGATCAGGCT TAACGGCATG ACAGAGCATG AGACAGCCAA GCTTGCAGAA
GCTATGATGC ACTCTGGAGA TGTTATTGAC CTATCAGAAA TAGAAGGAAT CAAATCTGAC
AAGCACTCAA CAGGTGGTGT TGGAGATAAG ACCTCAATGG CACTCGGTGC AATGGTTGCT
GCTTGTGGCC TTAAGCTTGC CAAGATGAGT GGTAGGGGCC TAGGACATAC TGGTGGAACC
CTTGATAAAC TAGAATCTAT AGAAGGATTT AACATCTCCC TTACCGAAGA AGAATTCAAA
AAGCAAGTAA ACGAAATAGG CCTTGCCATA ATTGGTCAAA CAGGAGACTT GGTCCCAGCT
GATAAGAAGC TCTACGCCCT AAGGGATGTA ACAGCAACAG TAGATTCGAT TCCGCTAATT
GCTTCATCAA TTATGTCTAA GAAACTTGCT TCTGGATCAG ATACCATATT ACTCGATGTA
AAATACGGTG AAGGTGCCTT CATGCACACA GTAGAAGATG CTAAGAAACT TGCCGAAGCT
ATGATTTCAA TCGGTAAGAA ACTAGGCAAA AATACTATGG CCATGATTAC AGATATGAAC
CAACCTTTAG GAAATACTAT AGGTAATGCC CTTGAAGTAA GAGAAGCTAT AGAGACTGTA
AGGGGACATG GACCAAAAGA CTTCACAGAA CTTTGTATGT GTGCTGGGGA GATTATGCTC
ATGCAAGCAG ACAAGGCAGA GACTAAAGAA GAAGCTAGAA AGATGTTAGA AGAAGCAATC
TCATCTGGAA AAGCCTACGA AAAGCTAGAA AAAATGGTAG AATACCAAGG CGGAAATGTA
GAACAAATCA GAAACACAGA CCTCCTTCCT CAAGCGAAAT TCAAGACAGA AATGTTATCT
AAAGAAGAAG GCTACATTGA AAATATCCAC TCAATGGGAC TTGGTATCCA AGCGATGAAG
CTTGGAGCTG GAAGAGCTAA GAAAACTGAC CCTATAAACT ACGCTGTTGG TCTCGAGATG
AATGCCAAAA AGGGCGACTA TGTCAAAAAG GGCGACCTTC TCTGTACAGT ATATCACGAC
GAAGAATTAA CAGAAGAGTG GAAAAAAGAT TTCTATGATA CCTTTACCTT TACAGACAAG
GAAGTAGAGC CAATTCCAAT AGTAGAAGAA ATTTTAAAAT AA
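
A GC content figure like the one reported in the Gene Information fields can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene sequence; the record does not say how its value was derived or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces and line breaks
    used to lay the sequence out in blocks of ten."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First displayed line of the gene sequence; paste the full sequence
# to compare against the reported GC content.
first_line = "ATGAGATTTA TTGATATAAT TGAAAAGAAA AAACTTAAAG AAGAACTAAC AGATGAAGAA"
print(f"{gc_content(first_line):.1%}")
```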
 
Protein sequence
MRFIDIIEKK KLKEELTDEE IQFFIDGVTD DSIEDYQIAA LLMAIRLNGM TEHETAKLAE 
AMMHSGDVID LSEIEGIKSD KHSTGGVGDK TSMALGAMVA ACGLKLAKMS GRGLGHTGGT
LDKLESIEGF NISLTEEEFK KQVNEIGLAI IGQTGDLVPA DKKLYALRDV TATVDSIPLI
ASSIMSKKLA SGSDTILLDV KYGEGAFMHT VEDAKKLAEA MISIGKKLGK NTMAMITDMN
QPLGNTIGNA LEVREAIETV RGHGPKDFTE LCMCAGEIML MQADKAETKE EARKMLEEAI
SSGKAYEKLE KMVEYQGGNV EQIRNTDLLP QAKFKTEMLS KEEGYIENIH SMGLGIQAMK
LGAGRAKKTD PINYAVGLEM NAKKGDYVKK GDLLCTVYHD EELTEEWKKD FYDTFTFTDK
EVEPIPIVEE ILK
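
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the forward reading frame starting at the first base and uses the codon assignments of translation table 11, which are the same as those of the standard table (table 11 differs only in which codons may act as start codons; this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGATTTA TTGATATAAT TGAAAAGAAA"))  # MRFIDIIEKK
```

Passing the full gene sequence should yield the 433-residue protein listed above, provided the sequence is pasted without alteration.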