Gene Mflv_4842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_4842 
SymboldeoA 
ID4976154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp5154907 
End bp5156238 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content69% 
IMG OID640459071 
Productthymidine phosphorylase 
Protein accessionYP_001136098 
Protein GI145225420 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTTCG ACGCCGCCGG CCCCATCTCG CATCTCGATG CTCCGTCGGT GATCCGAACG 
AAGCGCGATG GTGGAGCCCT GTCCGACGAG GCCATCGACT GGGTCATCGA CGCCTACACG
CGCGGGGAGG TCGCCGAGGC GCAGATGTCC GCGCTGCTGA TGGCGATCTT CCTCCGGGGT
ATGACCGGCC ATGAGATCGC GCGGTGGACC GCGGCGATGA TCGCGTCGGG CGAGCGACTG
GACTTCAGCG ACCTGCGCCG GGACGGGAAG CCGCTCGCGC TGGTCGACAA ACACTCGACA
GGCGGTGTCG GGGACAAGAT CACGATCCCG TTGGTGCCGG TCGTCATGGC GTGTGGCGGC
GCGGTACCGC AGGCGGCCGG GCGTGGACTC GGACACACCG GTGGCACGCT CGACAAACTG
GAATCCATCC CGGGGTTCAC CGCCGAGATC ACCAAAAGCC AGATCCGGCA ACAACTCTGC
GATCTCGGGG CGGCGATCTT CGCCGCCGGG GAATTGGCGC CCGCCGACCG CAAGATCTAC
GCCCTGCGCG ACGTGACGGC GACGACGGAG TCGCTGCCGT TGATCGCGAG CTCGGTGATG
AGCAAGAAGA TCGCCGAGGG GACGAAGGCG CTGGTGCTCG ACTCCAAGGT CGGGTCGGGT
GCGTTCCTCG ACAGTGAGGC CGAGTCGCGG GAGTTGGCTC GCACCATGGT CGAGCTCGGC
GTCGAGCACG GTGTGCAGAC CCGCGCGCTG CTCACCGACA TGCAGACCCC GCTGGGCCGG
ACCGTCGGCA ACGCCGTCGA GGTCGTCGAG TCCCTGGAGG TGCTCGCCGG CGGCGGACCC
GACGATGTCG TGGAGCTGAC CCTGGCGTTG GCGAGGGAGA TGTGCGACGT CGCAGGGCTC
GACGGCGTCG ACCCCGCGCA GACGTTGCGC GACGGGACCG CGATGGACCG GTTCCGCGAC
CTGGTCGCCG CCCAGGGTGG CGACGTGGAC AGTCTCGATG CCGCGATGTT GCCACTCGGT
CAGCACACCG AAACCATCAG TGCCCCGCGC AGTGGCACGA TGGGAGACAT CGACGCGATG
GCGGTGGGTC TGGCGGTGTG GCGGCTCGGA GCGGGACGCT CGGTACCCGG CGAGCAGGTG
CAGTTCGGCG CGGGGATGCG GATCCATCGC AAACCCGGCG AGGCCGTCGC CGCCGGCGAG
CCGTTGTTCA CCCTCTACAC CGAGACCCCG CACCGGCTTG CCGCCGCGGC GTCCGAACTC
GACGGCGCCT GGAGCGTCGG CGACCACGCA CCGCCCACGC GTCCACTCAT CATCGACCGG
ATCAGCATGT AG
 
Protein sequence
MSFDAAGPIS HLDAPSVIRT KRDGGALSDE AIDWVIDAYT RGEVAEAQMS ALLMAIFLRG 
MTGHEIARWT AAMIASGERL DFSDLRRDGK PLALVDKHST GGVGDKITIP LVPVVMACGG
AVPQAAGRGL GHTGGTLDKL ESIPGFTAEI TKSQIRQQLC DLGAAIFAAG ELAPADRKIY
ALRDVTATTE SLPLIASSVM SKKIAEGTKA LVLDSKVGSG AFLDSEAESR ELARTMVELG
VEHGVQTRAL LTDMQTPLGR TVGNAVEVVE SLEVLAGGGP DDVVELTLAL AREMCDVAGL
DGVDPAQTLR DGTAMDRFRD LVAAQGGDVD SLDAAMLPLG QHTETISAPR SGTMGDIDAM
AVGLAVWRLG AGRSVPGEQV QFGAGMRIHR KPGEAVAAGE PLFTLYTETP HRLAAAASEL
DGAWSVGDHA PPTRPLIIDR ISM