Gene Plav_3461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3461 
Symbol 
ID5455912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3705111 
End bp3706628 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content59% 
IMG OID640879047 
Productthymidine phosphorylase 
Protein accessionYP_001414718 
Protein GI154253894 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.592088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGCCC GCATCACTCC CCAGACACCC GCCTTGCAAG CATTGCGCAT GCGACTGCAT 
GCCCAGCATC AACCCGTCGT CTTGATGCGT ACCGATTGCC ATGTCTGCCG TGCCGAAGGG
CTGGCACCGC GGTCACAGGT ACTGATCATT GCCGGCGACC GCACTGTGCA AGCGCTACTG
TACCAAATCG ACAGCGATCT GCTCAAAACC GGACAGATCG CTCTGTCCGA GGCCGCCTGG
GATGCCCTGG ACATTCATGA GGGCGATCTT GTGCAGGTTC GGCATCCTCC GCTGCTCGAA
TCGTTGTCGG CCGTGCGTGC GCGAATTCAC GGCCACCGAC TGCAAACGAC GGAGTTGCAG
GCGATCGTCC GTGATGTGGT CGATGGTCGC TATACCGATG TCGCACTTTC GGCCTTCCTG
ACCGCAACGG CGGTACTGCC TCTGGATATG CAAGAGACCA TCCATCTCAC CCGTGCGATG
GTCGATGTCG GAGATCACCT GCAATGGCAG GCTCCGATTG TTGTGGACAA GCATTGCGTG
GGCGGATTAC CGGGAAATCG CACCACGCCG TTGGTGGTTG CCATCGCCGC AGCCAATGGA
TTGGTGATGC CCAAGACCTC ATCACGCGCC ATCACCTCTC CCGCTGGCAC CGCGGACACC
ATGGAAACGC TGGCTCCTGT AGACCTGGAC CTGGATACGC TCAGAAAGGT CGTGGAGAAA
GAGGGTGGAT GCGTGGCGTG GGGCGGCGCG ATGCACCTCA GCCCCGCGGA CGACATCTTC
GTGCGTATTG AGCGTGAACT GGATATCGAC ACGCAAGGAC AACTGATTGC CTCGGTGTTA
TCCAAGAAGA TTGCAGCAGG GGCGACCCAC ATCGTGATCG ATATTCCGGT TGGGCCAACC
GCAAAAGTCC GCAGCCGGGA AACTGCCGAG CATCTTGCGC ATCACCTTTC GGAAGTCGCC
GCGTCATTTG GCCTTGTATT GCGTTGCCTG TTTACAGACG GGAATCAGCC TGTCGGCAGA
GGTATCGGCC CGGCGTTGGA GGCGCGCGAC GTGTTGGCCG TATTGCGCAA CGAGGCGGAT
GCGCCGCAAG ACCTATGTGA CCGCGTGGCG TTGGTCGCGG GTGCGGTACT TGAGCTTGGC
GGCGTCGCCA AAGAAGGGGA TGGAATTCGA TTGGCTCACG AGACGATCAG CAGTGGCCGC
GCCTGGGAAA AATTTCAGAG GATCTGTGCC GCTCAGGGGG GATTTCGTGA GCCACCCCAA
GCTCTTTACG TCGAACCGCT TTTGGCAACC ACTTCAGGCC GAGCAGTACA CATCGACAAC
CGTAAGCTGT CCCGTTTAGC CAAATTAGCC GGAGCGCCTG AGAGTCCAGC CGCAGGGATT
CAATTGCAAG TGCGCTTAGG TGACGAGGTA ACACGCGGAC AATCATTGAT GTTTTTGCAT
GCGCAAACCT CTGGAGAGAT GGCCTATGCA CTCGCATACG TGCATGACAT TGGTGACATC
GTAAAGATTG AACCTTAG
 
Protein sequence
MSARITPQTP ALQALRMRLH AQHQPVVLMR TDCHVCRAEG LAPRSQVLII AGDRTVQALL 
YQIDSDLLKT GQIALSEAAW DALDIHEGDL VQVRHPPLLE SLSAVRARIH GHRLQTTELQ
AIVRDVVDGR YTDVALSAFL TATAVLPLDM QETIHLTRAM VDVGDHLQWQ APIVVDKHCV
GGLPGNRTTP LVVAIAAANG LVMPKTSSRA ITSPAGTADT METLAPVDLD LDTLRKVVEK
EGGCVAWGGA MHLSPADDIF VRIERELDID TQGQLIASVL SKKIAAGATH IVIDIPVGPT
AKVRSRETAE HLAHHLSEVA ASFGLVLRCL FTDGNQPVGR GIGPALEARD VLAVLRNEAD
APQDLCDRVA LVAGAVLELG GVAKEGDGIR LAHETISSGR AWEKFQRICA AQGGFREPPQ
ALYVEPLLAT TSGRAVHIDN RKLSRLAKLA GAPESPAAGI QLQVRLGDEV TRGQSLMFLH
AQTSGEMAYA LAYVHDIGDI VKIEP