Gene Information |
Locus tag | Plav_3461 |
Symbol | |
ID | 5455912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 3705111 |
End bp | 3706628 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640879047 |
Product | thymidine phosphorylase |
Protein accession | YP_001414718 |
Protein GI | 154253894 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
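
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the Start/End coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3705111, 3706628)
print(length, protein_length(length))  # 1518 505
```

Both values match the Gene Length and Protein Length rows of the record.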
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.592088 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGCCC GCATCACTCC CCAGACACCC GCCTTGCAAG CATTGCGCAT GCGACTGCAT GCCCAGCATC AACCCGTCGT CTTGATGCGT ACCGATTGCC ATGTCTGCCG TGCCGAAGGG CTGGCACCGC GGTCACAGGT ACTGATCATT GCCGGCGACC GCACTGTGCA AGCGCTACTG TACCAAATCG ACAGCGATCT GCTCAAAACC GGACAGATCG CTCTGTCCGA GGCCGCCTGG GATGCCCTGG ACATTCATGA GGGCGATCTT GTGCAGGTTC GGCATCCTCC GCTGCTCGAA TCGTTGTCGG CCGTGCGTGC GCGAATTCAC GGCCACCGAC TGCAAACGAC GGAGTTGCAG GCGATCGTCC GTGATGTGGT CGATGGTCGC TATACCGATG TCGCACTTTC GGCCTTCCTG ACCGCAACGG CGGTACTGCC TCTGGATATG CAAGAGACCA TCCATCTCAC CCGTGCGATG GTCGATGTCG GAGATCACCT GCAATGGCAG GCTCCGATTG TTGTGGACAA GCATTGCGTG GGCGGATTAC CGGGAAATCG CACCACGCCG TTGGTGGTTG CCATCGCCGC AGCCAATGGA TTGGTGATGC CCAAGACCTC ATCACGCGCC ATCACCTCTC CCGCTGGCAC CGCGGACACC ATGGAAACGC TGGCTCCTGT AGACCTGGAC CTGGATACGC TCAGAAAGGT CGTGGAGAAA GAGGGTGGAT GCGTGGCGTG GGGCGGCGCG ATGCACCTCA GCCCCGCGGA CGACATCTTC GTGCGTATTG AGCGTGAACT GGATATCGAC ACGCAAGGAC AACTGATTGC CTCGGTGTTA TCCAAGAAGA TTGCAGCAGG GGCGACCCAC ATCGTGATCG ATATTCCGGT TGGGCCAACC GCAAAAGTCC GCAGCCGGGA AACTGCCGAG CATCTTGCGC ATCACCTTTC GGAAGTCGCC GCGTCATTTG GCCTTGTATT GCGTTGCCTG TTTACAGACG GGAATCAGCC TGTCGGCAGA GGTATCGGCC CGGCGTTGGA GGCGCGCGAC GTGTTGGCCG TATTGCGCAA CGAGGCGGAT GCGCCGCAAG ACCTATGTGA CCGCGTGGCG TTGGTCGCGG GTGCGGTACT TGAGCTTGGC GGCGTCGCCA AAGAAGGGGA TGGAATTCGA TTGGCTCACG AGACGATCAG CAGTGGCCGC GCCTGGGAAA AATTTCAGAG GATCTGTGCC GCTCAGGGGG GATTTCGTGA GCCACCCCAA GCTCTTTACG TCGAACCGCT TTTGGCAACC ACTTCAGGCC GAGCAGTACA CATCGACAAC CGTAAGCTGT CCCGTTTAGC CAAATTAGCC GGAGCGCCTG AGAGTCCAGC CGCAGGGATT CAATTGCAAG TGCGCTTAGG TGACGAGGTA ACACGCGGAC AATCATTGAT GTTTTTGCAT GCGCAAACCT CTGGAGAGAT GGCCTATGCA CTCGCATACG TGCATGACAT TGGTGACATC GTAAAGATTG AACCTTAG
Protein sequence | MSARITPQTP ALQALRMRLH AQHQPVVLMR TDCHVCRAEG LAPRSQVLII AGDRTVQALL YQIDSDLLKT GQIALSEAAW DALDIHEGDL VQVRHPPLLE SLSAVRARIH GHRLQTTELQ AIVRDVVDGR YTDVALSAFL TATAVLPLDM QETIHLTRAM VDVGDHLQWQ APIVVDKHCV GGLPGNRTTP LVVAIAAANG LVMPKTSSRA ITSPAGTADT METLAPVDLD LDTLRKVVEK EGGCVAWGGA MHLSPADDIF VRIERELDID TQGQLIASVL SKKIAAGATH IVIDIPVGPT AKVRSRETAE HLAHHLSEVA ASFGLVLRCL FTDGNQPVGR GIGPALEARD VLAVLRNEAD APQDLCDRVA LVAGAVLELG GVAKEGDGIR LAHETISSGR AWEKFQRICA AQGGFREPPQ ALYVEPLLAT TSGRAVHIDN RKLSRLAKLA GAPESPAAGI QLQVRLGDEV TRGQSLMFLH AQTSGEMAYA LAYVHDIGDI VKIEP
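
The two sequences above are printed in space-separated blocks of ten, and the gene begins with GTG while the protein begins with M. The following minimal sketch shows one way to strip the block spacing, compute GC content, and translate the CDS. It assumes translation table 11 uses the standard codon assignments, with the annotated start codon (here GTG) read as methionine. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (assumed to apply to table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS that starts at its annotated start codon (assumed)."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon
    residues = [CODON_TABLE[s[p:p + 3]] for p in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # initiator codon is read as Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


first_blocks = "GTGAGTGCCC GCATCACTCC CCAGACACCC"
print(translate_cds(first_blocks))  # MSARITPQTP
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow the GC content and Protein sequence rows to be checked in the same way.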