Gene Information |
Locus tag | RPC_4089 |
Symbol | |
ID | 3973178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4542803 |
End bp | 4544344 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927193 |
Product | thymidine phosphorylase |
Protein accession | YP_533934 |
Protein GI | 90425564 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
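The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4542803
end_bp = 4544344
gene_length_bp = 1542
protein_length_aa = 513

# Assumption: coordinates are 1-based and inclusive on the replicon.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span == gene_length_bp)         # True
print(remainder == 0)                 # True
print(residues == protein_length_aa)  # True
```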
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0836613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCG CCGACGCGCC CCGCCCGCAA CTCAAGATCC GCCTGATCCA TCTCGACACC GGCCGCGAAA ACGTCGCGGT GATGTCGCGG CGCTCAAAGG CGCTGCGCCC GGAGGTGTTC AGCGGCTTCA GCCGGGTCGA GATCCGCCGC AATGCGAAGT CGGTGCTGGC GACCTTGCTG ATTACCGACG ACGACGCGCT GGTCGGGCCG GACGAACTCG GCCTCGCCGA GCCGGCGTTC CGGCGTTTCG GCGAACCCGC CGGGACCTTC GTCACGGTGT CGCCGGCGAC GCCGCCGGAC AGTCTCGACG CGGTGCGCGG CAAGATCCAG GGCCGCACGC TGAGCGCGGC GGAGATCACC GCCATCGTCA ACGATCTCAC CCGCTATCGT TATTCCGACA TGGAGATCGC CGCCTTCCTG ATCGGCGCGG CGCGGTTCAT GACTTCGGAC GAACTGCTGG CGCTGGTGTC GGCGATGGCC TCGGTCGGCA CTCAGTTGCG CTGGGATCGC CCGATCGTGG TCGACAAGCA TTGCATCGGC GGCATTCCCG GCAATCGCAC CTCGATGATC CTGGTGCCGA TCGTCGCCGC GCACGGGCTG ACGATTCCGA AAACCTCGTC GCGCGCCATC ACCTCGCCGG CCGGCACCGC CGATACCATG GAGGTGTTGG CGCGGGTCGA TGTCAGCGTC GCGGAAATGA AAGAGATCGT CGCCGCCTGC AACGGCTGTC TGATCTGGGG CGGCCACGTC AATCTCTCGC CCGCCGATGA CATCTTGATC TCGGTGGAGC GGCCGCTGTG TCTCGACACC CGCGAGCAGA TGGTGGCCTC GATCATGTCG AAGAAGCTCG CCGCCGGCTC GACGCATCTG TTGGTCGATC TGCCGGTCGG CCCGACCGCC AAGGTCGCCA GCGCACTCGA CGCGATGCGG CTGCGCAAGC TGTTCGAATT CGTCGGCGAT CATTTCGGCA TCGCGGTGGA GACCATCACC ACCGACGGCC GGCAGCCGAT CGGCAACGGC ATCGGCCCGG TGCTGGAGGC GCAGGACGTC ATGGCGGTGC TCGGCAACGA TCCGAAGGCT CCCGCCGATC TGCGCGAAAA ATCGCTACGG TTGGCGGCGC ATCTGTTGGA ATACGATCCG CATCTGCGCG GCGGCGCGGG CTACGCGCGG GCCCGCGAGT TGCTGGAAAG CGGCGCGGCG CTGAAGCAGA TGCAGAAGAT CATCGACAAT CAGGGGCCTT CGACCTGTCA CAAGGACCTC GGAACGCTGA CCGCCGAGGT GACGGCGGAG CGCGACGGCG TGGTGTCGGC GATCGATTGC CTGCAGCTCA ATCGGCTGGC GCGCACCGCC GGCGCGCCGA TCGACAAGGG CGCCGGCATC CGGCTGTTCA AGAAGGTCGG CGATCGTGTC GAGGCCGGCG AGCCGCTGTA TCGAATTTAC GCCTTCGATC CGGCCGAGCG CGAACTCGCG GTGGCTGCCG CCAAGCTCGC CTGCGGTTAC ACGGTCGACG ACGCGCAAAC CTTTCGCGAG CAGGTGATGT AG
|
Protein sequence | MTSADAPRPQ LKIRLIHLDT GRENVAVMSR RSKALRPEVF SGFSRVEIRR NAKSVLATLL ITDDDALVGP DELGLAEPAF RRFGEPAGTF VTVSPATPPD SLDAVRGKIQ GRTLSAAEIT AIVNDLTRYR YSDMEIAAFL IGAARFMTSD ELLALVSAMA SVGTQLRWDR PIVVDKHCIG GIPGNRTSMI LVPIVAAHGL TIPKTSSRAI TSPAGTADTM EVLARVDVSV AEMKEIVAAC NGCLIWGGHV NLSPADDILI SVERPLCLDT REQMVASIMS KKLAAGSTHL LVDLPVGPTA KVASALDAMR LRKLFEFVGD HFGIAVETIT TDGRQPIGNG IGPVLEAQDV MAVLGNDPKA PADLREKSLR LAAHLLEYDP HLRGGAGYAR ARELLESGAA LKQMQKIIDN QGPSTCHKDL GTLTAEVTAE RDGVVSAIDC LQLNRLARTA GAPIDKGAGI RLFKKVGDRV EAGEPLYRIY AFDPAERELA VAAAKLACGY TVDDAQTFRE QVM
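Although the gene lies on the minus strand, the gene sequence above is shown in coding orientation (it begins with ATG and ends with TAG), so it can be translated directly. The following is a minimal sketch of that relationship, not the database's own pipeline. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares with the standard code for elongation codons; table 11 differs in which start codons it permits. Only the first 30 and last 12 bases of the gene are used here to keep the example short.

```python
from itertools import product

# Standard NCBI ordering: first base varies slowest, bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and final 12 bases of the gene sequence above.
head = "ATGACCAGCG CCGACGCGCC CCGCCCGCAA"
tail = "CAGGTGATGT AG"

print(translate(head))  # MTSADAPRPQ
print(translate(tail))  # QVM
```

The two outputs match the first ten and last three residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field; that comparison is not carried out here.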