Gene Information |
Locus tag | RPD_1443 |
Symbol | |
ID | 4021920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1611744 |
End bp | 1613288 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637961635 |
Product | thymidine phosphorylase |
Protein accession | YP_568581 |
Protein GI | 91975922 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
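The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the Protein Length excludes the terminal stop codon; both are assumptions about this record's conventions, not something the record states. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1611744
end_bp = 1613288
gene_length_bp = 1545
protein_length_aa = 514


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for RPD_1443 under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 1613288 - 1611744 + 1 gives 1545 bp, and 1545 / 3 - 1 gives 514 aa, which agrees with the listed values.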
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0271925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAACGCTT CCCACCTGTC CCGTCCACCG CTGACGATCC GGCGGATCAG CCTCGATACC GGCCGCGAAA ATGTCGCGGT GATCTCGCGC CGCTCGCGGG CGTTGCGCGC CGACGTCTTT CGTGGCTTCA GCCGGGTCGC GTTGCGCATC AACACCAAGG TGCTGCTGGC GACGCTGATG ATCACCGACG ACGATGCGAT GATCGGCCCC GACGAACTCG GCCTGTCCGA GCCGGCGTTT CGCCGCTTCA ACGAACCGGT TGGCAGCGCG GTGAGCGTGT CGCCGGCGCA GCCGCCGGCC AGCCTCGACG CGGTGCGCGC CAAGATCCAG GGCCACACGC TGACGCCCGC GGAAATCACC GCTGTCATCG ACGACGTCGC ACACTTCCGC TACTCCGACA TGGAGATCGC GGCGTTCCTG ATCTCCGCCG CACGCTTCAC CTCGACCGAT GAATTGCTGG CGCTGGTCGA CGCCATGGCC TCGGTCGGCA CCCGGTTGAA ATGGCAGAAC CCGATCGTGG TCGACAAGCA CTGCATCGGC GGCATTCCCG GCAACCGGAC CACCATGATC GTGGTGCCGA TCGTCGCCGC GCAGGGGCTG ATGATTCCCA AGACCTCGTC GCGCGCGATC ACCTCGCCGG CCGGCACCGC CGACACGATG GAAGTGCTGG CGCGCGTCGA TCTCGACGTC GAACAGATGA AGCGGGTGGT CACCACTTGC GGCGGCTGCC TGATATGGGG CGGCCACGTC AATCTGTCGC CAGCCGACGA CATCCTGATC TCGGTCGAAC GGCCGTTGAG CCTCGACACG CCGGAGCAGA TGGTCGCTTC GATCATGTCG AAGAAGCTCG CGGCAGGCTC GACGCGGCTC CTGATCGATT TTCCGGTCGG GCCGAGCGCC AAGATCGCGA GCGCTTCCGA AGCGATGCGG TTGCGCAAAC TGTTCGAATT CGTCGGCGAT CACTTCGGCA TCGCGGTCGA GGTGGTAACC ACCGATGGCC GACAGCCGAT CGGCCGTGGC ATAGGCCCGG TGCTGGAAGC GCGCGACGTG ATGGCCGTGC TCGGCAACGA GCCGAACGCG CCGGCAGATC TGCGCGAAAA ATCGCTGCGG CTCGCCGCGC ATCTGCTCGA ATACGACCCT CTGCTGCGGG GCGGCGCCGG TTACGCACGG GCGAAGGAGC TGCTCGACAG CGGCGCGGCG CTGAAGAAGA TGCAGCAGAT CATCGACGCT CAGGGGCCGT CCGTGTGCAA CTCCGAACTC GGCAGCCATG CCGCCGACGT GCTCGCGCCG GCCGACGGCG TGGTCAACGG CATCGACTGC CTGCGCATCA ACCGCCTCGC CCGCACCGCG GGGGCGCCGA TCGTGAAGGG CGCGGGCATC GATCTGTTCA AGAAGATCGG CGACCGCGTC GATCAGGGAG AACCGATCTA TCGCATCTAC GCCTCCGACC GCTCCGAGCT GGACCTTGCG ATCGCGGCCG CGGAAGCGGA GTCCGGTTTC TCGGTCAACC ATCACTCTCC CGCCGCCGTG GAACCGGTGT CGTGA
Protein sequence | MNASHLSRPP LTIRRISLDT GRENVAVISR RSRALRADVF RGFSRVALRI NTKVLLATLM ITDDDAMIGP DELGLSEPAF RRFNEPVGSA VSVSPAQPPA SLDAVRAKIQ GHTLTPAEIT AVIDDVAHFR YSDMEIAAFL ISAARFTSTD ELLALVDAMA SVGTRLKWQN PIVVDKHCIG GIPGNRTTMI VVPIVAAQGL MIPKTSSRAI TSPAGTADTM EVLARVDLDV EQMKRVVTTC GGCLIWGGHV NLSPADDILI SVERPLSLDT PEQMVASIMS KKLAAGSTRL LIDFPVGPSA KIASASEAMR LRKLFEFVGD HFGIAVEVVT TDGRQPIGRG IGPVLEARDV MAVLGNEPNA PADLREKSLR LAAHLLEYDP LLRGGAGYAR AKELLDSGAA LKKMQQIIDA QGPSVCNSEL GSHAADVLAP ADGVVNGIDC LRINRLARTA GAPIVKGAGI DLFKKIGDRV DQGEPIYRIY ASDRSELDLA IAAAEAESGF SVNHHSPAAV EPVS
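The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch of that translation in plain Python, not the method used to produce this record. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and the start codon set is an assumption based on the NCBI definition of table 11.

```python
# Amino acids for all 64 codons in NCBI order (bases T, C, A, G at each
# position). Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Assumed initiation codons for table 11; a start codon is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is translated as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-nt blocks of the gene sequence above.
prefix = "GTGAACGCTT CCCACCTGTC CCGTCCACCG"
print(translate_cds(prefix))  # MNASHLSRPP
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence, with or without the spacing between blocks, should work the same way, since whitespace is stripped before translation.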