Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4625 |
Symbol | |
ID | 6412311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4985098 |
End bp | 4986642 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714504 |
Product | thymidine phosphorylase |
Protein accession | YP_001993591 |
Protein GI | 192292986 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATGCTC CCGATCTCAC CCGCCCGCCG CTGACGATCC GGCGGATCAG CCTCGACACC GGCCGCGAAA ACGTCGCGGT GATTTCCCGC CGCTCGCGCG CCCTGCGCCC CGAGGTGTTT CGCGGCTTCA GCCGCGTCGA ACTGCGCATC AACTCCAAGG TGCTGCTGGC GACGCTGATG ATCACCGACG ACGACGCGAT GATCGGCCCC GACGAAGTCG GCCTGTCCGA GCCGGCGTTC CGCCGCTTCA ATGAGCCGGT CGGCAGTGCC GTGTCGGTGA CGCCGGCGCG GTCGCCCGCA AGCCTCGACG CGGTACGCGC CAAAATCCAG GGCCACACGC TATCCGCGGC GGAGATCACC GCGATCGTCG ATGACCTCGC GCACTTCCGT TACTCCGACA TGGAAATCGC CGCGTTCCTG ATCAGCGCCG CGCGCTTCAC CACGACCGAC GAACTGCTGG CGCTGGTCGG CGCGATGGCC TCGGTCGGGA CCAAACTGAC ATGGGACACG CCGATCGTCG TCGACAAGCA CTGCATCGGC GGCATTCCCG GCAACCGCAC CACCATGATC GTGGTGCCGA TCGTCGCCGC GCACGGGCTG ATGATTCCGA AGACCTCGTC GCGCGCCATC ACCTCGCCGG CCGGCACCGC CGACACCATG GAACTGCTGG CGCGGGTCGA TCTCGACGTC GAGCAGATGA AGCAGGTGGT ACATGCCTGC GGGGGCTGTT TGGTGTGGGG CGGCCACGTC AACCTGTCGC CTGCCGACGA CATCCTGATA TCGGTCGAGC GACCGCTCAG CCTCGACACG CCGGAGCAAA TGGTCGCCTC GATCATGTCG AAGAAGCTCG CCGCCGGCTC GACCCGGCTG CTGATCGACT TCCCGGTCGG CCCGTCCGCC AAGGTCACGA GCGCCAACGA GGCGATGCGG CTGCGCAAGC TGTTCGAGTT CGTCGGCGAT CATTTCGGGA TCAGCGTCGA AGTGGTGACC ACCGACGGCC GACAGCCGAT CGGCCGCGGC ATCGGCCCGG TGCTGGAAGC CCGCGACGTG ATGGCGGTTC TGGGCAACAA GCCCGGCGCA CCCGCTGACT TGCGCGAAAA ATCGCTGCGG CTCGCTGCGC ATCTGCTTGA ATACGACCCG AAGCTGCGCG GCGGCACCGG CTATGCCCGC GCCAAGGAGC TGCTCGACAG CGGGGCTGCG CTGAAGAAGA TGCAGCAGAT CATCGACGCT CAGGGGCCTT CGCCGTGTCC GGCCGAGCTC GGCAGCTACG CCGCCGATGT ACTCGCTGCG GCCGATGGCG TGGTCAACGG CATCGACTGT CTGCGCATCA ACCGCCTCGC CCGCAGCGCC GGCGCGCCGG TCGCCAAGGG CGCCGGGATC GATCTGTTCA AGAAGATCGG CGACCGCGTC GAAAAGGGCG AGCCGCTGTA TCGGGTTTAT GCGTCCGACC GCTCCGAATT CGATCTGGCG CTGGCAGCCG CACAGGCGGA GTCCGGCTTC GCGATCAATC ATCACACGCC CGCCGACGTG GATCTGGTGT CGTGA
|
Protein sequence | MNAPDLTRPP LTIRRISLDT GRENVAVISR RSRALRPEVF RGFSRVELRI NSKVLLATLM ITDDDAMIGP DEVGLSEPAF RRFNEPVGSA VSVTPARSPA SLDAVRAKIQ GHTLSAAEIT AIVDDLAHFR YSDMEIAAFL ISAARFTTTD ELLALVGAMA SVGTKLTWDT PIVVDKHCIG GIPGNRTTMI VVPIVAAHGL MIPKTSSRAI TSPAGTADTM ELLARVDLDV EQMKQVVHAC GGCLVWGGHV NLSPADDILI SVERPLSLDT PEQMVASIMS KKLAAGSTRL LIDFPVGPSA KVTSANEAMR LRKLFEFVGD HFGISVEVVT TDGRQPIGRG IGPVLEARDV MAVLGNKPGA PADLREKSLR LAAHLLEYDP KLRGGTGYAR AKELLDSGAA LKKMQQIIDA QGPSPCPAEL GSYAADVLAA ADGVVNGIDC LRINRLARSA GAPVAKGAGI DLFKKIGDRV EKGEPLYRVY ASDRSEFDLA LAAAQAESGF AINHHTPADV DLVS
|
| |