Gene Rpal_4625 details


Gene Information

Locus tag: Rpal_4625
Symbol:
ID: 6412311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4985098
End bp: 4986642
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 67%
IMG OID: 642714504
Product: thymidine phosphorylase
Protein accession: YP_001993591
Protein GI: 192292986
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02645] putative thymidine phosphorylase
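
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4985098
end_bp = 4986642
gene_length_bp = 1545
protein_length_aa = 514

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
remainder = span % 3

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = span // 3 - 1

print(span, remainder, expected_protein_length)  # prints: 1545 0 514
```

Under these assumptions the span matches the listed gene length, and 1545 / 3 = 515 codons minus one stop codon gives the listed 514 aa.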


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAATGCTC CCGATCTCAC CCGCCCGCCG CTGACGATCC GGCGGATCAG CCTCGACACC 
GGCCGCGAAA ACGTCGCGGT GATTTCCCGC CGCTCGCGCG CCCTGCGCCC CGAGGTGTTT
CGCGGCTTCA GCCGCGTCGA ACTGCGCATC AACTCCAAGG TGCTGCTGGC GACGCTGATG
ATCACCGACG ACGACGCGAT GATCGGCCCC GACGAAGTCG GCCTGTCCGA GCCGGCGTTC
CGCCGCTTCA ATGAGCCGGT CGGCAGTGCC GTGTCGGTGA CGCCGGCGCG GTCGCCCGCA
AGCCTCGACG CGGTACGCGC CAAAATCCAG GGCCACACGC TATCCGCGGC GGAGATCACC
GCGATCGTCG ATGACCTCGC GCACTTCCGT TACTCCGACA TGGAAATCGC CGCGTTCCTG
ATCAGCGCCG CGCGCTTCAC CACGACCGAC GAACTGCTGG CGCTGGTCGG CGCGATGGCC
TCGGTCGGGA CCAAACTGAC ATGGGACACG CCGATCGTCG TCGACAAGCA CTGCATCGGC
GGCATTCCCG GCAACCGCAC CACCATGATC GTGGTGCCGA TCGTCGCCGC GCACGGGCTG
ATGATTCCGA AGACCTCGTC GCGCGCCATC ACCTCGCCGG CCGGCACCGC CGACACCATG
GAACTGCTGG CGCGGGTCGA TCTCGACGTC GAGCAGATGA AGCAGGTGGT ACATGCCTGC
GGGGGCTGTT TGGTGTGGGG CGGCCACGTC AACCTGTCGC CTGCCGACGA CATCCTGATA
TCGGTCGAGC GACCGCTCAG CCTCGACACG CCGGAGCAAA TGGTCGCCTC GATCATGTCG
AAGAAGCTCG CCGCCGGCTC GACCCGGCTG CTGATCGACT TCCCGGTCGG CCCGTCCGCC
AAGGTCACGA GCGCCAACGA GGCGATGCGG CTGCGCAAGC TGTTCGAGTT CGTCGGCGAT
CATTTCGGGA TCAGCGTCGA AGTGGTGACC ACCGACGGCC GACAGCCGAT CGGCCGCGGC
ATCGGCCCGG TGCTGGAAGC CCGCGACGTG ATGGCGGTTC TGGGCAACAA GCCCGGCGCA
CCCGCTGACT TGCGCGAAAA ATCGCTGCGG CTCGCTGCGC ATCTGCTTGA ATACGACCCG
AAGCTGCGCG GCGGCACCGG CTATGCCCGC GCCAAGGAGC TGCTCGACAG CGGGGCTGCG
CTGAAGAAGA TGCAGCAGAT CATCGACGCT CAGGGGCCTT CGCCGTGTCC GGCCGAGCTC
GGCAGCTACG CCGCCGATGT ACTCGCTGCG GCCGATGGCG TGGTCAACGG CATCGACTGT
CTGCGCATCA ACCGCCTCGC CCGCAGCGCC GGCGCGCCGG TCGCCAAGGG CGCCGGGATC
GATCTGTTCA AGAAGATCGG CGACCGCGTC GAAAAGGGCG AGCCGCTGTA TCGGGTTTAT
GCGTCCGACC GCTCCGAATT CGATCTGGCG CTGGCAGCCG CACAGGCGGA GTCCGGCTTC
GCGATCAATC ATCACACGCC CGCCGACGTG GATCTGGTGT CGTGA
 
Protein sequence
MNAPDLTRPP LTIRRISLDT GRENVAVISR RSRALRPEVF RGFSRVELRI NSKVLLATLM 
ITDDDAMIGP DEVGLSEPAF RRFNEPVGSA VSVTPARSPA SLDAVRAKIQ GHTLSAAEIT
AIVDDLAHFR YSDMEIAAFL ISAARFTTTD ELLALVGAMA SVGTKLTWDT PIVVDKHCIG
GIPGNRTTMI VVPIVAAHGL MIPKTSSRAI TSPAGTADTM ELLARVDLDV EQMKQVVHAC
GGCLVWGGHV NLSPADDILI SVERPLSLDT PEQMVASIMS KKLAAGSTRL LIDFPVGPSA
KVTSANEAMR LRKLFEFVGD HFGISVEVVT TDGRQPIGRG IGPVLEARDV MAVLGNKPGA
PADLREKSLR LAAHLLEYDP KLRGGTGYAR AKELLDSGAA LKKMQQIIDA QGPSPCPAEL
GSYAADVLAA ADGVVNGIDC LRINRLARSA GAPVAKGAGI DLFKKIGDRV EKGEPLYRVY
ASDRSEFDLA LAAAQAESGF AINHHTPADV DLVS
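
The sequences above are displayed in groups of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, using only the first and last lines of the gene sequence from this record. The helper names `clean` and `gc_percent` are hypothetical and not part of any database tool.

```python
def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "TTGAATGCTC CCGATCTCAC CCGCCCGCCG CTGACGATCC GGCGGATCAG CCTCGACACC"
last_line = "GCGATCAATC ATCACACGCC CGCCGACGTG GATCTGGTGT CGTGA"

head = clean(first_line)
tail = clean(last_line)

# The gene length (1545 bp) is a multiple of 3, so the first three bases
# are the start codon and the last three are the stop codon.
start_codon = head[:3]
stop_codon = tail[-3:]

print(start_codon, stop_codon)  # prints: TTG TGA
```

The gene begins with TTG rather than ATG. In translation table 11, TTG is one of the permitted alternative initiation codons, and an initiation codon is translated as methionine, which is consistent with the protein sequence starting with M. Applying `gc_percent` to the full cleaned gene sequence would give a value to compare with the listed GC content.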