Gene RPD_0904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0904 
Symbol 
ID4021378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1017635 
End bp1018801 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content67% 
IMG OID637961094 
Product2'-deoxycytidine 5'-triphosphate deaminase 
Protein accessionYP_568043 
Protein GI91975384 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0717] Deoxycytidine deaminase 
TIGRFAM ID[TIGR02274] deoxycytidine triphosphate deaminase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGT GTATTTTTTC CGCGTCTGGC GGCCCTGCGG CCGCTGTGAG GATCGCCGCC 
GTGCCGTTTG CGTTGCCGCC CGATGCCGAT GGAATTCTGC CCGACCGCAT GATTGCGGCG
ATGGCCGAGG CCGGGCTGAT CCTGCCGGAA TACCCGTTCG TCGAAAGCCA GATCCAGCCG
GCCAGCCTGG ATTTGCGGCT CGGCGCGGTC GCCTACCGGG TGCGCGCGAG CTTCCTGCCG
GGGCCGGACT GCACCGTCGC CGAGCGGATC GACGAGCTGA AGCTGCACGA GATCGATCTG
TCCGACGGCG CGGTGCTGGA GACCAACTGC GTCTACATCG TGCCGCTGCT GGAAAGCCTG
GCGCTGCCGC GCAGCATCGT CGCCGCCGCC AATCCGAAAA GCTCGACCGG GCGACTTGAT
GTCTTCACCC GCGTGATCGC CGACGGCACC CGCCGCTTCG ACATGATCGG CGCCGGCTAT
CACGGCCCGC TTTATGCGGA GATCAGCCCG AAGACCTTCC CGGTGCTGCT GCGCGAGGGC
TCGCGGCTGT CGCAGGTCCG CTTCCGCACC GGCCACGCCA CGCTCGACGC CGACGAGTTG
GATGCGCTGC ACGACCTCGA ACGGCTGGTC GATGCCGACG ACGCCGATCT CAATGGCGGC
GTCGCGCTCA GCGTCGATCT GTCGGGCGAA AACTCGAACG GCTTTGTCGG CTACCGCGCC
AAGCGTCACA CCGGCGTGGT CGATGTCGAC CGCCGCGGCG GCTATGCGGT CGGCGAGTTC
TGGGAGCCGA TCGCGGCGCG GCCGGACGGC ACGCTGATCC TCGATCCCGG CGAGTTCTAC
ATCCTCGCCT CGAAGGAAGC CGTCCAGGTG CCGCCGGACT ACGCCGCCGA GATGGTGCCG
TTCGACCCGC TGGTCGGCGA ATTCCGCGTG CACTATGCGG GCTTCTTCGA TCCCGGCTTC
GGCTATGAGG GCGCCGGCGG GCTCGGCTCG CGCGCGGTGC TGGAAGTGCG CTCGCGCGAG
GTGCCGTTCA TTCTCGAACA CGGCCAGATC GTCGGCCGCC TGATCTACGA AAAAATGCTG
TCCCGCCCCG CCTCGCTCTA CGGCCAGCGC ATCGGCTCGA ACTATCAGGG CCAGAGCCTG
AAGCTGAGCA AGCATTTCAA GGCGTAG
 
Protein sequence
MKSCIFSASG GPAAAVRIAA VPFALPPDAD GILPDRMIAA MAEAGLILPE YPFVESQIQP 
ASLDLRLGAV AYRVRASFLP GPDCTVAERI DELKLHEIDL SDGAVLETNC VYIVPLLESL
ALPRSIVAAA NPKSSTGRLD VFTRVIADGT RRFDMIGAGY HGPLYAEISP KTFPVLLREG
SRLSQVRFRT GHATLDADEL DALHDLERLV DADDADLNGG VALSVDLSGE NSNGFVGYRA
KRHTGVVDVD RRGGYAVGEF WEPIAARPDG TLILDPGEFY ILASKEAVQV PPDYAAEMVP
FDPLVGEFRV HYAGFFDPGF GYEGAGGLGS RAVLEVRSRE VPFILEHGQI VGRLIYEKML
SRPASLYGQR IGSNYQGQSL KLSKHFKA