Gene RPD_1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1239 
Symbol 
ID4021715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1403417 
End bp1404751 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID637961431 
Productdihydroorotase 
Protein accessionYP_568378 
Protein GI91975719 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.851425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA CATTCGACAC CGTTCTGAAG GGCGCCACCA TCGTCAACCA CGATGGCGAG 
GGCGTCGGCG ACATCGGCAT CACCGGCGGC CGGATTGCGG CGCTGGGCTC GCTCGGCGCC
GCGTCGGCGG GAGAGACGAT CGACTGCCGT GGGCTGCACG TGCTGCCCGG CGTGATCGAT
ACCCAGGTGC ATTTCCGCGA GCCGGGTCTG ACCCACAAGG AAGATCTCGA AACCGGCTCG
CGCAGCGCGG TGCTGGGCGG CGTCACCGCG GTGTTCGAAA TGCCGAATAC CAATCCGCTC
ACGGTGACGG CGGAGACTTT CGCCGACAAG GTGAAGCGCG CCGAACATCG GATGCATTGC
GACTTCGCCT TCTTCATCGG CGGCACCCGC GACAATGTCG ATGAGCTGCC GCAGCTCGAG
CGCGCCCGCG GCTGCGCCGG CGTGAAGGTG TTCATCGGCT CGTCGACCGG CAGCCTGCTG
GTCGAGGACG ATCCGAGCCT GCGGCGGATC CTCAGCGTGA TCCAGCGTCG CGCCGCGTTT
CATGCCGAGG ACGAATATCG CCTCAACGAC CGCAAGGGCG AACGGATCGA GGGCGATCCG
CGCTCGCATC CGGTGTGGCG CGACGACATC GCCGCCTTGA CCGCGACGCA GCGGCTGGTG
GCGATCGCGC GCGAGACCGG CAAGCGCATC CACGTGCTGC ACGTCTCGAC CCGGCAGGAG
ATGGAGTTTT TGCGTGACCA CAAGGACGTC GCTTCGGTCG AAGTGACGCC GCACCACCTC
ACGCTTGTGG CGCCGGATTG TTACGAGCGG CTCGGCACCC GCGCGCAGAT GAACCCGCCT
GTGCGCGACG CTTGGCATCG CGACGGCCTG TGGCACGGGC TGGCGCAGGG CGTCGTCGAC
GTGCTCGGCT CCGATCATGC GCCGCACACG ATCGAGGAGA AGGCCAAGAC CTACCCGGCC
TCGCCCTCCG GCATGACCGG CGTGCAGACG CTGGTGCCGA CCATGCTGGA TCACGTCAAT
GCCGGGAAAT TGTCGCTGGC CCGCTTCGTC GATCTCACCA GCGCCGGGCC GGCGCGGCTG
TTCAACATCG CCTGCAAGGG CCGGATCGCC GCCGGCTACG ACGCCGATTT CACCGTGGTC
GACCTCAAGC GCAGCGAGAC CATCACCAAC GAGCAGGTCG CCTCGCGGGC CGGCTGGACG
CCTTATGACG GCGTCCGCGT CACCGGCTGG CCGGTCGGCA CTTTCGTGCG CGGCGCCAAG
GTGATGTGGC AGGGCGAATT GACGACGCCA TCGACCGGCG AGCCGGTGCG CTTCCTCGAG
ACGCTGAAGT CCTGA
 
Protein sequence
MTATFDTVLK GATIVNHDGE GVGDIGITGG RIAALGSLGA ASAGETIDCR GLHVLPGVID 
TQVHFREPGL THKEDLETGS RSAVLGGVTA VFEMPNTNPL TVTAETFADK VKRAEHRMHC
DFAFFIGGTR DNVDELPQLE RARGCAGVKV FIGSSTGSLL VEDDPSLRRI LSVIQRRAAF
HAEDEYRLND RKGERIEGDP RSHPVWRDDI AALTATQRLV AIARETGKRI HVLHVSTRQE
MEFLRDHKDV ASVEVTPHHL TLVAPDCYER LGTRAQMNPP VRDAWHRDGL WHGLAQGVVD
VLGSDHAPHT IEEKAKTYPA SPSGMTGVQT LVPTMLDHVN AGKLSLARFV DLTSAGPARL
FNIACKGRIA AGYDADFTVV DLKRSETITN EQVASRAGWT PYDGVRVTGW PVGTFVRGAK
VMWQGELTTP STGEPVRFLE TLKS