Gene RPC_4330 details


Gene Information

Locus tag: RPC_4330
Symbol:
ID: 3971518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4827406
End bp: 4828740
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 66%
IMG OID: 637927439
Product: dihydroorotase
Protein accession: YP_534172
Protein GI: 90425802
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
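
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4827406
end_bp = 4828740
gene_length_bp = 1335
protein_length_aa = 444

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks should agree with the listed values.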


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.436145
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCCAAC GTTTCGATAC GATTCTGAAA TCCGGCACCA TCGTCAATCA AGACGGCGAG 
GGCACCGGCG ATATCGGCAT CACCGCCGGC AAGATCGCAG CGCTCGGCGA CCTCGGCCAG
GCCTCGGCCG ACGAGCTGAT CGATTGCCGC GGCCTGCACG TGCTGCCCGG GGTGATCGAC
ACCCAAGTGC ATTTCCGCGA GCCGGGATTG ACCCACAAGG AAGACCTGGA ATCCGGCTCG
CTGAGCGCGG TGATGGGCGG CGTCACCGCG GTGTTCGAAA TGCCCAACAC CAATCCGTTG
ACCGTCACCG CAGAGGCCTT TGCCGACAAG GTGAAGCGCG GTGAGCATCG CATGCATTGC
GATTTCGCGT TCTACATCGG CGGCACCCGC GACAACGTCG CCGAGCTGCC GGACTTGGAG
CGCGCCCGCG GCTGCGCCGG CGTCAAGGTG TTCATCGGCT CCTCCACCGG CAGCCTGCTG
GTCGAGGACG ACGACAGCAT CCGCAAAATC CTGCAGGTGA TCCAGCGCCG CGCCGCGTTT
CACGCCGAGG ACGAATATCG CCTCAACGAT CGCAAGGCGC TGCGCATCGA GGGCGATCCG
CGCTCGCATC CGGTGTGGCG CGACGAGATC GCGGCATTGA CCGCAACGCA GCGCCTCGTG
GCACTGGCGC GCGAGACCGG CAAGCGGATC CACGTGCTGC ACGTCTCGAC CAAACAGGAG
ATCGAATTTT TGCGCGAGCA CAAGGATGTG GCTTCAGTTG AAGTGACGCC GCATCACCTG
ACGCTGGCCG CGCCGGACTG CTACGAGCGG CTCGGCACGC TGGCGCAGAT GAATCCGCCG
GTGCGCGACG CGGCGCATCG CGACGGCATT TGGCATGGCG TGGCGCAGGG CATCGTCGAC
GTGCTCGGCT CCGACCACGC ACCGCACACG CTGGAAGAAA AATCCAAGAC CTATCCGGCG
TCGCCGTCCG GCATGACCGG GGTGCAGACG CTGGTGCCGT TGATGCTGGA TCACGTCAAC
GCCGGCAAAT TATCGCTGGC GCGATTCGTG GACCTCAGCA GCGCCGGCCC GGCGCGGCTG
TTCAACATCG CCTGCAAGGG CCGCATCGCC GCGGGCTATG ACGCCGATTT CACGATCGTC
GATCTGCAGC GCAGCGAGAC CATCAGCAAC GCCTGGACCG CATCGCGCGC CGGCTGGACG
CCCTATGATG GGGTGACGGT GAAAGGCTGG CCGGTCGGCA CTTTCGTGCG CGGCGCCAAG
GTGATGTGGC AGGGCGAATT GCTGACGCCG TCGACCGGCG AGCCGGTGCG GTTTCTGGAA
ACGCTGGGGG CATAG
 
Protein sequence
MTQRFDTILK SGTIVNQDGE GTGDIGITAG KIAALGDLGQ ASADELIDCR GLHVLPGVID 
TQVHFREPGL THKEDLESGS LSAVMGGVTA VFEMPNTNPL TVTAEAFADK VKRGEHRMHC
DFAFYIGGTR DNVAELPDLE RARGCAGVKV FIGSSTGSLL VEDDDSIRKI LQVIQRRAAF
HAEDEYRLND RKALRIEGDP RSHPVWRDEI AALTATQRLV ALARETGKRI HVLHVSTKQE
IEFLREHKDV ASVEVTPHHL TLAAPDCYER LGTLAQMNPP VRDAAHRDGI WHGVAQGIVD
VLGSDHAPHT LEEKSKTYPA SPSGMTGVQT LVPLMLDHVN AGKLSLARFV DLSSAGPARL
FNIACKGRIA AGYDADFTIV DLQRSETISN AWTASRAGWT PYDGVTVKGW PVGTFVRGAK
VMWQGELLTP STGEPVRFLE TLGA
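
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to estimate GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases in a DNA string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence in this record, used as a short example.
first_line = "ATGACCCAAC GTTTCGATAC GATTCTGAAA TCCGGCACCA TCGTCAATCA AGACGGCGAG "
dna = clean_sequence(first_line)

print(len(dna))                    # number of bases after removing the spaces
print(round(gc_fraction(dna), 3))  # GC fraction of this one line only
```

Passing the full gene sequence block to `clean_sequence` in the same way would allow the result to be compared with the Gene Length and GC content fields of the record.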