Gene RPB_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1118 
Symbol 
ID3910204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1287015 
End bp1288349 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID637883011 
Productdihydroorotase 
Protein accessionYP_484739 
Protein GI86748243 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.206041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCA CATTCGATAC CATTCTGAAA GGCGCCACGA TCGTCAACCA CGACGGCGAG 
GGCATCGGCG ACATCGGAAT CAGCGGCGGC CGGATCGCGG CGCTGGGCTC GCTCGGCGCC
GAAAAGGCCG GCGAGACGAT CGATTGCGGC GGCCTGCATG TGCTGCCGGG CGTGATCGAC
ACCCAGGTGC ATTTCCGCGA ACCGGGCCTC ACCCACAAGG AAGACCTCGA AACCGGCTCG
CGCAGCGCGG TGATGGGCGG CGTCACCGCG GTGTTCGAAA TGCCGAACAC CAACCCGCTG
ACCGTCACCG AAGAGACCTT CACCGACAAG GTGAAACGCG CCGAGCACCG GATGCATTGC
GACTTCGCGT TCTTCATCGG CGGCACCCGC GACAATGTCG ATGAGCTGCC GAAGCTCGAG
CGCGCCCGCG GCTGCGCCGG CGTCAAGGTG TTCATCGGCT CCTCGACCGG CAGCCTCTTG
GTCGAGGACG ATCCGAGCCT GCGCCGCATC CTCAACGTGA TCCAGCGCCG CGCCGCGTTC
CACGCCGAGG ACGAGTACCG GCTCGAGGAC CGCAAGGGCG AGCGCATCGA GGGCGATCCG
CGCTCGCATC CGGTGTGGCG CGACGACATC GCCGCATTGA CGGCGACGCA GCGGCTGGTG
GCGATCGCGC GTGAGACCGG CAAGCGCATC CACGTGCTGC ACGTCTCGAC CCGTCAGGAG
ATGGAGTTTC TGCGCGAGCA CAAGGACGTC GCGTCGGTCG AGGTGACGCC GCATCATCTG
ACGCTGGTCG GACCGGAGTG CTACGAGCGG CTCGGCACCA AGGCGCAGAT GAATCCGCCG
GTGCGCGATG CGTGGCATCG CGACGGCATC TGGCACGGCC TGGCGCAGGG CGTCGTCGAC
GTGCTCGGCT CGGATCATGC GCCGCACACG CTGGAGGAAA AGGCCAAGAC CTATCCGGCC
TCGCCGTCCG GCATGACCGG GGTGCAGACG CTGGTGCCGA CCATGCTCGA CCACGTCAAC
GCCGGAAAAT TGTCGCTGGC GCGCTTCGTC GATCTCACCA GCGCGGGGCC GGCGCGGCTG
TTTGGCATCG CCTGCAAGGG CCGCATCGCC GCCGGCTACG ACGCCGACTT CACGGTGGTC
GATCTGAAGC GCAGCGAGAC CATCACCAAC GACTGGGTCG CCTCGCGCGC CGGCTGGACG
CCCTATGACG GCGTCCGCGT CACCGGCTGG CCGGTCGGCA CCTTCGTGCG CGGCGCCAGG
GTGATGTGGC AGGGCGAACT CGCAACGCCC GCGACCGGCG AGCCGGTGCG GTTTCTGGAG
ACGCTGAAGC CGTAA
 
Protein sequence
MTATFDTILK GATIVNHDGE GIGDIGISGG RIAALGSLGA EKAGETIDCG GLHVLPGVID 
TQVHFREPGL THKEDLETGS RSAVMGGVTA VFEMPNTNPL TVTEETFTDK VKRAEHRMHC
DFAFFIGGTR DNVDELPKLE RARGCAGVKV FIGSSTGSLL VEDDPSLRRI LNVIQRRAAF
HAEDEYRLED RKGERIEGDP RSHPVWRDDI AALTATQRLV AIARETGKRI HVLHVSTRQE
MEFLREHKDV ASVEVTPHHL TLVGPECYER LGTKAQMNPP VRDAWHRDGI WHGLAQGVVD
VLGSDHAPHT LEEKAKTYPA SPSGMTGVQT LVPTMLDHVN AGKLSLARFV DLTSAGPARL
FGIACKGRIA AGYDADFTVV DLKRSETITN DWVASRAGWT PYDGVRVTGW PVGTFVRGAR
VMWQGELATP ATGEPVRFLE TLKP