Gene P9301_02191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02191 
SymbolpyrD 
ID4911905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp203894 
End bp205063 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content31% 
IMG OID640159785 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001090443 
Protein GI126695557 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAC AGCAGGGGGT ATTTAAAAAT CTTTATAAAA ACTTGATGAC ACCTATACTA 
AAAAATGATT CTGGAATAGA TGCAGAATAT TTAACTAATT TATCTCTTAG CCTTCTATCA
TTTAGTTCAA GAAAATATAA TTGGCCTTTA GTTTCATCAA TCCTAAAAAA TTTAAATGAA
GAATTTTCTG TAATTGATAA AAGGTTAAGT CAAAACATAT GTGGAATAAA TTTTTGTAAT
CCAATTGGTT TAGCTGCAGG TTTTGACAAA AATGGGAATG CTGCAAATAT ATGGAAAGAT
TTTGGTTTTG GATTTGCAGA ACTTGGAACA GTAACTAAAT TTGCTCAAAA TGGCAATCCA
AAACCAAGGT TATTTAGATT AGCTGAAGAA GAAGCAGCAT TAAATAGGAT GGGTTTCAAT
AATAATGGTG CTGAAAATCT GGTTAAAAAT TTTCTTGAGC AGGGTATCGA GTTCAAAAAA
AATAGGGATA ATATTTGTTT AGGGATAAAT TTCGGTAAGT CTAAAATCAC AAGCTTATCT
CAAGCAAAAG ATGATTATTT AACTTCTCTA GAATTATTAA TTCCATATTG TGATTACGCA
GCAATAAACG TAAGTTCTCC AAATACTGAA GGACTAAGAA AGTTGCAAGA TCCAATACTT
CTAAAAGACC TTCTTAGAGC AATTAAAAAC TTACCTAATT GTCCACCATT ATTTGTAAAA
ATTGCGCCAG ATTTAAGCCT TAAAGATATT GAAGATATTT GCAAGTTAAT AATCGAGGAA
AATATAGATG GGATAATTGC TACTAACACC AGCATAGATA GATTAGGTCT TGAAAATAGG
AAGATAAGGC AAACAGGATT ATTACTTTCT GAAGAAAATG GAGGCTTAAG TGGAAAACCT
TTACAAAAAA AAGCAAATCA AGTCATAAAA GATATTCGTA ATATTGATAA AAATATTATT
TTAATTGGCG TTGGTGGAAT CGATAGTCCT GAGTCGGCTT GGGAAAGAAT TTGTTCTGGA
GCATCATTAA TTCAACTTTA TACGGGATGG ATATATAAGG GGCCACAATT AGTCCCCAAT
ATACTTGAAG GAATTTTAAA GCAACTCAAT ATCCATCAAT TGTCCAATAT TAAAGAGGCC
ATTGGATCAG ATTTAAAATG GGTTAAATAA
 
Protein sequence
MNEQQGVFKN LYKNLMTPIL KNDSGIDAEY LTNLSLSLLS FSSRKYNWPL VSSILKNLNE 
EFSVIDKRLS QNICGINFCN PIGLAAGFDK NGNAANIWKD FGFGFAELGT VTKFAQNGNP
KPRLFRLAEE EAALNRMGFN NNGAENLVKN FLEQGIEFKK NRDNICLGIN FGKSKITSLS
QAKDDYLTSL ELLIPYCDYA AINVSSPNTE GLRKLQDPIL LKDLLRAIKN LPNCPPLFVK
IAPDLSLKDI EDICKLIIEE NIDGIIATNT SIDRLGLENR KIRQTGLLLS EENGGLSGKP
LQKKANQVIK DIRNIDKNII LIGVGGIDSP ESAWERICSG ASLIQLYTGW IYKGPQLVPN
ILEGILKQLN IHQLSNIKEA IGSDLKWVK