Gene Information |
Locus tag | A9601_02171 |
Symbol | pyrD |
ID | 4716901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 202905 |
End bp | 204074 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640077916 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_001008612 |
Protein GI | 123967754 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAC AGAAGGGTGG ATTTAAAAAT CTTTATAAAA ACTTGATTAC CCCTGTATTA CAAAAAGACT CTGGAATTGA TGCAGAATAC TTAACAAATT TATCTCTTAG TCTCCTATCA TTCAGTTCAA GAAAATATAA TTGGCCTATA GTATCTTCTA TCTTAAAAAA TCTAAATGAA GAATTTTCTG TAGTTGATAA AAGGTTAACT CAGAAGATAT GTGGAATAAA TTTTTGTAAT CCAATTGGTT TAGCTGCGGG TTTTGACAAA AATGGAAATG CCGCAAATAT ATGGAAAGAT TTTGGTTTTG GATTTGCTGA GCTTGGAACA GTAACTAAAT TTGCTCAGGA TGGAAATCCC AAACCAAGGT TATTTAGATT GGCAGAAGAA GAAGCAGCAT TAAATAGAAT GGGTTTCAAT AATAATGGTG CTGAAAATCT AGTTAAAAAC TTTGTCGAAC AAGGTATTGA GTTTAAAAAA AACAGGGAGA ATATTTGTTT AGGGATAAAT TTCGGGAAGT CAAAAATTAC AGGTTTATCT CAAGCAAAAG ATGATTATTT AACTTCTCTA AAATTATTAA TTCCATATTG TGATTACGCA GCAATAAACG TTAGTTCTCC AAATACTGAA GGACTAAGAA AATTACAAGA TCCAATTCTT CTAAAAGAAC TTCTTAGAGA AATTAAAAAC TTACCTAATT GTCCACCATT ATTTGTAAAA ATTGCACCAG ATTTAGGCCT TAAAGATATT GAAGATATTT GCAAATTAAT AATCGAGGAG AACATCGATG GAATAATTGC TACAAATACC AGCATTGATA GATTAGGTCT TGAAAATAGA AAAATCAAGC AAACTGGATT ATTACTCTCT CAAGAGAATG GAGGATTAAG TGGAAAGCCG CTCCAAAAAA AAGCAAATCA AATAATAAAA CATATACATA ATATTGATAA AAAGATTATT TTAATTGGGG TTGGTGGAAT AGATAGTCCT GAGTCAGCTT GGGAAAGAAT TTGTTCTGGA GCATCATTAA TTCAACTTTA TACAGGATGG ATATATAAGG GTCCACAATT AGTACCAGAT ATACTTGAGG GAATTATAAA GCAACTCAAT AACCATCAAT TATCTAGTAT AAAAGATGCA ATTGGATCAG ATTTAAAATG GATTGAATAA
|
Protein sequence | MNEQKGGFKN LYKNLITPVL QKDSGIDAEY LTNLSLSLLS FSSRKYNWPI VSSILKNLNE EFSVVDKRLT QKICGINFCN PIGLAAGFDK NGNAANIWKD FGFGFAELGT VTKFAQDGNP KPRLFRLAEE EAALNRMGFN NNGAENLVKN FVEQGIEFKK NRENICLGIN FGKSKITGLS QAKDDYLTSL KLLIPYCDYA AINVSSPNTE GLRKLQDPIL LKELLREIKN LPNCPPLFVK IAPDLGLKDI EDICKLIIEE NIDGIIATNT SIDRLGLENR KIKQTGLLLS QENGGLSGKP LQKKANQIIK HIHNIDKKII LIGVGGIDSP ESAWERICSG ASLIQLYTGW IYKGPQLVPD ILEGIIKQLN NHQLSSIKDA IGSDLKWIE
|
| |
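The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names (`span_length`, `expected_protein_length`, `translate`) are hypothetical helpers introduced for illustration. It assumes the coordinates are 1-based and inclusive, that the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TAA), and that translation table 11 assigns the same amino acids to codons as the standard genetic code.

```python
# Values copied from the record above.
START_BP = 202905
END_BP = 204074
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one, because the stop codon is not translated."""
    return gene_length_bp // 3 - 1


def translate(dna):
    """Translate a coding-strand sequence, ignoring spaces, up to the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)
```

Applied to this record, `span_length(START_BP, END_BP)` gives 1170, matching the Gene Length field, and `expected_protein_length(GENE_LENGTH_BP)` gives 389, matching the Protein Length field. Translating the first three 10-base blocks of the gene sequence with `translate` yields `MNEQKGGFKN`, the first block of the listed protein sequence.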