Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_27841 |
Symbol | pyrD |
ID | 4777460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2449498 |
End bp | 2450676 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640088307 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_001018779 |
Protein GI | 124024472 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGC CATCACCTGC AAAGGTGTAC TCCACCGGTG GGTTGTATCG CCGTTGGCTG GGACCAATAC TCGCGAATGA TCAGGGCTTA GATCCGGAAC AGTTGACTCA AGCAGCATTG AGTGCCCTTA GCCAGACGTC CCTACGTAGA GATTGGCCAG GCGTATCAGC TGTCTTGGCA GCGATAGCGT TGGATTTGCA ACGCCATGAT CTGCGGCTTG AGCAAGTGTT GTTCGGCTGC CGGTTTCGTA ATCCTGTTGG ACTGGCAGCT GGCTTTGACA AAAACGGAGT GGCGGCCAGT ATCTGGGATC GCTTTGGATT TGGATTCGCC GAACTCGGCA CTGTGACTTG GCATGGGCAG ACCGGTAACC CTCGGCCCCG ACTCTTTCGC CTTGCTGCTG AGCAGGCCGC TTTGAACCGG ATGGGTTTCA ACAACAACGG TGCTGAGGTG ATGCGTCGCA CTTTGGAGAA ACAGGCTTTG CCTTCACCTG GCCAGCGTCC AGCGGTGCTG GGTCTCAACT TGGGCAAATC CAGGATCACG CCATTGGAGC AGGCCCCAGA CGACTACGCC TTATCGCTTG AGTTGCTGGC ACCATTGGCG GATTACGCCG TGATCAATGT CAGCTCGCCT AATACACCCG GCTTGCGTGA TCTCCAGGAT GCCAGCCAGT TAAGGCGTCT GGTGGAGCGA TTACGACGAT TGCAAGGATG CCCGCCACTA CTGGTGAAGA TCGCTCCGGA TCTCGAGGAT GATGCCATTG ATGGCCTGGC TCGCTTGGCT TATGAGGAGG GTTTAGCAGG TGTCATTGCT GTCAATACCA GCCTCGACCG TTTCGGCCTC GACGGGCGGG TCCTTCCCAA AACGGGTCGC AGCCTGGCGG AGGAAGCTGG TGGGCTCAGT GGAGCGCCTC TGCGTCAACG GGCCTTGGAG GTGCTGCGTC GGCTGCGGGC CACTGCAGGT CCGGCGTTGC CCTTGATCGG CGTGGGAGGC ATCGACTCGC CAGAGGCTGC ATGGGAGCGA ATCAGTGCCG GAGCTTCACT TGTGCAGCTT TACACCGGTT GGATCTTTAA GGGTCCAGAT TTGGTGCCGA ATATTTTGGA TGGGCTGATC GGCCAGCTTG ACCGCCATGG CTTCCGGCAT GTGTCTGAGG CTGTCGGGAG CGGCGTGCCC TGGCAGTAG
|
Protein sequence | MAEPSPAKVY STGGLYRRWL GPILANDQGL DPEQLTQAAL SALSQTSLRR DWPGVSAVLA AIALDLQRHD LRLEQVLFGC RFRNPVGLAA GFDKNGVAAS IWDRFGFGFA ELGTVTWHGQ TGNPRPRLFR LAAEQAALNR MGFNNNGAEV MRRTLEKQAL PSPGQRPAVL GLNLGKSRIT PLEQAPDDYA LSLELLAPLA DYAVINVSSP NTPGLRDLQD ASQLRRLVER LRRLQGCPPL LVKIAPDLED DAIDGLARLA YEEGLAGVIA VNTSLDRFGL DGRVLPKTGR SLAEEAGGLS GAPLRQRALE VLRRLRATAG PALPLIGVGG IDSPEAAWER ISAGASLVQL YTGWIFKGPD LVPNILDGLI GQLDRHGFRH VSEAVGSGVP WQ
|
| |