Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_18131 |
Symbol | |
ID | 4775934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1578600 |
End bp | 1579664 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640087321 |
Product | dihydroorotase |
Protein accession | YP_001017820 |
Protein GI | 124023513 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0418] Dihydroorotase |
TIGRFAM ID | [TIGR00856] dihydroorotase, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.085007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATTCTT TGCCCGACCA ACTCATTTTG CGTCAACCGG ATGACTGGCA TGTCCATCTG AGGGATGGTG CCATGCTGCA TGCGGTTCTC GGCAGCACAG CTCGGGTGTT TCGACGAGCA ATCGTGATGC CCAATCTCAG GCCACCGATC ACGAGTGTAG AAGCAGCAAA AACCTATCGC GACCAGATTT TGGCAGCCCT TCCTGATGGT GTTCCGTTTA CGCCATTGAT GACGGCCTAC CTCAACGAGA GTTTGGCTCC AGATGTTTTA GAGCAAGGGC ATCAACAGCA TGTTTTCATC GCCGCCAAGC TTTATCCAGC CCATGCCACA ACTAATTCCG AGCAGGGAGT CAGTGATCTA AGAGCGATTA ACTCACTTCT AGAGACGATG GAGAAAATTG GGATGCCATT GCTCGTTCAT GGTGAGGTGA GCGATGTCGA TATCGACATT TTCGATAGAG AAGCCTTTTT TATTGAGCAC CACCTGGCAC CACTAATAGT GCGTTATCCA AATCTTCGTG TAGTGATGGA GCACATCACG ACGCAAGAGG CAGTTCAGTT CGTGGAAACA GGTGGACCGA ATTTGGCAGC CACTATCACT CCTCATCATT TACATATCAA CCGCAATGCA ATGTTCCTCG GTGGTTTTCG TAGTGATTTC TACTGCTTAC CAGTAGCCAA GCGTGAACGC CATCGACTCG CCCTGCGTCG AGCAGCGACA AGTGGCAAAC CATGCTTTTT CCTTGGCACT GATTCGGCAC CACACCCTCG CTCTGCAAAG GAGAGTGCTT GTAGTTGTGG TGGCATTTTC AATGCTCATT ATGCGATGGA AAGTTATGCC GAAGTCTTTG AGCAGGAAGG AGCACTTGAT CGTCTTGAGG CCTTCTCCAG TGAATACGGT CCAGCTTTCT ATGGCTTGCC CCTCAATAAC ACCTCAATCA AACTGATACG ACGAGCTCAT GTTGTACCAG CTACTTTTAG CGGACAAACG AACGCAGATT CTTCCGAACA CTTAGTTCCA TTTCATGCTG GCGAACTGTT GGGCTGGTCT GTTTCGGTTG ATTGA
|
Protein sequence | MDSLPDQLIL RQPDDWHVHL RDGAMLHAVL GSTARVFRRA IVMPNLRPPI TSVEAAKTYR DQILAALPDG VPFTPLMTAY LNESLAPDVL EQGHQQHVFI AAKLYPAHAT TNSEQGVSDL RAINSLLETM EKIGMPLLVH GEVSDVDIDI FDREAFFIEH HLAPLIVRYP NLRVVMEHIT TQEAVQFVET GGPNLAATIT PHHLHINRNA MFLGGFRSDF YCLPVAKRER HRLALRRAAT SGKPCFFLGT DSAPHPRSAK ESACSCGGIF NAHYAMESYA EVFEQEGALD RLEAFSSEYG PAFYGLPLNN TSIKLIRRAH VVPATFSGQT NADSSEHLVP FHAGELLGWS VSVD
|
| |