Gene Information |
Locus tag | OSTLU_31592 |
Symbol | |
ID | 5001907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 194789 |
End bp | 195914 |
Gene Length | 1126 bp |
Protein Length | 325 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417328 |
Product | predicted protein |
Protein accession | XP_001417948 |
Protein GI | 145346959 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
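
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon. The record does not state this, but the convention reproduces the listed gene length. The function and variable names are illustrative only and are not part of the source database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 194789, 195914
protein_len_aa = 325

gene_len = feature_length(start_bp, end_bp)
# An unspliced CDS needs 3 bases per residue plus one stop codon.
orf_len = 3 * (protein_len_aa + 1)
# Bases in the listed gene span that lie beyond the stop codon.
trailing = gene_len - orf_len

print(gene_len)   # 1126, matching the Gene Length field
print(orf_len)    # 978
print(trailing)   # 148
```

Under these assumptions the listed span is longer than the open reading frame alone, which is consistent with the gene sequence continuing past its stop codon.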
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.784447 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTGA GAGCGGCGGT GGACGCGCCC ATCGTGAAGA TTGGTACGCG AGGATCGCCG CTGGCGCTCG CGCAGGCGTA CATGACGCGC GATTTGTTGA AGGAGAACTT CCCGGAACTC GCCGAGGACG GCGCGTTGGA GATTTGCATC ATTAAGACGA CCGGGGATAA GGTTTTGGAC CAGCCGTTGG CGGATATCGG GGGTAAGGGT TTGTTCACTC GCGAGCTCGA CGACGCCTTG CTCGACGGGC GCATCGACAT CGCCGTGCAC TCGATGAAGG ATGTGCCGAC GTACTTGCCG GAAGGGATGG TGTTGCCGTG CATGTTGCCG CGTGAAGATG TCAGAGATGC GTTCTTGTGC TTGAAGTATG ACTCCTTGTC GCAATTGCCG GAAGGGGCGG TCGTCGGCAC GGCGTCTCTT CGCCGCCAGT CGCAGCTCTT GTACAAGTTT CCAACGCTCA AGTGCGTGAA CTTTAGAGGT AACGTGCAGT CGCGCATTCG CAAGCTCAAG GAGGAAGTTG TTGACTGCAC CTTGCTCGCT ATCGCGGGTT TGAAGCGCAT GGACCTGGCC CAACACGCCA AGGTCATCAT CCCCACCGAA GAAATGTTGC CCGCCGTCGC GCAAGGCGCC ATCGGTATCA CCTGCCGCGC GGGCGACGAC AAGCAGCTCG CGTTCTTGGC CAAGCTTAAC CACGAAGACA CGCGCATGGC TGTTGAAGGC GAGCGCTCTT TCTTGGCCGC TCTCGATGGC TCTTGCCGCA CCCCGATCGC CGCTCACTGC CACCTCGTCG ACGGTAAGAT GCAGTTCCGC GGTTTGATCG CCTCCCTCGA CGGCAAGCAA GTTCTCGAGA CCACCCGCGA AGGTGCCTGG GACGCCGCGT CGTTGTTGGA CGCCGGTAAG GACGCCGGCG CCGAGCTCAA GGGTAAGGCC CCGGCTGATT TCTTCGCCAA CTTGATCGAA AACGGCGGTG GCTGGTAATC GCTCCGTCCA TTTCTCGCCC GACGTTCGCT CCGAGCGCTC GCCGTCGCGA GAAATTTATT CGCTCTATCC ACCACATCAC TATCCTTTCG CGTGAGTTTA GTTATGATCG AATCTTTGTC AATTTCTTAT ATTCGCCATA TTACGC
|
Protein sequence | MVVRAAVDAP IVKIGTRGSP LALAQAYMTR DLLKENFPEL AEDGALEICI IKTTGDKVLD QPLADIGGKG LFTRELDDAL LDGRIDIAVH SMKDVPTYLP EGMVLPCMLP REDVRDAFLC LKYDSLSQLP EGAVVGTASL RRQSQLLYKF PTLKCVNFRG NVQSRIRKLK EEVVDCTLLA IAGLKRMDLA QHAKVIIPTE EMLPAVAQGA IGITCRAGDD KQLAFLAKLN HEDTRMAVEG ERSFLAALDG SCRTPIAAHC HLVDGKMQFR GLIASLDGKQ VLETTREGAW DAASLLDAGK DAGAELKGKA PADFFANLIE NGGGW
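
The relationship between the gene sequence and the protein sequence above can be checked with a short script. This is a minimal sketch, not the database's own method: the Translation table field is empty in this record, so the standard genetic code (NCBI translation table 1) is assumed, and the helper names `clean`, `translate_orf`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record does not specify a translation table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate_orf(dna: str) -> str:
    """Translate from the first base, in frame, until the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
gene_head = "ATGGTGGTGA GAGCGGCGGT GGACGCGCCC"
print(translate_orf(gene_head))  # MVVRAAVDAP, the first block of the protein sequence
```

Passing the full gene sequence to `translate_orf` and `gc_percent` gives values that can be compared with the Protein sequence, Protein Length, and GC content fields of this record.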