Gene Information |
Locus tag | OSTLU_42298 |
Symbol | |
ID | 5006445 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | + |
Start bp | 89499 |
End bp | 91328 |
Gene Length | 1830 bp |
Protein Length | 579 aa |
Translation table | |
GC content | 64% |
IMG OID | 640421866 |
Product | predicted protein |
Protein accession | XP_001422387 |
Protein GI | 145356333 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0621] 2-methylthioadenine synthetase |
TIGRFAM ID | [TIGR00089] RNA modification enzyme, MiaB family [TIGR01574] tRNA-N(6)-(isopentenyl)adenosine-37 thiotransferase enzyme MiaB |
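
The length fields in this table can be cross-checked against the coordinates. The sketch below is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene span runs from the start codon to the stop codon with no untranslated region. Under those assumptions, the difference between the gene span and the coding length implied by the protein would be intronic sequence, which would be consistent with "Is gene spliced | Yes".

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Coding bases implied by a protein: one codon per residue plus one stop codon."""
    return (protein_length_aa + 1) * 3


# Values copied from the Gene Information table.
start_bp = 89499
end_bp = 91328
protein_length_aa = 579

span = gene_span_length(start_bp, end_bp)
coding = coding_length(protein_length_aa)

# Assumption: no UTR inside the span, so the remainder is attributed to introns.
non_coding = span - coding

print("gene span (bp):", span)
print("implied coding length (bp):", coding)
print("implied non-coding length (bp):", non_coding)
```

The computed span can be compared with the Gene Length field; a mismatch would suggest a different coordinate convention than the one assumed here.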
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.000365258 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00000033172 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGACGC CGCGCGCGGC GGCGGCGGCG GTGGCGGCGC GACCGTCGAC GCCGCGCCGA CGCGCGCGCG CGCGCGCGGG CGACGGGCGA CACCTCGACG CGACGGCGGA CGCGGTGCGC GCGGCGGGCG TCGCGGCGCG GCGCGCGGCG CCGCGAACGG GACGCGCGGA GGACGGGGAG GAGGACGCGC GGGGGCGACG CGCGGTGTAC GTGGAGACGT ACGGGTGCCA GATGAACGTG AACGACTCGG AGGTGATGAT GGCGGTGCTC GAGGGCGCGG GGTACGACGA GACGAAGGAG GTGAACGACG CGGACGTGAT TCTGATCAAC ACGTGCGCGA TTCGGGATAA GGCGGAGGCG AAAATTTGGC AGCGGTTGGC GTACTTTCGA TCGCTGGGGA ACGGGAAGAA ACGGAGCGAA AAGCCGGTGG TGGGCGTGCT GGGATGCATG GCGGAGAGGA TCAAGGAGAA GTTGTTGGAG GCGGATAGGC TGGCGGACAT CGTGGCGGGA CCGGACGCGT ATAGGGATTT GCCGAATCTC ATCGACGCCG TCGTCGGGAA TCCGGGAGGG AAGGCGATGA ACGTGCAGTT GAGCGTGGAG GAGACGTACG CGGACATCAT TCCCGTGCGC GAGGCGGGGT CGCACTCGGC TTTTGTCACC ATCATGCGCG GGTGCGACAA CGCGTGCGCG TTTTGCATCG TGCCGTACAC GCGCGGACGC GAGCGCTCGC GCGATTTGGC GAGCATCATG TACGAGATTC GTCTTTTGAG CGAACAAGGG GTGAAAGAGG TCACTTTGCT CGGGCAAAAC GTGAACTCGT ACGCGGGAGA GCCCGCGAGC GCGACGACGA CGGATTTCTT GAGCTCACTG CGAGGCGAAT CCAAAGACCC GATCGCCGAG CTCGCGAACG CGTCGACGGA ACGTTTGGCG AGCGCGAGCG GTAGCGCGTT CGTCGGCTAC GCCGATGGCT TCGCGAGTCG GTACGATCCC GAGCGCAAGC GAGCGGGGAC GATTCAATTC GCCGAGTTGC TCGATAAAGT CGCGAGCGTG GATCCCGAGA TGCGCATTCG TTTCACGTCG CCGCACCCGA AGGATTTCCC CGACGACGTC TTGCGAGTGA TTCGCGATCG ACCCAACGTG TCGAAGTGCT TGCACATGCC CGCGCAGAGC GGGTCGTCGG CTACCTTGGA GCGCATGGCG CGTGGGTACA CGCGCGAGTC TTACTTTGCC CTCATCGATC GCGTCAAGGC GATGATTCCG GGGTGCGCCA TCACCACGGA TATCATCAGC GGCTTTTGCG GCGAGACCGA GGACGATCAC GAGGACACCG TGAGTTTGAT GAGCGCGATC GGATACGAAC AAGCGTTCAT GTTCGCTTAC AGCGAACGCG AGGGCACGGC GGGGCAAAGA CACCAAATCG ACGACGTCCC CGAAGACGTG AAGCAGCGGC GTCTGCAGGA AGTCATCGAC GCCTTTCGAG CGCGCGCGGC GGAGAAGCAA CAGATGGAGA TCGGTTCCAC GCATTGCGTG TTGGTGGAGG GTCCGAGTAA GAAAAACTCC GACGAGTGGA CGGGGAAGAC GGACACATCG AAGTGGGTGG TGTTCGAAAA GAATGATGCC ATCGGCAAGT ACGCCGGCGA CGAAGACGCG CCGACGAGCG GGTCGTACGG CGTCAAGCCT GGAGATTACG TCGCCGTTCG CGTCACTGGG TGCAGTACGG GGACGTTATT TGGTCAAGTT CTCGGTAAGA CGAGTTTGGT AGAGTTTCAA AACTTGCACG GCGCGCAGTG GACGACGCCA 
AAGTCGAGCA ACGGCGCGAG CGCGCGTTGA
|
Protein sequence | MATPRAAAAA VAARPSTPRR RARARAGDGR HLDATADAVR AAGVAARRAA PRTGRAEDGE EDARGRRAVY VETYGCQMNV NDSEVMMAVL EGAGYDETKE VNDADVILIN TCAIRDKAEA KIWQRLAYFR SLGNGKKRSE KPVVGVLGCM AERIKEKLLE ADRLADIVAG PDAYRDLPNL IDAVVGNPGG KAMNVQLSVE ETYADIIPVR EAGSHSAFVT IMRGCDNACA FCIVPYTRGR ERSRDLASIM YEIRLLSEQG VKEVTLLGQN LANASTERLA SASGSAFVGY ADGFASRYDP ERKRAGTIQF AELLDKVASV DPEMRIRFTS PHPKDFPDDV LRVIRDRPNV SKCLHMPAQS GSSATLERMA RGYTRESYFA LIDRVKAMIP GCAITTDIIS GFCGETEDDH EDTVSLMSAI GYEQAFMFAY SEREGTAGQR HQIDDVPEDV KQRRLQEVID AFRARAAEKQ QMEIGSTHCV LVEGPSKKNS DEWTGKTDTS KWVVFEKNDA IGKYAGDEDA PTSGSYGVKP GDYVAVRVTG CSTGTLFGQV LGKTSLVEFQ NLHGAQWTTP KSSNGASAR