Gene OSTLU_42298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42298 
Symbol 
ID5006445 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009373 
Strand
Start bp89499 
End bp91328 
Gene Length1830 bp 
Protein Length579 aa 
Translation table 
GC content64% 
IMG OID640421866 
Productpredicted protein 
Protein accessionXP_001422387 
Protein GI145356333 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0621] 2-methylthioadenine synthetase 
TIGRFAM ID[TIGR00089] RNA modification enzyme, MiaB family
[TIGR01574] tRNA-N(6)-(isopentenyl)adenosine-37 thiotransferase enzyme MiaB 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.000365258 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00000033172 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGACGC CGCGCGCGGC GGCGGCGGCG GTGGCGGCGC GACCGTCGAC GCCGCGCCGA 
CGCGCGCGCG CGCGCGCGGG CGACGGGCGA CACCTCGACG CGACGGCGGA CGCGGTGCGC
GCGGCGGGCG TCGCGGCGCG GCGCGCGGCG CCGCGAACGG GACGCGCGGA GGACGGGGAG
GAGGACGCGC GGGGGCGACG CGCGGTGTAC GTGGAGACGT ACGGGTGCCA GATGAACGTG
AACGACTCGG AGGTGATGAT GGCGGTGCTC GAGGGCGCGG GGTACGACGA GACGAAGGAG
GTGAACGACG CGGACGTGAT TCTGATCAAC ACGTGCGCGA TTCGGGATAA GGCGGAGGCG
AAAATTTGGC AGCGGTTGGC GTACTTTCGA TCGCTGGGGA ACGGGAAGAA ACGGAGCGAA
AAGCCGGTGG TGGGCGTGCT GGGATGCATG GCGGAGAGGA TCAAGGAGAA GTTGTTGGAG
GCGGATAGGC TGGCGGACAT CGTGGCGGGA CCGGACGCGT ATAGGGATTT GCCGAATCTC
ATCGACGCCG TCGTCGGGAA TCCGGGAGGG AAGGCGATGA ACGTGCAGTT GAGCGTGGAG
GAGACGTACG CGGACATCAT TCCCGTGCGC GAGGCGGGGT CGCACTCGGC TTTTGTCACC
ATCATGCGCG GGTGCGACAA CGCGTGCGCG TTTTGCATCG TGCCGTACAC GCGCGGACGC
GAGCGCTCGC GCGATTTGGC GAGCATCATG TACGAGATTC GTCTTTTGAG CGAACAAGGG
GTGAAAGAGG TCACTTTGCT CGGGCAAAAC GTGAACTCGT ACGCGGGAGA GCCCGCGAGC
GCGACGACGA CGGATTTCTT GAGCTCACTG CGAGGCGAAT CCAAAGACCC GATCGCCGAG
CTCGCGAACG CGTCGACGGA ACGTTTGGCG AGCGCGAGCG GTAGCGCGTT CGTCGGCTAC
GCCGATGGCT TCGCGAGTCG GTACGATCCC GAGCGCAAGC GAGCGGGGAC GATTCAATTC
GCCGAGTTGC TCGATAAAGT CGCGAGCGTG GATCCCGAGA TGCGCATTCG TTTCACGTCG
CCGCACCCGA AGGATTTCCC CGACGACGTC TTGCGAGTGA TTCGCGATCG ACCCAACGTG
TCGAAGTGCT TGCACATGCC CGCGCAGAGC GGGTCGTCGG CTACCTTGGA GCGCATGGCG
CGTGGGTACA CGCGCGAGTC TTACTTTGCC CTCATCGATC GCGTCAAGGC GATGATTCCG
GGGTGCGCCA TCACCACGGA TATCATCAGC GGCTTTTGCG GCGAGACCGA GGACGATCAC
GAGGACACCG TGAGTTTGAT GAGCGCGATC GGATACGAAC AAGCGTTCAT GTTCGCTTAC
AGCGAACGCG AGGGCACGGC GGGGCAAAGA CACCAAATCG ACGACGTCCC CGAAGACGTG
AAGCAGCGGC GTCTGCAGGA AGTCATCGAC GCCTTTCGAG CGCGCGCGGC GGAGAAGCAA
CAGATGGAGA TCGGTTCCAC GCATTGCGTG TTGGTGGAGG GTCCGAGTAA GAAAAACTCC
GACGAGTGGA CGGGGAAGAC GGACACATCG AAGTGGGTGG TGTTCGAAAA GAATGATGCC
ATCGGCAAGT ACGCCGGCGA CGAAGACGCG CCGACGAGCG GGTCGTACGG CGTCAAGCCT
GGAGATTACG TCGCCGTTCG CGTCACTGGG TGCAGTACGG GGACGTTATT TGGTCAAGTT
CTCGGTAAGA CGAGTTTGGT AGAGTTTCAA AACTTGCACG GCGCGCAGTG GACGACGCCA
AAGTCGAGCA ACGGCGCGAG CGCGCGTTGA
 
Protein sequence
MATPRAAAAA VAARPSTPRR RARARAGDGR HLDATADAVR AAGVAARRAA PRTGRAEDGE 
EDARGRRAVY VETYGCQMNV NDSEVMMAVL EGAGYDETKE VNDADVILIN TCAIRDKAEA
KIWQRLAYFR SLGNGKKRSE KPVVGVLGCM AERIKEKLLE ADRLADIVAG PDAYRDLPNL
IDAVVGNPGG KAMNVQLSVE ETYADIIPVR EAGSHSAFVT IMRGCDNACA FCIVPYTRGR
ERSRDLASIM YEIRLLSEQG VKEVTLLGQN LANASTERLA SASGSAFVGY ADGFASRYDP
ERKRAGTIQF AELLDKVASV DPEMRIRFTS PHPKDFPDDV LRVIRDRPNV SKCLHMPAQS
GSSATLERMA RGYTRESYFA LIDRVKAMIP GCAITTDIIS GFCGETEDDH EDTVSLMSAI
GYEQAFMFAY SEREGTAGQR HQIDDVPEDV KQRRLQEVID AFRARAAEKQ QMEIGSTHCV
LVEGPSKKNS DEWTGKTDTS KWVVFEKNDA IGKYAGDEDA PTSGSYGVKP GDYVAVRVTG
CSTGTLFGQV LGKTSLVEFQ NLHGAQWTTP KSSNGASAR