Gene OSTLU_19070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19070 
Symbol 
ID5006628 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp374180 
End bp375903 
Gene Length1724 bp 
Protein Length555 aa 
Translation table 
GC content55% 
IMG OID640422049 
Productpredicted protein 
Protein accessionXP_001422728 
Protein GI145357035 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCACG ACGCGTGGAA TGACGACACG ATCGGCGGTG CGTCGGTGTC GGGGAAGCGA 
TCGCGATCGA AGCGCATGGT GCGCGCGAGG AAACGCGCGA GGGCGCTCGA GGCGGCGGCG
AAAGCGACGA GGAACGACGA TGACGCTGAT GACGCTGGGC GACGAAACAG CGCGAACGAG
GCGACGACGA CGACAGAAAC GATCATGCTT CCGCGGACGT CCGCGCCTTG GTTCTCGCGC
ACGCTTTCGG TGCACGAGGA GACGTCGTTT GAGACGTACT ACAGACGACA GGGCATAGTG
GACGAGGGCG AGTGGAGAGA CTTTTTGAGG TACCTTCGAT TGCCTTTACC GGTGACATTT
AGGATGAACG TTATGGCGTC GAGACGAGAA GAAGTGCGCG AAGCGCTGAA CGTGGCGAAA
CACTTTCTGC AAAACGCGAG AGAAACGAGG GACGACCGGG GACGGTTGAT ACCGCCGCCA
ACACGCCTGC CGTGGTGTGA CGGATGGCAG CTCGGCGTGG ACAAGATGAG CCTGAAGTTC
TCTCGAAATC CAATGTTGAG AGATTTGCAG CGTTGGTTGG TGAAGTGCAA CAATACCGGC
GTTCTCACTC GACAGGCGGT GGATTCCATG GTTCCCGCGG CAATTTTACA AGTCGAGCCG
CACCATCGCG TGCTCGACCT TTGCGCAAGT CCTGGATCGA AAACTACGCA GGCACTTGAG
GCACTCAACG TCAATGGAGA GGAAGGATCG GCGTCTGGAT GCGTCATCGC GAACGATATT
AATCCGATGC GATGTTATTT CCTAGTACGA CGCTGTGCAG CGCTTCGCAA CGCCACTGCG
AATTTGATGG TGACGACGCA CCAAGCTCAA TGGTATCCAA ATATCAACGT ACCTACCACC
GATAAGGCAT TGACTGAACG CGGAGGAAGA TATCCAGAAG GCTCTTACGA TCGTATCATT
TGTGATGTTC CGTGCAGTGG TGACGGAACG CTTAGGAAAA ACCCACAGAT TTGGTCCGAA
TGGCGCCCCG AATTCGCCAT GGGCCTTCAC AAGCTACAGC TACGCATTGC GCAGCGTGGC
GCGGCACTCC TTAATGTTGG TGGATATATG GTTTACAGTA CGTGCTCATT CAACCCAGTG
GAAAACGAGG CTGTCGTCGC AGAGCTCATC AAACGGTGCG GTGGAGCGCT AGAAATTGTC
GACGCTTCTG ATCGAATTCC CGAGTTACTG CGCAGACCGG GAATATCGAC TTGGAACGTG
ATGACCATGG TCGAAGGGAA CGTAGTAGAA TATCCAAAGT ACGAGGATAG TCAAGCTTCG
ACCGTGCCCA TTGGATTGAA GCGAAAGTTT GCGAAATCGA TGTGGCCGCC GGCGCAGTCG
CTGGCGCGAA TGGGCCATCT CGGAAAACGC ATCAAGTCGA CGGCGATCCC ATTGCAGCAC
TGCATGAGGC TCGTACCTCA CTTACAAGAC ATGGGCGGAT TTTTTGCAGT CTTATTGAAA
AAGGTGGCGC CGATTCCCGG ACCACAGGCG AAGGAGTCGA CCGCAGAAAA AGTCGACCGC
GCTTATGAGC GCACGGAACC AAAGCACGTC TACTCAAAAG TTTCGAGCAC ATTGGTGAAA
AAGTTGGCAA AAGAGTTTGG TCTTGGTGAT AAATTCTCGG AAAAAATCGC CCCTTGCCTC
TACGCGCGCT CAAATCAACA GAAGTCTATC GTGTATATCG CTGA
 
Protein sequence
MQHDAWNDDT IGGASVSGKR SRSKRMVRAR KRARALEAAA KATRNDDDAD DAGRRNSANE 
ATTTTETIML PRTSAPWFSR TLSVHEETSF ETYYRRQGIV DEGEWRDFLR YLRLPLPVTF
RMNVMASRRE EVREALNVAK HFLQNARETR DDRGRLIPPP TRLPWCDGWQ LGVDKMSLKF
SRNPMLRDLQ RWLVKCNNTG VLTRQAVDSM VPAAILQVEP HHRVLDLCAS PGSKTTQALE
ALNVNGEEGS ASGCVIANDI NPMRCYFLVR RCAALRNATA NLMVTTHQAQ WYPNINVPTT
DKALTERGGR YPEGSYDRII CDVPCSGDGT LRKNPQIWSE WRPEFAMGLH KLQLRIAQRG
AALLNVGGYM VYSTCSFNPV ENEAVVAELI KRCGGALEIV DASDRIPELL RRPGISTWNV
MTMVEGNVVE YPKYEDSQAS TVPIGLKRKF AKSMWPPAQS LARMGHLGKR IKSTAIPLQH
CMRLVPHLQD MGGFFAVLLK KVAPIPGPQA KESTAEKVDR AYERTEPKHV YSKVSSTLVK
KLAKEFGLEV YRVYR