Gene OSTLU_18196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18196 
Symbol 
ID5005236 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp616376 
End bp618909 
Gene Length2534 bp 
Protein Length604 aa 
Translation table 
GC content67% 
IMG OID640420657 
Productpredicted protein 
Protein accessionXP_001421340 
Protein GI145354117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAG ACCGGTGCGT CGACGCGCTG CGAGCGGCGA GCTCGAGGGA GGCGGACGAC 
GAGGCGGACG CGTACTACGC CGCGCTGAGC CTGCACCGAG CGCTCGCGAC GCGGGCGAAT
CGAGACGACG ACGAGGCGAT GGCGCGATGC GTCGCGACGT GCGAGGAGGT GGTAGAGAGG
TACGCGCGAC GCCGCGGCGG GCGCGACGGC GCGATGCGGC GGGAGACGGT GGCGCTGGCG
GCGTTGACGG TGGCGCGCGC GAGAGGACCG GCGGCGGCGG CGGAGACGCT GAGGGGCGGC
GACGAGCGCG CGAGCGTGCT CGGCGCCGCC GTCGCGCGGT GGTTAGCGCT GCGATTGGTG
GACGTGGACG CGGAGAGGGA GCTCACGGTG GCGGATTTGA CGCGATTGGC GGTTATTTTG
TCAAAGGCGC GGGATTCGCG CGCGGCGACG GCGTTTTGCG AGTGCGGCGG CGTGTCGTCG
TTGTGCGCGT TGATCGACGT CGAATCGTCC CCGCCGCTGA GCGCGGGGAC GATATTTGAT
TGCGCCAAAG CCATCGCGGG TTTTTGCGCG GCGCTTTCGG ATGGTTCGAC GACGACGAGG
TTCGACGTCG AAGGCGTGTC GCGCGAGTTG AGGGCGATTC TCTCGTCGGC TTCGGGTAAG
ATGGCCGATC CCCTTCCCGC GGTCGCCGCC ACCGCGTGTC GCCAAGCGTT GGCGTCTTTG
GAAAAGTACG CGGCGACCCG AGCGGGAGCG GTGTCCACGG TGGAGCGGCA GTCGTCGTAC
GGGGCGCTCG GCGATTGCGT CACCGCGGCG AAGGAGCTCG ATTTCGCCGA CGCCTCGCGC
TCGGGTTCGC ACGCCAGTCT ACACCGCGCG CAAAAATACG ACGGTTCGTT TGGATCGCAA
ATCGATCTCT CGCGCACGCG CTCGGAATCG CTCAACTCCT TGGATGCCTC GTTTAGTTCG
CCGACGAAGC TCATCCCCGC GGCGTCCGTT CCCGAAGCCT CCGACACCGC CACCGCGCTC
TTCACCACCC CACCGCCGCT CGCGCGAAAA CGATCCGACT CCGGCACATC ACCTCTGCTC
AAGTCCAAGT CCCCGGCGCG CGCGCGCTCG AAGAAGCGCG CCGTCGCGCT TCGCCGCTTC
CTCTTCTTCG TCTTCGCCGT CGGCGTCGTC TTCGCCTTCG CCGACCGTCT CCGCGCCCAT
CGCGCGTGAT TTCCTCCGTT CGTCCGTCCA TTCCGCCGTT CGTCGTCCGT TCGCTTCCCC
GTTTCCTCGC CGCCGTCTGT TCCTTCGGGC GTCGACCGCG TCGATTCGCC GCCGAGTCGC
GTCGCTCGAG TTAGTTAGTT AGTTGAAAAG ATCCTCCAGC GACTAAACCC CAACACTTCG
GCGCCACATC ATCGCGCCGC GTCGACGCGC GCGATGCGCT GCGCCGCCGC GCCGCGCGAG
ATGCGCGCGT GCGCGACCTC CTCGCGCGCG CGCGCGACGC GAGACCGCCG CGCGACGCCG
CGAGCGGCGC GATCGCGCCG CCGCGCGGCA CCCAGGAAAA GCGCTCGATC GGCGCTCGCG
CGCGCCGCGG ACGGCGCGCG CGACGACGAA ACGACGCACG AGTGGTGCGC GGGAGATGCG
TCGTCGACGG ACGCGCGCGA AAGCGTCGAC GCGCGCGACG CGGGCGCGGA CTGACCTCGA
CGCGTCGAGC ACTCGCGCGC AGCGTCGTCA TCGAAAACCG AGCGGCGAGC CTAGACGGCG
ATAAGCGCAC GCTCGTCGTC GCGATCGAGG ATAAACAGAC GGAATCGAAC TTGCGTAAGA
GCCCGGGGAG CGCGGTGTAC GGAAATGGGA ACGGGAAGGT GTGGACGGAG ATGTACGACG
AGCCGGGGCA GTACGTCAGG GCGCGGTGCG GGTGCGGCGC GGAGACGCGC TTGCCGATAG
CGCGATCGCC GTATCACGTG CGGTACGACT CGGCGAGGTT AGACAGCGCG AAGGTTGAAT
TCTTGGTGGA TTCGAGTCAT CATCCGAACG CGCTGACGGG CGCGAAACCG GGCGACGTGT
TTCACGTGAG CGAGCCGCGC GGCGTCGGCT TCTCCAACGT GTTGTTCGCG GAGCGATCGC
TCGAGGCGGC GATGCGTAAA AACCATCCAT TGGTCCTTCT CGCGAACGGT ACCGACGGTC
TCGCGAGCGT GCGCTCGCTA TTAGATTGGC AACCAGTGAT GGCGTACGCG GACGCGCATC
CTGTGACGTT ATTTTACCTG TGCGAGAGTC AAGAGAGCGC CGCGCTCTTG TCCATCCACG
ACGAGTGGCG GGAGGAGGGC TTCAAAATCA TTCCGTGCTA CGGCGCTTTG GACGACCAAC
TCTTCTTGAT GGAGCAGTGT TTCCTCACCG GCGCCGTCGC GGCGGGTGGC AAGCCAACCA
TTCTCGGCGC CGACCCCGCG GCGTGTTCCG TCTTGCTCGC CGGCGCCGAG GGCGACGTCG
CCGGGAGCAT CTTGAAGCTC CTGAACGCCC GAGGTATCGC GCGCGACAAC ATCTTGACGA
GTGACTTTTT TTAA
 
Protein sequence
MTRDRCVDAL RAASSREADD EADAYYAALS LHRALATRAN RDDDEAMARC VATCEEVVER 
YARRRGGRDG AMRRETVALA ALTVARARGP AAAAETLRGG DERASVLGAA VARWLALRLV
DVDAERELTV ADLTRLAVIL SKARDSRAAT AFCECGGVSS LCALIDVESS PPLSAGTIFD
CAKAIAGFCA ALSDGSTTTR FDVEGVSREL RAILSSASGK MADPLPAVAA TACRQALASL
EKYAATRAGA VSTVERQSSY GALGDCVTAA KELDFADASR SGSHASLHRA QKYDGSFGSQ
IDLSRTRSES LNSLDASFSS PTKLIPAAVV IENRAASLDG DKRTLVVAIE DKQTESNLRK
SPGSAVYGNG NGKVWTEMYD EPGQYVRARC GCGAETRLPI ARSPYHVRYD SARLDSAKVE
FLVDSSHHPN ALTGAKPGDV FHVSEPRGVG FSNVLFAERS LEAAMRKNHP LVLLANGTDG
LASVRSLLDW QPVMAYADAH PVTLFYLCES QESAALLSIH DEWREEGFKI IPCYGALDDQ
LFLMEQCFLT GAVAAGGKPT ILGADPAACS VLLAGAEGDV AGSILKLLNA RGIARDNILT
SDFF