Gene OSTLU_17919 details

Gene Information

Locus tag: OSTLU_17919
Symbol:
ID: 5005008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 438467
End bp: 440124
Gene Length: 1658 bp
Protein Length: 553 aa
Translation table:
GC content: 62%
IMG OID: 640420429
Product: predicted protein
Protein accession: XP_001421152
Protein GI: 145353718
COG category:
COG ID:
TIGRFAM ID:
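
The coordinates and lengths listed above are related by simple arithmetic. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention), the following reproduces the listed gene length and compares it with the listed protein length. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def codon_counts(length_bp):
    """Return (complete codons, leftover bases) for a coding length."""
    return divmod(length_bp, 3)


length = gene_length(438467, 440124)
full_codons, leftover = codon_counts(length)
print(length)       # 1658, matching the listed Gene Length
print(full_codons)  # 552 complete codons
print(leftover)     # 2 bases left over
```

Under that assumption, 1658 bp gives 552 complete codons plus two trailing bases, one residue short of the listed 553 aa. The gene sequence in this record ends in the partial codon CT, and every CTN codon encodes leucine in the standard genetic code (the record leaves the translation table blank), which is consistent with the final L of the protein sequence.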


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.161666
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCGG TGTTCGACGG ATCGACGACG AAAAAGCGAC ACATCTCGCT CGGAGGACGG 
CGAAAGGATG AACCGTCGAC GCAGCGGGCG CTGATCGAGA GCGCGCGCGA GGCGCGGGAG
CGGAGGGCGG CGCAGAGGGC GACGGCGACG GCGACGGCGA CGGCGCAGCG GTTTCGACGA
GGCGCGCGGG AGGTTAGACG CGCGAAAGCG GCGGTGAGGG AAGCGTACGC GAACGCGAGC
GAAGGTGTGA TGGTTGATGA CGCGTCGTGC AGGGCTTTGG TGTATTTTGG TGATGGATGG
AGAGATTGGG CGGCGTGCGC GGCGGCGTGT GAACGGTTGT ACGCGGCGGC GACGACGACG
ACGAGCGTGG CGCCGAGCGA GGCGTTCGTG GAGAGATGGG ACGACGCGAT GCGGGCGTCG
GGCGTGGAGC GCGCGCGATG GGTGATTAGG ATGCGGAAGA TGGTGCGGTT GATCGTGGGG
TCGCTTCGAG CGACGACGCG CTTGGAGGAG ACGACTGGAC GCGACGTCGC GGAGACGCAC
TTAGCGAGCG CTTTGTTGGC GCTCATGGCG AGAGATGACG CGAGATGGAA AGATACTGGC
GTTCAATTGT GCGCGACTCT GTGCTCGAAC GATGGTTTGG AAGATGTGCG AGAGATTTTG
CTCGATATTT TGAGAAATCA AAGTCGCCGC GACGACGTCG TTCGGGCGGC GATGCTTAGG
ACGTGCGAAA ACATCGCGCG GTGTAGCGGT GACGACGCGA GTGGTGCGCT CGCGTCGATG
CTCGCGACGA TACCCGGGGT GTGGGACACG TTCGGGCCGG AGATTCATTC GAGAGATTTG
TGGTCGTTCG TCGTCGACGC GTTCAAGTCT GATCGTCACG TCGACTCGGT GGCTACCGTC
GCACTCGAGA CGCCGCTGAG CGGCGTCGAC GCCGCGCTCG GGAACGTGTT GCAACTCACG
AAAACGTTCA TAGGTACGAT GGACTTTTCG CAGTCACAGA ATGTCGTCGT CGCGGTGACA
AAACTCATGG AGTCGTCTGT TTTGAGCGGC GCGCTCTTTG CCACAACGCC GAACGCGATG
GAGGACGAAG ACGATGACGA GGGCGGTGCG GACGACGACG ACGACGACGA CGATAGCGTC
AAGATTGACT TACTCAATCT TCGCGCCGTG CGCGCGGAGG CGCGTAAAAT TCGACGCGTG
CCCCCTGACG TGGACATGGA GGCGGTGCGA GCCACGAGGA CGTGGATGCA AACGCGCGAA
TTCTTCGGCG ACAACGCGTG TGTCGCGCGA CTCGTCACAA CGATCATGCC TGCGAACGAC
GCGCAGCGAT GCTTAGTAGG CATCGAAGGG TTCACGCATT TCGCGTGCGC GTGCGACATG
GTGCTCAGAG GCGCCGAGCG AGCGTCGTTC ACGCGCGTGC TCACGTTTGG CACGGATTGC
ATCGCTCAAC TTTGGCCCGC CCTCGAGCAT TGGAGGCAAA CTTCCGGAGG TGGGCAAGAA
CGCTCGTTCC GCAGAGCCTT GGGCGTTTTC GCAAAGTTGT ACAACACCTA CACCGCTATT
TCCGACGATG AAGAATTTTA TCGTCTGGGA AAACCGCTCG GTCTCGAGGC GACGACGCGT
CTCGTCGCAT TTTTAAGAGA CACGCTCTGG ACGTTGCT
 
Protein sequence
MAPVFDGSTT KKRHISLGGR RKDEPSTQRA LIESAREARE RRAAQRATAT ATATAQRFRR 
GAREVRRAKA AVREAYANAS EGVMVDDASC RALVYFGDGW RDWAACAAAC ERLYAAATTT
TSVAPSEAFV ERWDDAMRAS GVERARWVIR MRKMVRLIVG SLRATTRLEE TTGRDVAETH
LASALLALMA RDDARWKDTG VQLCATLCSN DGLEDVREIL LDILRNQSRR DDVVRAAMLR
TCENIARCSG DDASGALASM LATIPGVWDT FGPEIHSRDL WSFVVDAFKS DRHVDSVATV
ALETPLSGVD AALGNVLQLT KTFIGTMDFS QSQNVVVAVT KLMESSVLSG ALFATTPNAM
EDEDDDEGGA DDDDDDDDSV KIDLLNLRAV RAEARKIRRV PPDVDMEAVR ATRTWMQTRE
FFGDNACVAR LVTTIMPAND AQRCLVGIEG FTHFACACDM VLRGAERASF TRVLTFGTDC
IAQLWPALEH WRQTSGGGQE RSFRRALGVF AKLYNTYTAI SDDEEFYRLG KPLGLEATTR
LVAFLRDTLW TLL
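
The sequences above are printed in blocks of ten characters separated by spaces. As a minimal sketch of how such a listing could be processed, the helpers below strip the whitespace and compute the GC fraction of a nucleotide string. The function names are hypothetical, and the short input is only the first two blocks of the gene sequence; the full 1658 bp sequence would have to be supplied to compare the result with the listed GC content.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown in this record.
first_blocks = "ATGGCGCCGG TGTTCGACGG"
seq = clean_sequence(first_blocks)
print(len(seq))                   # 20
print(f"{gc_fraction(seq):.2f}")  # 0.70
```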