Gene OSTLU_36544 details


Gene Information

Locus tag: OSTLU_36544
Symbol:
ID: 5006942
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 45928
End bp: 47884
Gene Length: 1957 bp
Protein Length: 613 aa
Translation table:
GC content: 59%
IMG OID: 640422363
Product: predicted protein
Protein accession: XP_001422794
Protein GI: 145357170
COG category:
COG ID:
TIGRFAM ID:
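
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values listed but is not stated in the record. The function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(45928, 47884)
coding_bases = 613 * 3  # bases needed to encode 613 amino acids

print(gene_length)   # 1957
print(coding_bases)  # 1839
```

The genomic span is longer than the bases needed to encode 613 amino acids, which fits the "Is gene spliced: Yes" field, although the record itself does not list exon boundaries.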


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value: 0.263627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0195564
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGCA CGCTCGGAAG CCTGTTCGCG AGCGGACCGA GGCTGAAGTA CGACCTCGGG 
CGCGAGTTCG CGTGTCGCGC GTTCGGCGTG TGGACGCACC GACGCGGGAC GTGCCAGGTG
CGACGAACGA ACGCGACGCG ACGCGAAGGA TGAGTGAATG AATGAATGAA TGAATGAATG
AATGAATGAA TGAATGCGTC TGATGACGAT GATTGACGAA AGGAAACGCG CGCAGGAGAC
GGGAGAGGAA CACTCGATAT TCACGTTTCA CGCGGACGTG AATCGCGATC AACAGCGGGT
GAGGCTGGCG AAGAACGGGG CGCGACGGAT GAAGACGCTT CGACACCCAA ACGTGCTGCT
GGTGAAGGAC GTGATCGAGA TCGAGAGCGG GAACGAGCTG ACGATACACG TCGTGACGGA
GGCGGTGACG CCGCTGGAGG CGCACCTGCG GGAGGCGCCG ATCGGGGGGG GGGGGAACGG
GCAGAGGGAT GATTATTTTT CGCTCGGGGT GCGGGAAATC GCGACGGCGG TGGCGTTCTT
GAGCAACGAT TGTAAGCTCG TGCACGGGGG GGTCGGGCTG AGCGCGGTGG TGGTGACGGA
GCGGTTAGAC TGGAAGTTAC ACGGGTTCGA TCTGTGCAGC GAGCTGGAGA GCATCGGACA
CGGGGTGAAC GGGGACGCGG CGCTCATTCG CGGGGCGTAT CTGGTGCCGG ATCAATACAA
GCCGGAGGAG TACAGAAGGG GAGACTGGGT GATCATTCCC GAAGGGCCGC CGTGGGCGAT
CGACGCGTGG GGGTTGGGAT GCTTGATTCA GGAGGTGTAC AGCGGCGGCG CTCTGCGCGG
GACGGACCAG TTGCGCGAGA TTGATCAAAT CCCAAAGTCG CTGCTGAAGG ATTATCAGCG
TCTTCTCGGC TCGCAACCTG CGCGACGGTA CAACCCGAAG AAGCTCATCG AGAACAAAGA
GGCGTTTTCG AACAAGCTCG TGGAGACGAT CACGTTCATC AACAACTTGG CGTTGAAGGA
TTCGATTGAA AAGGAACGAT TCTTCGGCCA CCTGCCGCGC GTGTTGGAGC AACTCGCCCA
GGCGCCCGTG CAAAAGAAAA TCTTACCCAT GTTGTGCGAC GCGCTGTCGT TTAATCAAGC
GCCGCATCAA GCCATCTTAC CCATGCTCCT CGCCGCAGAG GAGGTTCCGC GCGATCAGTA
TCAAAAACTC GTCATCCCGA CGGTCATGAA GCTTTACGAG GCGCCGGATA AGATGATTCG
ACTGGATTTG TTGGAAAATT TGACGCGATA TGCCGACCAC GTGCCGGATG CCATGATGGA
TGATCCGTTG TACGAACGTC TACAGACCGG CTTCGCGCAC AACGACTCCA ACGTGCGCGA
GATGACGCTG AAGGGGGCGC TGACGTTGGT GCCGCGACTC TCCGAGCGCG TCATCACGGC
GTCGCTTTTG AGGCATTTGA GCAAGTTGCA GATTGATGAA GATCCGGCGA TTCGCGCCAA
CACGACGATT TGCCTAGGAA ACATCGCAAA GTATTTGAGT CAAGCGACGG CGAAGCGGGT
GTTGCTGAAC GCGTTCACAA GATCGCTCAA GGATGGATTT CCGCCCGCGC GACTGGCGGG
TCTCATGGCG CTCGAGCACA CGACGCAATA TTATGAACCG TTAGAAGTGA GCCAGCGGTT
GATCCCAGCG CTCGCGCCGC TCATGACGGA CATCGAAAAA GACGTGCGAA CGCGCGCGTT
CACCGTACTT GAGTTGTACG TGCAAGGGTT AAAAGGGCAC TCAGAGGCGC TCGAACTCGG
CCCCGAAGCG GCGGCGGCGC ACGTCGAGGC GCAACAACAG AAAGGGCACG CGAACGCGGC
TAAACGCGCG GCAAACATGC TTTCGTGGGC TGTGAACATG GCAGCGACGA AAATAGGCGG
ACCAGATGAC GCGCACGAAC CGGTTGATCT TCAGAAT
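
The sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before the length or GC content can be compared with the values in the Gene Information section. A minimal sketch, with the hypothetical helper names `clean_sequence` and `gc_fraction`, shown here on the first two blocks only:

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two blocks of the gene sequence above.
sample = clean_sequence("ATGTTCGGCA CGCTCGGAAG")

print(len(sample))          # 20
print(gc_fraction(sample))  # 0.6
```

Passing the full gene sequence through `clean_sequence` gives a string whose length can be compared with the listed gene length of 1957 bp, and `gc_fraction` gives a value to compare with the listed GC content.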
 
Protein sequence
MFGTLGSLFA SGPRLKYDLG REFACRAFGV WTHRRGTCQE TGEEHSIFTF HADVNRDQQR 
VRLAKNGARR MKTLRHPNVL LVKDVIEIES GNELTIHVVT EAVTPLEAHL REAPIGGGGN
GQRDDYFSLG VREIATAVAF LSNDCKLVHG GVGLSAVVVT ERLDWKLHGF DLCSELESIG
HGVNGDAALI RGAYLVPDQY KPEEYRRGDW VIIPEGPPWA IDAWGLGCLI QEVYSGGALR
GTDQLREIDQ IPKSLLKDYQ RLLGSQPARR YNPKKLIENK EAFSNKLVET ITFINNLALK
DSIEKERFFG HLPRVLEQLA QAPVQKKILP MLCDALSFNQ APHQAILPML LAAEEVPRDQ
YQKLVIPTVM KLYEAPDKMI RLDLLENLTR YADHVPDAMM DDPLYERLQT GFAHNDSNVR
EMTLKGALTL VPRLSERVIT ASLLRHLSKL QIDEDPAIRA NTTICLGNIA KYLSQATAKR
VLLNAFTRSL KDGFPPARLA GLMALEHTTQ YYEPLEVSQR LIPALAPLMT DIEKDVRTRA
FTVLELYVQG LKGHSEALEL GPEAAAAHVE AQQQKGHANA AKRAANMLSW AVNMAATKIG
GPDDAHEPVD LQN