Gene OSTLU_94665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_94665 
Symbol 
ID5003727 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp148229 
End bp150194 
Gene Length1966 bp 
Protein Length570 aa 
Translation table 
GC content53% 
IMG OID640419148 
Productpredicted protein 
Protein accessionXP_001419503 
Protein GI145350201 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAGA GCGATGGATG CGCGACGTCG CGAGACTTTG CGAGCGTGAA GATGGAATAT 
TTTAAGACGC GCGCGCGCGA GGCGCTGGGG AACGGGAACT ATAAGGAGGC GTACGCGACG
TACACGAAGT GTCTGGAGGA ACTCGCGCCG AGGGACGCGG GCGAACGCGC GAAACTGCTG
TGTAACCGTT CGATGGCGTA CGCCAAGTCT GCAAACTTTA AAGCGGCGCT GGAGGACGCG
TCTGCGGCGG TGTCGTTGAT GCCGACGTTC GCCAAGGGGT GGTGGCGCAA GGCGACCGCG
CACGTGGGGC TGCGACAGTT CCCGGACGCG CTCGCCGCGT ACAGGCAATC GTTCGAATGT
CAGGACGAAG GCGACGCCGG CGTGATCGAC GAGCACGTGA AAGCGATGAA TAAAACGATC
ACTTCATTTA CGCGCGAGCA ACTTGCGAAC TGGATTCTTG AAACTCTGCA AGACATGGAG
AATCGCGAGC TCATCGAGCA CGCGCACCTC GAAAACGTCA CGTCGTTGGA GATGGCAGAG
GGAATGTTTT GTCAAATCAA AGGCATAAAT GAAGGAAGCA AGCCGCGCGG CGACTACTAT
CGGTTCGTAC AACATTGGAA CGTACACGGT ATGAGCGTGC CGATGGCTTA CACCCAGCGT
GCGAGCATGT ACCGCCACGC GTTGTGCTTT ATGCAAGCTC GCGCCGACGC GGCGGCGGCG
TTGACCTTGT TACAAGACGA CGCATCGCTT TCTGACAAGG AAACGGAGAT GACCTTCACG
TACAAAGTCG ATGACTTCTC GCCTTTCAAG GAAGATACAG TCATCACCAA AGCTTGGGCG
TGGTACGAGA TGGGCAAGGC TTTTGAAGGG CGCGAAAATG GTGACACGCA AGCGGCGGCG
AAGTGCTTCT CGGCGATAAC GCACTTGGAC ACCAAGTATC CGATGTTCAC GAACGCCTTC
AAGAATGTGT GCAGTCGTAT GACTGATGTC GAGGCTGGGA AAATATTGAG CGACATCAAC
GATCAGTACG GCGTGGCGGA GTATGGCGTG CGCACCGTTC CCGACGCTTT GGCAACTTAC
GTGGTCACCG TCAGCCTTCT CTTCAAAACT GGCAAGCTCG TCGGCTTCAA CTCTAAAGTG
CGAGACTCGT TTCGAGAAAA CGTCGCGAAG GGTGCGAATG TCATCAAAGA AAAGGTACTC
ATAGAAAGCG TGCGCTCAAG GTTTCGTGGA GCGCCTGGTG TCGCGCTTTC GTATCGAATT
CTCGCCGGCG AAGATAAATT AAACTCCGAG GTACGTGTTT GATATCCGTC GTTCGGCGGC
TGGTATTCAC GATGTTAGCG CAGAAAATGC TCGAGAAAAT TCAAGCACAC GATTTGATGT
TACTGGGTGG CGCAGAAACG GAGGCTGCGA TGGGTACGGC GAGCGACGTG CAAGGTGAGT
TAATTAAATG GAATAAATTG ATGAGGAATT AAATAATTCA TCACGTAGGC GAGCTCAAGG
AACTGAAGTC GGAGGACTTG GTAGATCGCA TCAACAACTT ACAAGGAATT ATCTTATCGA
CATTAAAAAG TGGCAAGGAG ATCGTAGTCG CGAAGCGACC ACCAACCGAC ATGGAGCTTC
CTTACAGAAC GTACAAATTA GTTTACAACG ACGGCACCCC GGTCGAACGA GTGAACAAAA
CGGGCTATCA AATGTCACAA GTGCACTACT CGGCGCAAGG AATGCACAAA CGCGAAGTAT
GGGCTGAGAT GATGGTGCGA ACGACCGCAA TTAATTATTT ACAATTAAAT TAAATTTGAC
ACAGGACAAG TCTTGCCGAT GGCACCAGAG CTCATCTGAG ATAGCCGTTC AAGCGTTACA
AGTGCCGAAG GACGCCAAGA AGCAAGATTT GGATGTCTGC ATCACGCCGT CGACCGTCAG
AGTGTCATGT CGAAACACTG GACGCACGTA CCTAGAGGTG ATTTAA
 
Protein sequence
MEESDGCATS RDFASVKMEY FKTRAREALG NGNYKEAYAT YTKCLEELAP RDAGERAKLL 
CNRSMAYAKS ANFKAALEDA SAAVSLMPTF AKGWWRKATA HVGLRQFPDA LAAYRQSFEC
QDEGDAGVID EHVKAMNKTI TSFTREQLAN WILETLQDME NRELIEHAHL ENVTSLEMAE
GMFCQIKGIN EGSKPRGDYY RFVQHWNVHG MSVPMAYTQR ASMYRHALCF MQARADAAAA
LTLLQDDASL SDKETEMTFT YKVDDFSPFK EDTVITKAWA WYEMGKAFEG RENGDTQAAA
KCFSAITHLD TKYPMFTNAF KNVCSRMTDV EAGKILSDIN DQYGVAEYGV RTVPDALATY
VVTVSLLFKT GKLVGFNSKV RDSFRENVAK GANVIKEKKM LEKIQAHDLM LLGGAETEAA
MGTASDVQGE LKELKSEDLV DRINNLQGII LSTLKSGKEI VVAKRPPTDM ELPYRTYKLV
YNDGTPVERV NKTGYQMSQV HYSAQGMHKR EVWAEMMDKS CRWHQSSSEI AVQALQVPKD
AKKQDLDVCI TPSTVRVSCR NTGRTYLEVI