Gene OSTLU_43023 details


Gene Information

Locus tag: OSTLU_43023
Symbol:
ID: 5005558
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 35412
End bp: 36818
Gene Length: 1407 bp
Protein Length: 451 aa
Translation table:
GC content: 55%
IMG OID: 640420979
Product: predicted protein
Protein accession: XP_001421190
Protein GI: 145353802
COG category:
COG ID:
TIGRFAM ID:
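
The length fields above can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption; the record does not state its coordinate convention), and the helper name `span_length` is hypothetical. The gap between the gene length and three times the protein length is consistent with the gene being marked as spliced, but the record gives no exon boundaries, so the sketch only reports the size of that gap.

```python
# Values copied from the Gene Information fields above.
start_bp = 35412
end_bp = 36818
protein_length_aa = 451


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


gene_length_bp = span_length(start_bp, end_bp)

# Bases needed to encode the protein itself (three per residue, no stop codon).
coding_bp = protein_length_aa * 3

# Bases in the gene span that are not accounted for by the protein.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1407, matching the listed Gene Length
print(coding_bp)       # 1353
print(non_coding_bp)   # 54
```

Under the inclusive-coordinate assumption the computed span matches the listed gene length of 1407 bp, and 54 bp of the span do not correspond to protein-coding codons.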


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.583618
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTTA CGTTGGCGCT GCGCGGCGCG TCGTGCGACG CGCGCGCGAC GTCGTCGCGC 
AAACTGCTGC GACGCGAACC CGCCGGCGCG TCCAGGCCGG TCCATCGCGA ACACCTGTCG
TCTGGGGTGT CCAAGGAGCT CGTGCAAAGA GTCGCGCGCC GGGACGGCGG CGTCGCGGTG
ACGTTTGCGA ACGAAGGCAT GTACGACTTC GTGGTGAACT GGTGCGAACA CATGGACGAA
ATCGGGATTA CGAATTATTT AGTCGGGGCG ATGGATGAGA GCTTATACGG TCGGTTGCGA
AAGATTGGCG TCAACGCGTG GTTGATGGGA TCGAAAAATA TCGACGACGA CGAAGTGAAG
AAGGATTTCG GTTGGGGGAC GAGGACGTTT CATAAGATGG GACGGGATAA GATTCGTTTA
GTGCACGAGC TGACGAAGAC TGGGTTTGAT GTCATCGTCA CGGACGTAGA TGCGGTGTGG
TTACGCGACC CATTTCCGTT TTTGAGGCGA TATCCCAAAG CGGATGCGTT GGTGAGCATC
GATAATTTGC GCAATCATAC CTCGGTCGTG GCGACGCAAG CGAATCACGC GGTCGATGGG
GAAGGCTTAG AGCACAGCGC GTGCGGTGGG AACAAAAACA TCGGTATTAT GTGGTTTCGC
TCGACCGAAG GCAGTCAGTC GTTCACGCAA GAGTGGTTGA ACAAGCTCGA GTCAAATGAC
AAAGATTGGG ATCAAGTCGT GTTTAACAAG TTGGTCGAGC AGGGCGGGTG CGAAACGGCG
CGCGACGGGA GCGGTGTCGC CCCGGCGTAT GGTGGCGGGC TCATGCTAGG AATCTTGCCG
GTGGCGTTCT TTGCGAACGG TTACACATAT TTCACCGAAC GTCTTCACGA AATGTTCGGC
TTGAAACCGT ACGCTGTGCA CACGACGTTT GGTTACGCAG GCACGGTTGG GAAGCGACAT
CGCCTGCGAG AGGCGAACCA GTGGTACGGC GATAAATACG AACCGACTTA TTTTCAAGGG
AAATTCATGT CGTACACGCC GCGGCTGCTC AAAGATGTCG ATTACGCCGA ATTCGTCAAG
CGTGGCCACC CGAATGAAGA AAATACACCC ATGCTCGAGA GAGACGAGGA CGTTGTGTTG
GAGCACATGC GATTCGTCAA TCATCAACTC GCGCAATTGT ACGAAGCTGC GGTCGTCGCG
AAGCATCTTG GACGTGCGTT GATTTTACCG CCATTTGCGT GCGGGTTAGA TCGCGTTTGG
TTCCCTCACA AAGGGCGATA TCCCGGTGCT TTGCTCAAGC TTCCATTCGT GTGCCCTGCG
GATCACGTGC TCAAGATTGA AGAGTTGCAC GAATTCGCGC AAGACTATCG CGAATTTTCG
TTTTTAGGGC ATCCTTACAT GCCGCGT
 
Protein sequence
MALTLALRGA SCDARATSSR KLLRREPAGA SRPVHREHLS SGVSKELVQR VARRDGGVAV 
TFANEGMYDF VVNWCEHMDE IGITNYLVGA MDESLYGRLR KIGVNAWLMG SKNIDDDEVK
KDFGWGTRTF HKMGRDKIRL VHELTKTGFD VIVTDVDAVW LRDPFPFLRR YPKADALVSI
DNLRNHTSVV ATQANHAVDG EGLEHSACGG NKNIGIMWFR STEGSQSFTQ EWLNKLESND
KDWDQVVFNK LVEQGGCETA RDGSGVAPAY GGGLMLGILP VAFFANGYTY FTERLHEMFG
LKPYAVHTTF GYAGTVGKRH RLREANQWYG DKYEPTYFQG KFMSYTPRLL KDVDYAEFDV
VLEHMRFVNH QLAQLYEAAV VAKHLGRALI LPPFACGLDR VWFPHKGRYP GALLKLPFVC
PADHVLKIEE LHEFAQDYRE FSFLGHPYMP R