Gene OSTLU_24552 details


Gene Information

Locus tag: OSTLU_24552
Symbol:
ID: 5001839
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 351838
End bp: 353004
Gene Length: 1167 bp
Protein Length: 314 aa
Translation table:
GC content: 60%
IMG OID: 640417260
Product: predicted protein
Protein accession: XP_001417743
Protein GI: 145346537
COG category:
COG ID:
TIGRFAM ID:
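
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (a convention that reproduces the reported 1167 bp); the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(351838, 353004)   # Start bp / End bp from the record
cds_len = coding_length(314)             # Protein Length from the record
untranslated = gene_len - cds_len        # bases in the span that are not translated

print(gene_len, cds_len, untranslated)   # 1167 945 222
```

The gene span is longer than the coding length implied by the protein, which is what one would expect for an entry marked as spliced; the sketch does not say how the untranslated bases are distributed.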


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.647272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.779721
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGGCGCGCGC GCGCGCTCGA ACGTCGGTGA CGATGAGCGA AACGGCGACG GTGACGTACC 
TGGAGAAATT CGTGGACAGT GCGTGCGCGA GAGCGAGGCG AGAGCGAGGC GAAGGGTGAA
ATTTCGCTCG GACGACCGAG GACGATCGAG CTCGAACGCG AAGACTGACG ATCGTTACGC
GCGCGTCGTG ACGTAGATCT CGCGGACGTC CCCGCGGAGT TGCAGCGGAT ACTACAAACG
ATCGGGGAGC TCGATAAACG GAACGTGCGA TTGCGAGATG CGGTGCAAGC AAAGGTGGAC
GAGTGCGCGT CGGCGCCGAG TCTGAGCGCG CGAGGCGCGC GCAGCGCCGA CGTCGACGCG
GTGAGCACGT TGAAGAAAGA GATCGAAGAA CTTCACGATA CGATGGCGAT GGTATCCAAT
GAGAAGATAC GGTTGGCGCA GATGGCGTTG GATTTAGTGA AAGGGAACGC GACGGTGCTC
GACGCTGAGA TGAAGACGTT CCGCACAGAG CTCGAGGAAC AAGGTATTAA CCCGGACGAG
GACGTCGATG ATGGGTACGG CTACGCGCAG GTGCAAGCGC AGTACCACCG CAAGATGCAA
AAGCCGCAAT ATCAATATCA AAGACCCGCG CCTATGCCGC AGCAGCAGCG CGCGTACGGC
GAACACGCGA TGAGCTCGAT GGACGTCGGC GACCTCGTGG CGGCAAACGT AGGGGCGTTG
AACCAAAGCG CCGGTGGACA AGAGTGGATC GTTGCGACCG TGACTCGATA TTCTCCAACT
GAACGCGAGT TTGAGATCGT TGATGCGGAC GAAGACGCGG AAAAGCACGT GTACCGCTTG
CCGCAAAAGT TTGTCATCCC GCTTCCGAAG ACGGCGTCTG TGAAGCAGTC GCAAAACTTT
CCCGCCGGGA CGAGCGTGCT CGCTGTGTAC CCGAACACGA CCACGTTCTA CAAAGCCAAG
GTCGTGCAAC CGGCGAGAAG ACTCCCGAAC GCGGAGTACA GCGAGTTCGT GTTAGAGTTT
GAAGACGACG GCGACGCCGA CGGTCAAGCG CATCGCCCCG TGCCGTTCCG CCACGTCGTC
TTATTTCCGC GATGAGCGGT GCGAGCGGGA ACACAGTTTC GCGCGATCGA TCGATCGTGC
GTGCGCGCTA TTTCGTGTAA TCAACAG
 
Protein sequence
MSETATVTYL EKFVDNLADV PAELQRILQT IGELDKRNVR LRDAVQAKVD ECASAPSLSA 
RGARSADVDA VSTLKKEIEE LHDTMAMVSN EKIRLAQMAL DLVKGNATVL DAEMKTFRTE
LEEQGINPDE DVDDGYGYAQ VQAQYHRKMQ KPQYQYQRPA PMPQQQRAYG EHAMSSMDVG
DLVAANVGAL NQSAGGQEWI VATVTRYSPT EREFEIVDAD EDAEKHVYRL PQKFVIPLPK
TASVKQSQNF PAGTSVLAVY PNTTTFYKAK VVQPARRLPN AEYSEFVLEF EDDGDADGQA
HRPVPFRHVV LFPR
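
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before computing anything on them. The following is a minimal sketch in Python of that cleanup together with a GC-percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, copied from the record.
first_blocks = clean_sequence("CGGCGCGCGC GCGCGCTCGA")
print(len(first_blocks), gc_percent(first_blocks))  # 20 90.0
```

Passing the complete gene sequence text through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field of the record.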