Gene OSTLU_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4067 
Symbol 
ID5000769 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp390496 
End bp391503 
Gene Length1008 bp 
Protein Length309 aa 
Translation table 
GC content57% 
IMG OID640416190 
Productpredicted protein 
Protein accessionXP_001416642 
Protein GI145344235 
COG category[R] General function prediction only 
COG ID[COG1054] Predicted sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.164519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGACGGTGG ACGAGGACGC GCCGAAGTAC CAATTGATCA CGTTCTTTCG ATTCGCCGCG 
ATCGAGGACC CGGTGGCGGA GGTGGAGCGG CATCGCGCGC ACATCGAGCG ACGGGGCTGG
GAACTGCGAG GACGGATATA CGTCAACGAG CAAGGCATTA ACGCACAGAT GTCTGGACGA
GGGCGAGAGG GAGAGGAGTA CGCGCAATGG GTCGAGAGCG ACGCGCGATT CGCTGGGATG
CGGATATCGG TGTATCCGAT GGATGCTCAG GCGCACCCGA GACTGGCGTT GCGATACAAA
CCCAACTTGG TGCAACTCGA GGGAGGGACG AATCATTTAC CGTTGACCGA TCGAGAGAAG
CGCGCGAAGC CGTTGTCGCC GAAGGAGTGG CACGATAATC TCATCAAGGT GAACTCGGGC
GCGGAAGACG CGCCTTTGCT TTTGGATGTG AGAAACGGGT ACGAGTGGGA CGTCGGACAT
TTTCGCGGCG CCGAGAGACC GGTGCAAGAG TCTTTCAGGG AAACCGTCTA TACGAACGTG
CAAGACGGCT TAGGACCGCT GGCAAACGTG GATAAAGAAA AGCCGATCAT GATGTACTGC
ACAGGTGGCA TCCGATGCGA CGTGTATTCT ACAGTATTGC GAGAGCAAGG GTACAAGAAC
GTGATGACGC TCGAGGGCGG CGTGCAGGCG TACTTTGATG AGTACGGCAA GCGCGATGAT
CAACTTTGGG ATAACCATTT GTTTGTGTTC GACAGTCGAC TCGCAATGGC CCCTGATGGA
CGTCCGAGCG CCGAGCTAGG CGAAGCAGCG GCGACTTTGC GATGTTACTG CTGTGGCGAC
AGTTCGGCGC CACCGCCGCA CCGCAACTGC CCCAACGTCG ATTGCAATAG GCTCTTCCTC
GTGTGCAGTA AATGCACCGA TAAGCTCGAT GGATTTTGTT GCGAAGAATG CACGAAATCC
GCGCACGTTC GACCGCAACT CGTCGTCCCT GGACGATATG AAAAGTAT
 
Protein sequence
ATVDEDAPKY QLITFFRFAA IEDPVAEVER HRAHIERRGW ELRGRIYVNE QGINAQMSGR 
GREGEEYAQW VESDARFAGM RISVYPMDAQ AHPRLALRYK PNLVQLEGGT NHLPLTDREK
RAKPNGYEWD VGHFRGAERP VQESFRETVY TNVQDGLGPL ANVDKEKPIM MYCTGGIRCD
VYSTVLREQG YKNVMTLEGG VQAYFDEYGK RDDQLWDNHL FVFDSRLAMA PDGRPSAELG
EAAATLRCYC CGDSSAPPPH RNCPNVDCNR LFLVCSKCTD KLDGFCCEEC TKSAHVRPQL
VVPGRYEKY