Gene OSTLU_25644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25644 
Symbol 
ID5006115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp51512 
End bp52750 
Gene Length1239 bp 
Protein Length385 aa 
Translation table 
GC content60% 
IMG OID640421536 
Productpredicted protein 
Protein accessionXP_001421825 
Protein GI145355138 
COG category[K] Transcription 
COG ID[COG5641] GATA Zn-finger-containing transcription factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.10622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTTGACGCGT CGCGGCGCGG CGCGTCTCGG CGCGCGTCGC GGCGCGACGA CCGGTGCGCG 
CGCCCTGGAC GACGCGGTCG ATGCGCGCGG AACCGCTCGA CGCGACGGCG TCGAACGCGT
CGGCGTCGTC GCCGCGGGAG ACGAGGAAAC GCGCGAGGGA GGCGATTTTC GCGCGCTTCG
ACGCGCTCGC CACCGCGCGC GGGGCGAAGC TGGCGAAGAG GCGAGGAAAA AGGAACTGCG
TCGAGCTGGA GTTAGAGGCG TTGGGAGGCG TGAGCGACGA GACGCTCGCG AAGTTTGCGA
CGTACGGGGA CAAAGTGCAG AAAATGATGC AAAAAGCGAC GCCGCGAGGG GCGACGCCCG
GGCTCGCGCC GATTCGGTAC GCGGCGCGAG GGACGCTTTT TAACGAGCCG TATCGCGTGT
CGGTGGACGA GCAAGGCGTG CATCGCGTGG TTTGGGGGAA GGAAGGCGAG TCGCGCAGGG
TCGTGCGGTC GAAAGATGGG GCGCGTACCC CGCAAGAGGC GGTGGAATGC GTGCACGTCG
AAATTCTGAG GCGAATGAAT GAAAACATTG CCTCAGTCAG CGACGAATCG CATGATGGTA
CATTCGAAGC CCGCATATGC AGAAATTGTC TATGCGACTG CTCGAAGACG CCACTGATGC
GTCGTGGTCC CGATGGGATC GGTACGCTTT GCAACGCTTG TGGGCTGTGG TGGAGTCGAC
ATCAAACGAT GCGCGAGTAT CCGTCGGTGG TGCCAGAAGA AACGCCGCAC AAGGCGATAT
TTATCCGAAA TCCAGTCAAA AGCCGAAGAG CACTGAAAAC GTTAGACGTT TTTGGATACT
ATTCATCGAG CGTGCAAGCG ACGCTCGCCA AGGCTTGCGC CGCCGTGCTC CAAGAAGAAC
GGCGTTCACT TCGGCTTCCG CGCGTCAAGT CGAAACCAGA CGTACGATAC GACTTCAAGC
GAGCTATCCA CGACGTGTTC GATCGGTGCG ACTGTGTCTT CGCGGACTCC CCTTCTCATT
CAGATGATTT AGGATGGGAA CATTACCCCC CATCGATCAC AGACTTCGCA GCTGAACAAC
TCGAAGAACT CGAAGGAGTC AAACTCGAAG GAGTCAAACT CGAAGGAGTC AAACTCGAAG
GAGTCGAACT CGAAGGAGTC GAACTCGAAG AACTCGAACT CGAAGAACTC GAAGCTTTCG
CGTTCGGTTT CGAGCCCCCG CGGTTGGATC CAGTCTGAT
 
Protein sequence
MRAEPLDATA SNASASSPRE TRKRAREAIF ARFDALATAR GAKLAKRRGK RNCVELELEA 
LGGVSDETLA KFATYGDKVQ KMMQKATPRG ATPGLAPIRY AARGTLFNEP YRVSVDEQGV
HRVVWGKEGE SRRVVRSKDG ARTPQEAVEC VHVEILRRMN ENIASVSDES HDGTFEARIC
RNCLCDCSKT PLMRRGPDGI GTLCNACGLW WSRHQTMREY PSVVPEETPH KAIFIRNPVK
SRRALKTLDV FGYYSSSVQA TLAKACAAVL QEERRSLRLP RVKSKPDVRY DFKRAIHDVF
DRCDCVFADS PSHSDDLGWE HYPPSITDFA AEQLEELEGV KLEGVKLEGV KLEGVELEGV
ELEELELEEL EAFAFGFEPP RLDPV