Gene OSTLU_29301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29301 
Symbol 
ID5006690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp19272 
End bp20919 
Gene Length1648 bp 
Protein Length548 aa 
Translation table 
GC content59% 
IMG OID640422111 
Productpredicted protein 
Protein accessionXP_001422452 
Protein GI145356470 
COG category[A] RNA processing and modification 
COG ID[COG5183] Protein involved in mRNA turnover and stability 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.106062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACGC GCCGACGCGC GCGCGCCTCG CCGACGCGCG AGAAAACGAC GACCGAGGAA 
GATGCCGACG CGTGTCGGTT TTGTTTCGAA AGCGCGCGGG AAGACGACCC GTTGATCGCG
CCGTGCGCGT GCAGAGGAGG ACAAGAGTAC ATACACGCGA AGTGCTTGCT TCGATGGCAG
CGCATGGTGG TGGTGCAAGC GCCGACGCAT CCGGCGTTTT GGAACGAGGA CACGCGGAGC
AACGTGTGCA ACGTGTGCAA GGAGGCGTTT ACGACGCCGC CGCCGACGCG AATGACGCTG
ATGAGCTCGT TCACGGGAGC GGAGATCGCG GCGATGTGCG CCGTCGGGCA CTTGTTGGTG
TCGCACGCCG CGTTTAGCGC CAAACTTCGA GAGAAATTGC AAGACATGAA CCCGGCGATG
CGGCGGATTT GCTCGTACGA GTACTGGATC GAAGGGACGT ACTTGATCAC GGAGACGCGC
GCGTCGTCGG ACGAAGCGGG GGAGTCGAGC GAAGGAGACG TAGGCGATGA CACAATCGTG
GCGGTGAACT TGAACGGGCG ATGCGACGTG AGTGAGTTCA TCCAGGGCGA AAGTCAGTTG
TTTGAGATTG TCGGCGCCGG CGGACAATCT CGCGTGCGCT TGCGCCAAGA ATTTGAAGAA
GGAAACGATG ACGATGACGA CAATGATGAC GACAATGATG GCGATAATGA TGACGACAAT
GATGACGATA ATGATGACGA CGAACGAAAT ACGGCAGACG TGGATGACGC AGACGTGACG
AACGAGGACG CCGACGAGGA CGATTTGCCC CGAGACGAAA TCGCCGACGA CGCAGAGCAA
ACACCGGAAC TTGTCATCGA AGTTCCCGAA GACGTTGCTG ACGATCGAGA GGCGTTTATC
GAGCACCTAC AACAGCTATT ATCACCGTCC ATTTTTGAGG TGTATCAGCG ATTTCGCAGG
CGGCGAGTCA TCGAGGACGC CTACGACGAA GTCGCCAGGG AATGGCGAGT CACGCGCCAG
GACGTCGAGA ACGCGGTTGA GATAGAGCCG TTCGATGGTG GACCGTGCGA TCACGACGAA
GTCGCGTTGT GCATTGTCGT CGGTACGGAC ACGTCGTGCG GCTACACGAA AGTCGAGGGA
AGTTTGGCGG GCGCCATCAG CGTAGCGTTT AGAAATTCTC GCGCGTACGC CGACTCGACC
GACGGTTTGC GAGCGGGTGC CGTCGTGACA TGCGCCGCAA CCGCCGACGT GCGCGAGGCC
GTTGGCGTTC TGTGCGGGTT TTCGGAAGAA TCGAACACGT GGAACGTCGC TTCGCCTTTC
GGCGTGCTGA AACGAACTCG AGAGGAGTTC GAGGTTCTTC GAAGCCCGAC GCGCGCCAAA
GTGCTCTGCT TCTTCGGCAC CGCGCAATGG AACCGATCGC AACTTTTGGG TGAAATCGCG
AGAGGACACT GGGGGTTGAC GAAATCAGAG CCCGTCGACG TGGCGCGCGC AGAAACTGCG
TATCGCCGCG CGATGGATTC TGGATCGCTC GTGTTTGCGC CGTTGACTGA AATGACGGAG
GAGTTTATGC GCGACGAACT CGCAGAGATG TCGCGCATTC GATCGAGCGG TCAGCTCGAC
CGCGCGGGGT CATCCGCGTC CCACTGAT
 
Protein sequence
MSTRRRARAS PTREKTTTEE DADACRFCFE SAREDDPLIA PCACRGGQEY IHAKCLLRWQ 
RMVVVQAPTH PAFWNEDTRS NVCNVCKEAF TTPPPTRMTL MSSFTGAEIA AMCAVGHLLV
SHAAFSAKLR EKLQDMNPAM RRICSYEYWI EGTYLITETR ASSDEAGESS EGDVGDDTIV
AVNLNGRCDV SEFIQGESQL FEIVGAGGQS RVRLRQEFEE GNDDDDDNDD DNDGDNDDDN
DDDNDDDERN TADVDDADVT NEDADEDDLP RDEIADDAEQ TPELVIEVPE DVADDREAFI
EHLQQLLSPS IFEVYQRFRR RRVIEDAYDE VAREWRVTRQ DVENAVEIEP FDGGPCDHDE
VALCIVVGTD TSCGYTKVEG SLAGAISVAF RNSRAYADST DGLRAGAVVT CAATADVREA
VGVLCGFSEE SNTWNVASPF GVLKRTREEF EVLRSPTRAK VLCFFGTAQW NRSQLLGEIA
RGHWGLTKSE PVDVARAETA YRRAMDSGSL VFAPLTEMTE EFMRDELAEM SRIRSSGQLD
RAGSSASH