Gene OSTLU_32043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32043 
Symbol 
ID5002356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp175037 
End bp178202 
Gene Length3166 bp 
Protein Length889 aa 
Translation table 
GC content57% 
IMG OID640417777 
Productpredicted protein 
Protein accessionXP_001418171 
Protein GI145347434 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0485159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0952764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCGCGCGAC GCGCGACGCG CGACGCATCG CGAACGAGCG CGCGACGCGA CGGTCGATCG 
ACGAAACGCG TCTCGACGCG CGCCGACGAC TCGCGACGGG CCCGGACGCG CGAGCCGAGC
GCGAGGGCGC GGTGAAGACA CGAACGGGGG ATTGACCGCG CGGCGGCATG AGCGCGACGG
CGAGCGCGCG CGGCGGCGCG GGATATGCGC GGGATGGACG CGCGAGCGCG CGAACGATGC
GAGGGATGCG AGCGCGACGA ACGACGGGCG TCGGGACGCG CGGCGGTGGA CGCGGGACGA
GGGAAGAGGG AGGAACGGCG GCGTCGCGCG CGCGAGGGTG CCCCGCGACG ACGACGCGGG
CGATATCCGA GGACGCGGCG TTCGGCGCGA AGGAGGCGTC GAACGCGAGC GGGAAGATCG
CGAAGGGCTA TGGACGGTGG TCGACGATGA CGCTGGGGGA GCTGATCGAT CGCGCGGGGG
AGCTCGAGGC GAAAGTGGAC GGGACGCGCG CGAACGAGTC GGAGTACTTT TACATTTTTC
GCGAGTTGGT GCGATGTAAA AGACTTCATG ATTCGGTGGA TTTATTGAAG CACATGAAGG
AGCGAGGGGT GAAAGAGCTC GGGCGAAGAG TGTCGCACAG AGACTTCTTC AGCGCGTGTC
GTTCGCTGCG CGTTGTGTCC GTCGGATTTG AGTTCGTGGA CGTCATCGAG AGCGGAGACA
TTCGACCGTA CAACATGTTG GTGCACGCAT GCGCCACGGC GGGGGATTTA CAAGCGGCGA
CGCTGGCGAT AGAGAAGATG AAAAATGCGG GATTTGAGCC TGATTTACAA GCGTACACCA
CGCTTCTTGG AGCTTGCTCA AAGTGTGGGG ACGTCGAACG CGCGTTCGAG GTGTACGCCG
AACTCAAGAG GGCGGGGTTT GAGCCAAACG AGAAGACGTA CGGATCGATG ATCGACGCCA
TCTCGCGAGA CCTTGCGACG TCTCTTAAGG GTTCGAGGAA GAGACGAGTA GACTCCGAAC
ACGTTCGTTC GACTTTACAA AGCTGTTTCA TGATTTTTGA GGAGATTAAA ACCACGAATA
TGAAGCTGGA TAAGATAGTG ATGAATAGCT TGCTCACCGT GTGCGCTCGC GCGGCGGTGG
TTCCGAGTGT GCGCAAAGAA GCTTGCGAGA AAGTGGCGAT GGTACACGAC GAAATGATTG
AGCGTGGCTT TGAGCTCGAC TCTTACGCGT ACCAGGCTCT CATCTGCTGC GCCTTGGCTG
AGAAGAATTA CACGAGGGCT TTTGAGTACT TTGACGAGAT GCACGACGCC GGTATCAATG
GTACCACCGA AGTATACACA GTGATGATCA GAGCGTACGG GAAGCTTGGT AAAGCTGATA
AAGCTAAGCT GATCTGGTAC GCCATGTTGG AGGATAACAT CATTCCGGAT CAAATGAGTT
ACGCGACGAT GATGCGACTC GCCCTGTTGG ACGAGGACGA CGATTTCTGC GACGAGCTGA
TGACGTCCAT GAGACGCAAT CGCGTCCGCC CAGGGCCTGA GTTGTACTCT ACGCTCACTG
GTGTGGCCGC GCGACAAGGC GATGCTTCAC AGGTGGAAGA GATCATGCAA AACGCGAAGA
AGCGAGGCGT GGTGGCTCCC ATCGAATGCT ACAACTCGCT CATCGCCGCG CACGCTCGCG
CCGACCGGCC AGATCTCGCC GTCGAGGCTG CGGGCAAGCT CGAAGCGGCT GGGTACGAGC
TTGACGCTAT TTCATACGAA GGACTTATTT TCGCGTACGC TTTCGCGAGA GATGTCGAAG
AAGCAAGTAA CATGTTTGAG CGTCTCCTCG AGTCTGGTAT TCGCCCGACA TTCCCGACAT
TCAACTGTCT CGTGGCCGCT CACGCTCGAA GTGGTGATAT GGACGAGGCA TGCCGCTTGG
TAAGTGTTAT GAAACAGCAT GGATACGTGG AGGATTCGAT CACGTGGCGC GAGCTTCTTT
TGGGCAGCGT TCAGTCGGGT GACATTGAAG CCGCATGGAA GATGTACAAA GAGTCCCGCG
CGTCTGGAAA TGCCGATTCC GAGCGTGCAC TCAACACGAT TCTCGGTCAA ACTTTAGTGC
ATATTAGAAG TCTCACGGAT ATGAAGAACC GATCGAACGG GAAACCGAAC GAGTTCGGCT
CATTTGACGA CGAAGGGGAT TACATTGCCC AGGAATGGAC GGAACGTGCG GTCGCGGCTT
TCCACGAAGC CACGCTCGCT GGAATCAAAC CTCGGGTTGA AACGTTGTCC ACAATGCTCG
CTTGCTTACG TCCACCTTCG ACGGACGAGC AAAATGCAGC TGAGTACAGC GAAGTTGCCC
GAGCCGTGAG TCACGAGACG AGCTCCCATG AAGACGCCGC CAGGTACTAC CCTTCGCAAG
CCCTCATCAT GTACGAAGAA GCTCAAGGCT TGGGTATTGT ACCGAAGTTT AGTCGTGATG
ATGAAGACTT TGTCTACGAT ATCCGAGAGT TCCCACCAGC GGCTGCCGAG GTCATGTTGT
TGACGTGGTT GCGTGTCGTT CGCCGGCGCA CAGACGCGCA TGGATTAGAC GCTACGATAC
CGACTATGAC TATTCGTGTG AGAGCTGACG AAGAAGTCGT TCGAATGATC AAGGAGCAAC
ACATGGATCG AATTGATCAC TCGCTGGGTC GCTTGTGCAA GACTGGAGAA CGGTTGCTGA
CATTATTGCG CCGCCTCCGT ATCAATTACG GTGGTGGCTT GCAGGAGGGC ACTATCGAAT
TGAGTGGCCA CGCGCTCGGT CGATGGCTTC AAGGTTTTGT TCCGGGCGAC TTCGGCAATC
ACACCGGTTC CGTGTTCAGT GAGCATTCAC TTTCGGGCGG GGTGAGAGAC CAAGCGATGC
GCATCCGCGC CAATTCTTTC GGCAGTAAAG ACGACGACGT GTGGACTCCG TCAAAGATGC
GTCAAGCCGC ATTCAATATC CACGATTATT ATGGCAATGA TGATGACGAC CCGTCAGATT
TTGGTGCCAG GCCATTCTAT CCCAAGAACT GGGTATCACA GAGTTACGTG TCGAGCTATG
ATGAGGACGA TGACGACGCC ACAGATCTCG AGCGCATCTT AGGAAGTCGC AAGTAACGTA
CGCACGCGCG CGCAGAAGTA CGTAGATTAT TTTTTTAGGA ATTCAT
 
Protein sequence
MTLGELIDRA GELEAKVDGT RANESEYFYI FRELVRCKRL HDSVDLLKHM KERGVKELGR 
RVSHRDFFSA CRSLRVVSVG FEFVDVIESG DIRPYNMLVH ACATAGDLQA ATLAIEKMKN
AGFEPDLQAY TTLLGACSKC GDVERAFEVY AELKRAGFEP NEKTYGSMID AISRDLATSL
KGSRKRRVDS EHVRSTLQSC FMIFEEIKTT NMKLDKIVMN SLLTVCARAA VVPSVRKEAC
EKVAMVHDEM IERGFELDSY AYQALICCAL AEKNYTRAFE YFDEMHDAGI NGTTEVYTVM
IRAYGKLGKA DKAKLIWYAM LEDNIIPDQM SYATMMRLAL LDEDDDFCDE LMTSMRRNRV
RPGPELYSTL TGVAARQGDA SQVEEIMQNA KKRGVVAPIE CYNSLIAAHA RADRPDLAVE
AAGKLEAAGY ELDAISYEGL IFAYAFARDV EEASNMFERL LESGIRPTFP TFNCLVAAHA
RSGDMDEACR LVSVMKQHGY VEDSITWREL LLGSVQSGDI EAAWKMYKES RASGNADSER
ALNTILGQTL VHIRSLTDMK NRSNGKPNEF GSFDDEGDYI AQEWTERAVA AFHEATLAGI
KPRVETLSTM LACLRPPSTD EQNAAEYSEV ARAVSHETSS HEDAARYYPS QALIMYEEAQ
GLGIVPKFSR DDEDFVYDIR EFPPAAAEVM LLTWLRVVRR RTDAHGLDAT IPTMTIRVRA
DEEVVRMIKE QHMDRIDHSL GRLCKTGERL LTLLRRLRIN YGGGLQEGTI ELSGHALGRW
LQGFVPGDFG NHTGSVFSEH SLSGGVRDQA MRIRANSFGS KDDDVWTPSK MRQAAFNIHD
YYGNDDDDPS DFGARPFYPK NWVSQSYVSS YDEDDDDATD LERILGSRK