Gene OSTLU_19174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19174 
SymbolPPR3_21 
ID5006940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp39426 
End bp41315 
Gene Length1890 bp 
Protein Length629 aa 
Translation table 
GC content62% 
IMG OID640422361 
Productpentatrichopeptide repeat (PPR) protein 
Protein accessionXP_001422791 
Protein GI145357164 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0365143 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA ACGACGGCAC CGAACGCGAG ATCACGGTGG ACGCGCGGTT TAAGTGCGCG 
AAGTGCGCGC GCGGGAGCTC GTGCTTCGCC GTGCACCGAG CGACGAGCGC GGTGAGCGGG
ATGCGAGCGA AACGGCGGTA TAACGAACGG AAAAAGGCGA AAAAGACGAA GAGTAAATAC
GCGTCGGACG TCGAGGGCGG CGGGCGAGGG TCGAGGGCGC GCGCGGACGC GGATCCGGTC
GGGGCGACGC TGGCGCTGAT ACGAGACGGA CACAGGCCGA CGGCGAAGAC GTTTACGGCG
GTGATTAGCA CGCTAGGCGC GATGGGACGC GCGAAAGAGG CTTTGGACGA TGTGTTGCCG
ATGATCGAGG AGTATGGCGA TGACGTCGAT GTGGCGGTGT GGAACGCCGT GGCGCACGCG
TTTTGCGCCG GCGGAGACCC GGCGGGGGCG GAGAAAATCG TGGATAGAAT GGTCGGCGAA
GACGGCGTGG GCGTGAACGG GGCGACGCAT CCGGAGATCA TATACACGTA CGCGAAGCGT
GGAGAGGCAA ATAGAGTGTA TAGGTTGATC AGACGAATGT CGAGCGAGCA TGGAATAATC
CCAGACGAAC GCGCGTATAA CGCGTTTTTG CGAGGCTTGT GCGAGCGGGA CGATCTCGAG
GACGCCGAGG AAGTGCTCAG ACGATGGAAC AACGAAAAGT TCGACTTGGA GCGACAGAGT
GACCGCGGTG GGCGAGTGTC GAAGCCGAGC GCGGCGTCAT ACGGGTTACT CATCGACGCG
TGGACGCGTC GCGGAAACAT GCTCGCCGCG CGCAAGCTTT TACAACAGAT GCAATGGGAG
CGCATCGCGC CCTCACTGCC GCTGTTTAAC ATGCTCATAG ACGGATACCT CAAGCAAGAA
AACATGCGCG CTGCGGAAGG GCTGTTTCGC GAGCTCGAAT CGAGTGGAAC GTGGGATATG
GAGTCGTTGG GGATCAAACC AGACAACGTG ACGTACACGT TGTTTTTAGA TTATTGGGCG
AATCAAGGCC AAGTCGACGC GTGCGAGCGA ATCTTCAATC GCATGCATCG CAAGGAAGTC
GCGCCAGACG TCACAGCTTA CGGGACGTTG GTAAAGGCGT ACGCGCGCGC GCGCGATTCC
GACGGCGCTG AGGCGGTTTT GGATCGACTC GCCGAGGCAA AGGTGGCTCC GTCGGTGGCT
ATTTACAGCG CCGTCGTCGC CGCACATTGT ACGATTGGTA ACATGTCGCG CGCGCGCGAC
GTACTCGAGC GCATGTTCGA CGCGGGCTTG CGCCCGAACG AGCGTACGTT CGCTCATTTC
GCGTGGGGAT ACGGCCAACT GGAAGACATC AACGGTATTG CCGAGGTGGC GAAGTTAATG
CTCGCGAGCG GGCTCAAACT CAAGGGTGCG AACCGCACCG CCATCGTGCG CGCGTGCGAA
GAGTGCGGAA TGAGCATGAG CGCCGTACAA GCGCTGCTGG ATCGAATCAA TCCCGAAATG
ACGCAGCGCA AGGGCGTGTG GAAACGAGAC GGCGGCGAGC CGAAACCCAA ATCCAATCGA
GCCGCCGCCG CCGCCGCCGA CGAGGACCTC GAACCGTCAA CGCTCGAACC GACGAAGACG
AAAGAAATCT ACGGAGGTCC GGAGTCCACC GCGCGTCGGG TGTCGCTTTC CCAGCGTGTA
GAGTCCCTCG ACGAAGACGA CGACGACGGC GATACGGGCG CCGTGGACGC GCCGCCGGCG
AGCTCGGATT GGCCTCGCAA AGTCTCCACT CGCGCCGTCG CCATCGCGCA CCGCGCGTCC
TCGCGCGCGC GCCCTATCCG TCGCACATTT ACACGAACGA ACGCGCGCGC GTTCGCGATC
ACGCGAGCCC TCGGCGCTGC TTCGATGTAA
 
Protein sequence
MTMNDGTERE ITVDARFKCA KCARGSSCFA VHRATSAVSG MRAKRRYNER KKAKKTKSKY 
ASDVEGGGRG SRARADADPV GATLALIRDG HRPTAKTFTA VISTLGAMGR AKEALDDVLP
MIEEYGDDVD VAVWNAVAHA FCAGGDPAGA EKIVDRMVGE DGVGVNGATH PEIIYTYAKR
GEANRVYRLI RRMSSEHGII PDERAYNAFL RGLCERDDLE DAEEVLRRWN NEKFDLERQS
DRGGRVSKPS AASYGLLIDA WTRRGNMLAA RKLLQQMQWE RIAPSLPLFN MLIDGYLKQE
NMRAAEGLFR ELESSGTWDM ESLGIKPDNV TYTLFLDYWA NQGQVDACER IFNRMHRKEV
APDVTAYGTL VKAYARARDS DGAEAVLDRL AEAKVAPSVA IYSAVVAAHC TIGNMSRARD
VLERMFDAGL RPNERTFAHF AWGYGQLEDI NGIAEVAKLM LASGLKLKGA NRTAIVRACE
ECGMSMSAVQ ALLDRINPEM TQRKGVWKRD GGEPKPKSNR AAAAAADEDL EPSTLEPTKT
KEIYGGPEST ARRVSLSQRV ESLDEDDDDG DTGAVDAPPA SSDWPRKVST RAVAIAHRAS
SRARPIRRTF TRTNARAFAI TRALGAASM