Gene OSTLU_29142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29142 
SymbolPPR7 
ID4999912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp888728 
End bp890675 
Gene Length1948 bp 
Protein Length632 aa 
Translation table 
GC content57% 
IMG OID640415333 
Productpentatrichopeptide repeat (PPR) protein 
Protein accessionXP_001415964 
Protein GI145341746 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00572015 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACG ACGCGTTCGA GAGGTTGGAT AAGCAAAAGG CGAGTCGACG GGAGAAGCGC 
ACGGCGGTGC GGGCGGTGCG CGAGGAGGCC GCGAGTGGTG GTGCGAGAGG CGGGACGGAC
GCGCGGTTGT CGTTTTTGTA CGAAGAGGGG GTGAACGCGA ACGACGCGGG GGCGGTGACG
AAGTGCAACG ACGTGTTGCG GCGATGCGCG AGCATGGAGG AAGTGCTCGG TTTGGTGAAG
GAAATGCGAG ACGCGGGGTT GGAGCCCGTG GAGAGCACGT ACGTAGCGGT GATGCTCGTG
TGTCAAAACG TTGGCTCGCC CGAACGTGCG GTGCAGGTGT ACGACGCGAT GACGGAGGCG
CAAGTGCGCA TGACGGGACG TACGTTTCAT CTGGCCATCG AGTTGGCGCT CAAGGCTGAG
ATGTTCAAGG ACGCGCTTCG TATCAAAGAT GATATGCGCT ATGTCGGGTT GCCGGTGAGC
TCGAAGCTTT ACGTGACGCT TCTGCGCGCG TTGGCGGACA ACGACATGGG AAAGCGAAAG
GGGCCACAGG AGCGACTCAT TCGCACGTGT CGTCTTTTTG AAGAGATGCT CAACGAAGAT
GTCGACCCAG CGCCGGCGGC GTATCACACT CTCATCGTTG CCGCACATCG CGCGAATCAG
CACGACCTGG CGGTGCGCAC TTTCGATGAA CTCATCGAAG AGGGGATCAC GCCGGCGAGA
CAAACGTACG AGACTGCCCT CGATTCCATG GCACGAAGCG GACTTCTCGC GCAAGCTTTG
GAAGTTTTTG TGCAGATGAA GTCGAACGGA CTCATGCCAC GAAAGGTGAC GTACAACACA
CTTTTGAGTG CTTGCGTGAA CGCACCGCAA CCGCGCGTGG AGGCGGCGTT TGAAATTTTT
GAAGAAATGC AAACGAAAGG CAACGTGACG CCGGACAGGA GCACGTACTC ACTACTCATC
GACGCCGCGT GTAAAGCGGG TAAACCCGAG ATGGCGTTTG ACGCGTTCAG CCACATGCGG
GATTCTGGTA TTGAAATCCA AGTCGGCACG TTAAATCGAT TGATTCACGC TGCGGGATTG
AACGCCAAGG AGGACTCTAC ATCGGTACAA GCAACTCTCG AGCTTTATGA CGCGATGAAA
AAGCTTGGCG TTGAACCAGA CGTGATCACG TATGGTAGTT TGATCGCTAC ATGCGCTAAG
GCGCGCGATG CGGCAACTGC GATAAAGGTG TATGAGGAAA TGCGCGCCGC GGGCGTCGAG
CCAAACCGCA TCTTGTTCAA CGTCTTGATC AACGCCCTCG GGCGTGCGAA CCGTAGCGAA
GAGGCGATGG AGTACTTTCG CGTGATGCAG AAGCAATCCG AAGTGAATAG TTCGTTAACT
CCGAATCGGG AGACGTACAC GACGGTTTTC GATGCGTTCA TCGGCGGCGG CGGCGCAGAG
CTCGCCATGG CGAAACAATC GCTCGAATCA GGCGACGACG CATCCTTCGT GCACAGCGCC
CAGGTTGCGA AGTTGCGCGA AATCTACGCT CAGGGCGTTG AACACGGTGT GTACGAAGAC
TTGTCAGTCG TACTGTCGAC GCGCGAAGCG GAGTCTGAGT CTTGTACGAT AAACATGAAC
CAGCTGTCGC GCACCGAAGC TACGGTGGCG ACGCTCGTCC TGCTCGAACG CATGTCCACG
CTTTCGAGCG ATCAAGTACC ATCCGCCATG TTCATCTACG CGGGTAAGGT AAAGCCGGGT
AAAAACGGTG CCCAGCGTCG ATTGTTGGCG ATCGAAACTG TCCTTCGTGC GGCTAACGTC
AAGTTTGAAA TCGGCGAAGT CGGTTCCGCC GAACTCATCG CCGTGAAGGG CAAGCACTGT
CGCGCGTGGA TCGAGAAAAA CGCCGGCACG TTTGTGTAAA TACTCTAGAC CTTAGTCATT
CTTTTAACCA TCAGATGTGC TATTTGCT
 
Protein sequence
MDDDAFERLD KQKASRREKR TAVRAVREEA ASGGARGGTD ARLSFLYEEG VNANDAGAVT 
KCNDVLRRCA SMEEVLGLVK EMRDAGLEPV ESTYVAVMLV CQNVGSPERA VQVYDAMTEA
QVRMTGRTFH LAIELALKAE MFKDALRIKD DMRYVGLPVS SKLYVTLLRA LADNDMGKRK
GPQERLIRTC RLFEEMLNED VDPAPAAYHT LIVAAHRANQ HDLAVRTFDE LIEEGITPAR
QTYETALDSM ARSGLLAQAL EVFVQMKSNG LMPRKVTYNT LLSACVNAPQ PRVEAAFEIF
EEMQTKGNVT PDRSTYSLLI DAACKAGKPE MAFDAFSHMR DSGIEIQVGT LNRLIHAAGL
NAKEDSTSVQ ATLELYDAMK KLGVEPDVIT YGSLIATCAK ARDAATAIKV YEEMRAAGVE
PNRILFNVLI NALGRANRSE EAMEYFRVMQ KQSEVNSSLT PNRETYTTVF DAFIGGGGAE
LAMAKQSLES GDDASFVHSA QVAKLREIYA QGVEHGVYED LSVVLSTREA ESESCTINMN
QLSRTEATVA TLVLLERMST LSSDQVPSAM FIYAGKVKPG KNGAQRRLLA IETVLRAANV
KFEIGEVGSA ELIAVKGKHC RAWIEKNAGT FV