Gene OSTLU_2458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_2458 
SymbolPPR4 
ID5005867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp89686 
End bp90816 
Gene Length1131 bp 
Protein Length377 aa 
Translation table 
GC content60% 
IMG OID640421288 
Productpentatrichopeptide repeat (PPR) protein 
Protein accessionXP_001421968 
Protein GI145355437 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0995914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGCGT ACGTCAGCGA AGGGCGCGTG GAGGAGGCGG AACGCGTGCT GAGGGAGATG 
ACTGCAGAAG GCGTCAGGGC GGGACCGAGA TTGTTTAACA CGCTGATCAC CGGGTACGGG
CGAGAGAAGA ATTTACGAGG CGTCGAGGCG AGCTCGATGG CGATGCGTAC GCTCGGCGTG
ACGCCGAATC AAGCGACGTG GGGGGCGAAA GTGAACGCCT ACGTGAGCTG CGATAGATTA
GATTTAGCCA TGGATACGCT TGAGCAGGGC GTACGATTCT CTCCGCGCGT CGAACGTCGG
CCCGGGGTGC AGGCGTACAC GGCGCTCGTG CAAGGATTGG CGCACTCGGG ACGAGTGGTC
GAGGCGGACG AGCTCCTTCG TCGGATGGCT CGAGACGGCG TCAAGCCGAA CGTGTACACA
TATTCCACAC TCATCGACGG CTTGGCGAAA AGCGCGCAAA TTGGGCTCGC CGAGACGGCG
CTGGCGGAGA TGCGCCGCGC AAAAATCAAG CCGAGCGTCG TGACGTACAA TTCCTTGCTC
AAGGGCGTCG TGCGCGGCAT CGGCAAGGCG GATCGCACGG ATGAAGTTCT CCGACGTGCG
CGTGAGATGT TTGATAGGAT GCGAGACGAC GGCGTGCCAC CGGATTTGGT GACGTACAAC
ACGTTGATTG ACGCGTGCAT CAATGCTCGC GCTCCCGCCG AGGCGTGGAA CATTTTGCGC
GAGATATCAG AATCTGGTTT GAAGCCCGAT GTCGTGACGT ACACGACGCT GCTGAAGTAT
TTCGTGCAAG TCGGGGACGA CTCGGCGACG CAATGGGTGA TCGCCGAATT GGAAACCGAT
CCACAAGTCG TCGAGGACGT CGGCGTGTAC AATTGCTTAA TCAACGCCTA CGCGCGTCAA
GGCGACATGT GCCGCGCCGT CGAGACGCTC GAAGGCATGA AGGCGAAGAA CATCACGCCC
AACGTATCCA CGTACGGCTC GATATTAGAA GGGTACATTC GCCTAGGGAA CGTGGGCGAG
GCTTTTAAGG TGTACAATTT GTGCGTAAAG TCGGCTGGAC TGGCGCCGGA CGCTCGCATG
CGAAAGTCGC TCATCTACGG TTGCGGTTTG CACGGAATGT CGGACATTGC C
 
Protein sequence
VGAYVSEGRV EEAERVLREM TAEGVRAGPR LFNTLITGYG REKNLRGVEA SSMAMRTLGV 
TPNQATWGAK VNAYVSCDRL DLAMDTLEQG VRFSPRVERR PGVQAYTALV QGLAHSGRVV
EADELLRRMA RDGVKPNVYT YSTLIDGLAK SAQIGLAETA LAEMRRAKIK PSVVTYNSLL
KGVVRGIGKA DRTDEVLRRA REMFDRMRDD GVPPDLVTYN TLIDACINAR APAEAWNILR
EISESGLKPD VVTYTTLLKY FVQVGDDSAT QWVIAELETD PQVVEDVGVY NCLINAYARQ
GDMCRAVETL EGMKAKNITP NVSTYGSILE GYIRLGNVGE AFKVYNLCVK SAGLAPDARM
RKSLIYGCGL HGMSDIA