Gene OSTLU_14352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_14352 
SymbolPPR1 
ID5000287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp799469 
End bp800803 
Gene Length1335 bp 
Protein Length444 aa 
Translation table 
GC content61% 
IMG OID640415708 
Productpentatrichopeptide repeat (PPR) protein 
Protein accessionXP_001416498 
Protein GI145343933 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.373458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCGG ATGACTCTGA GGTGTGGCCG AACGCGCACG TGTGCACGAC GATGATGTCG 
TTGGCGACGC GCGCGCGAGA CGCGGATAAG GCGTTGGCGA CGCTGGAGTG GATGAAAAGA
CGGAGCGCGG ACGCGCGAGA GGTGGCGCCG ACGACGCACA CGTATACGAC GTGCGTGCAA
GCGCTGGACG GCGCTGGACG GTGGGAGGAG GCGCTGGGGA TGCTGGGGGA GATGAAGCGC
GCGGGGACGG CGAGAAACGC GCACACGTAC AGCGCGTTGG TCAGAGCCGG GGCGAACGGC
GGGCGCGCGG GGGCGCGCGC GGTGGTGAAG TTGATTCCAG AGATGAAACG CGATGGCGTG
GAACCGGACT TGGCGATCGC GTCGGCGCTC ATCAGCGCTT TCGGGGTTTT GGGGAGCGAA
GACAACGCGA GGGCGATGTT GCGAGCCATC GAGTCGAGCG GGCGGGCGAC GGATAGCAAG
CTGTACGTGG ATTACATGAC GGCGATGTGT CGATGTGGGA ACTACGAAGA GGCGATAGAA
ACCTTCACGC GCGTGCCTCG CACGACTTTC ACGTGCACGG CGGTCATGAA GGCGTACGCT
GAGGCGATGG ATTGGAAGCT CGCCGAGGCT TTGTTCGAGG ACATGCGTCG CGACGGACCG
GCGCCGAACG ACCACACGTA TAGCGCCATG TTGCACGCGT ACGAAAAGAG TTTGCAGTGG
GAGCGCGCGG TGCGCTTGCT CAATCAGCTC ACGAGCGAAG GAAAGGCTAA GGAGATACAT
CACAACATTG TCCTCAGCGT TTTGGGCAAG TGCACGCAGT GGCAGCGCGC GGAGGTTTTA
TTTAGAGACA TGCGAGAGTT GTACAACATT CAGCCCTCTC GCGTGACGTA TAGCACGCTG
ATTTCGGCGT ACGGACGAAG TGGGAAAACT GATCTAGCGC GCAAGGTGTT CGAGCAAATG
CTCGCGCGGC GTATTCCGCC AGACGACTAC ACCTTTGTCG GCTTGATGCT CGGTCCCGCG
AGCGAGGGCA ACTTTCGAGA GTGCTCAAAA ATCGCGACCG AGATGAAGGA GTTGTACGAC
GTCGAGCACA CCGTGCACAC GTACAACACG CTCATCCAAG CCGCGGACAT CGCCGGTAAT
TACGACAAGG CGGTGGAGGT GTACGACGAA CTCTTGCAGC GCGGCATCGA GCCGAACAAC
ACGACGAGAG AACTCGTCGT CGCCGTGAGT AAGCGCGGGG CGAACTTTTA CGATCGCCAA
CAAAAGACCG CCGCCGTCGC ATCCTACGCC GCAGGTCTCG TCGGCGTCAT GGGCATGGCG
CTCGGTCGCT GGTAG
 
Protein sequence
MTADDSEVWP NAHVCTTMMS LATRARDADK ALATLEWMKR RSADAREVAP TTHTYTTCVQ 
ALDGAGRWEE ALGMLGEMKR AGTARNAHTY SALVRAGANG GRAGARAVVK LIPEMKRDGV
EPDLAIASAL ISAFGVLGSE DNARAMLRAI ESSGRATDSK LYVDYMTAMC RCGNYEEAIE
TFTRVPRTTF TCTAVMKAYA EAMDWKLAEA LFEDMRRDGP APNDHTYSAM LHAYEKSLQW
ERAVRLLNQL TSEGKAKEIH HNIVLSVLGK CTQWQRAEVL FRDMRELYNI QPSRVTYSTL
ISAYGRSGKT DLARKVFEQM LARRIPPDDY TFVGLMLGPA SEGNFRECSK IATEMKELYD
VEHTVHTYNT LIQAADIAGN YDKAVEVYDE LLQRGIEPNN TTRELVVAVS KRGANFYDRQ
QKTAAVASYA AGLVGVMGMA LGRW