Gene OSTLU_14603 details


Gene Information

Locus tag: OSTLU_14603
Symbol: PPR2
ID: 5000950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 420165
End bp: 422543
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table:
GC content: 58%
IMG OID: 640416371
Product: pentatricopeptide repeat (PPR) protein
Protein accession: XP_001416655
Protein GI: 145344261
COG category:
COG ID:
TIGRFAM ID: [TIGR00756] pentatricopeptide repeat domain (PPR motif)
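
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 420165
END_BP = 422543
GENE_LENGTH_BP = 2379
PROTEIN_LENGTH_AA = 792


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the record is internally consistent: the span covers 2379 bp, which is 793 codons, or 792 residues plus one stop codon.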


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0183985
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCGCG CGGTCGCGCG CGCCTGCGCC CGGGCGCGAT CGCGCGCGGA CGCGCAAACG 
TGCGGTCGAA CGCGCGCGCG CGACGAGTCG GCGGTGTGGA CGTCGGTGCG CGCGACGACG
CCGAGGTGTT GGCTCGGGAC AAAGGTTGAA CTCGCCGAAG ACGACGGGAC GAGCGCGAGC
GCGAGAGAGG ACAGCGCGAG AGAGCAGACG CTGCGCCGAC GGCACGTGGC GAGCATGCGA
CGCGCGCTTC GCGCGAATGA TTTTGAGGGC GTGCGGAGAG AGTTTGAGGC GCTCGCGGCG
TCGACGACGA AACCGGGACG CGCGGCGTGG AACGTAAACG TGCTCGCGGC GGCGCGAGCG
CGAGAGCCCG AGGCGGCGCT GTCGATCGTC GAGTCAATGC TCGCGGATAA TATCGGACCG
GATGTGAGCA CGCACACGGC GGTGATGCAG GCTTACGTGC GCGCGGGACG ATACGGGGAC
GCGTTTAACT GGTTGAAGAT GCACCTGTAC GGCGTACCGA AGGACGTTGA ACAAGGCTTT
GATGGCGAAG ATAACGTAGA AATGGACACG AGGTATGATC TATCACATGT GATTGGGGAC
ATGATCGATG GCGATGAGGA ACACGTGTCG AAGACGCGCG TGAATAGCAT GATGTTTACA
ACCGTGATGA ACGGCGCGGC AAAGGCGGGC GACCGAGCAA CGGTGCAAGA AATCGAGACG
ATGATGTTTG AGTTCGATGT GGCACCTTTA GCAGATACGT TACACTGTTT GCTCAAGCTC
GAGCGCGTCG CGGGGACGAG TGCATCGGTG GAATCTGTGT GGGATCGTTC CAAGCTCAAA
GCGCGCGCGT TGAAAGCACA TCAAGAACGC GTCGTCGCTC ACGCGTTTCT GGGTCGCCAA
AGAACGCAGC GCGCCGCCGA ATCGCGTGCG CTCGCAGTGA AAGCATTAGA AGATCTGTAC
GATCGCGTGG GTCGGAAGCG CTCGGACGAA GACTTTGCGA AGCTGGAGAA GAACAATCAT
TCAGGGTCGA GATATGCTCG CTGGACCGGC GAACGTGGTT TAGAAGAGTA CCGAAACAAC
TTGGTGAGCG ACGACGATAT CAAGAAAGTC GGAACGAAGG AAGTTTTAAT AGCCACGAAC
GCCGTGATGC TCGCGCATGC TGCGGTCGGT GACACGGACA CAGTCAATAC ATGGTTTGTT
CGAATGGAAG AGGATTTGGG CATCAACGCA GACGTGCGAA CGTTCAACGC AGTACTACGC
GCCGAATACA TGCGAGATAA GATAAAGCTC GGTGTCGTGG ATCATGAGCA AGCAATGATG
CGATTTCAAG AAGTGATGCT TGCGATGGAA GATCGAGAAG TCGAGCCGAC GACGTACACG
TTTTCGACAC TTCTCTTGGC GCACGCAGAA CAAAGCCACT TACCCGGTGT CGCAGAGGTT
TTGAAGCTCA TGCAAGAGCG AGGACTCGCG CTTGATACTA CGATGTACAA TATTTTATTG
GGTGCTTGCG CGCGAGCGGG TGATTTAGAC GCTGCGCTCA ACGCGCGGGC AAGCATGGTC
AAGTCTGGCA TTGCCGCCGG CCCAGACACG TTCGTGCCGC TCTTCGCCGC GTGCACGAAA
CAGGCGGAGG AACTGGACTT GGACGGTGAT ATGTTTGAAT TAGATGAAGA AGTCGTTGGA
CCTCTGTTAG ATCGCACACG AGCGACTTTA GATAGCGTGG AGTTGGATAT GCTCTCAAGC
AACGTCGAGC ACAACACGCA GTCGTTCACG GCTTTGGTCA AAGCCAGAGG GGCTTTGGGT
CAGCCAGACG CCGTGTTCGA CATGATCAGC GAATTGCCAG AAAACATCGA ACTCGACGAA
GTCGCCATCG GCCTGAGTGT TTTGGCGGTT GCGAAGCGTG AACCAACCAA GGCTATTGCG
CTCGCCGATT CCCTCATGGA CGAGTCGATG GTGCTGGATC CGTGGCTTCT CAACTGCATC
ATCACCGCTT ACTCTCATCT CGGTCAAATC AAAGCAGCTA CCGAGCGCGT GCAAGACTTC
ATCAAGCGAG GCGGTGTGCC GACTGTCGCA ACGTACAACG CGCTCTTCCG ATGCGCCGCC
CAGAGCGGCA GCTTTGCCGA ATACGCCCCG CTCGTCATGA ACGAGATGTC ATCGCGCGAT
CTCAAACCGG ACAAGTACAC TCGCAAATTT ATCGCTGCTG TCACCGCCGG TTCATCCGTC
CTCGATCGCG AGACGGCGGA AGCGCTTCTC AAAAAGTGTC CCGGCGGCGA CAAGTTCTTC
GTCGCCGAAG ACGCCGCCGG ACAGGACGAC TTCGATTTCC TCGCCGACGC CGGCTTCGAC
GACGACGACG ACGACGACGA CGACGATCTA TTATTGTAA
 
Protein sequence
MFRAVARACA RARSRADAQT CGRTRARDES AVWTSVRATT PRCWLGTKVE LAEDDGTSAS 
AREDSAREQT LRRRHVASMR RALRANDFEG VRREFEALAA STTKPGRAAW NVNVLAAARA
REPEAALSIV ESMLADNIGP DVSTHTAVMQ AYVRAGRYGD AFNWLKMHLY GVPKDVEQGF
DGEDNVEMDT RYDLSHVIGD MIDGDEEHVS KTRVNSMMFT TVMNGAAKAG DRATVQEIET
MMFEFDVAPL ADTLHCLLKL ERVAGTSASV ESVWDRSKLK ARALKAHQER VVAHAFLGRQ
RTQRAAESRA LAVKALEDLY DRVGRKRSDE DFAKLEKNNH SGSRYARWTG ERGLEEYRNN
LVSDDDIKKV GTKEVLIATN AVMLAHAAVG DTDTVNTWFV RMEEDLGINA DVRTFNAVLR
AEYMRDKIKL GVVDHEQAMM RFQEVMLAME DREVEPTTYT FSTLLLAHAE QSHLPGVAEV
LKLMQERGLA LDTTMYNILL GACARAGDLD AALNARASMV KSGIAAGPDT FVPLFAACTK
QAEELDLDGD MFELDEEVVG PLLDRTRATL DSVELDMLSS NVEHNTQSFT ALVKARGALG
QPDAVFDMIS ELPENIELDE VAIGLSVLAV AKREPTKAIA LADSLMDESM VLDPWLLNCI
ITAYSHLGQI KAATERVQDF IKRGGVPTVA TYNALFRCAA QSGSFAEYAP LVMNEMSSRD
LKPDKYTRKF IAAVTAGSSV LDRETAEALL KKCPGGDKFF VAEDAAGQDD FDFLADAGFD
DDDDDDDDDL LL
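
Both sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are hypothetical, and only the first and last rows of the gene sequence are used here rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence above (not the full gene).
first_row = "ATGTTTCGCG CGGTCGCGCG CGCCTGCGCC CGGGCGCGAT CGCGCGCGGA CGCGCAAACG"
last_row = "GACGACGACG ACGACGACGA CGACGATCTA TTATTGTAA"

head = clean_sequence(first_row)
tail = clean_sequence(last_row)

print(len(head), len(tail))    # row lengths after cleaning
print(head[:3], tail[-3:])     # first and last codon of the gene
print(round(gc_fraction(head), 3))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` would give a value to compare against the GC content field of the record.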