Gene OSTLU_30981 details


Gene Information

Locus tag: OSTLU_30981
Symbol:
ID: 5001144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 86899
End bp: 88445
Gene Length: 1547 bp
Protein Length: 439 aa
Translation table:
GC content: 59%
IMG OID: 640416565
Product: predicted protein
Protein accession: XP_001417423
Protein GI: 145345872
COG category:
COG ID:
TIGRFAM ID: [TIGR00756] pentatricopeptide repeat domain (PPR motif)


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.116585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCGCGCGCG AGGGAGGGAG GACGAGGGAA GGCGACGACG GAGGTGGATC TGAGGGTGAC 
GAAGTCGTCG CACGGGGCGC TGCAGAGGCA CGTGAACGCG AGTAAGAGGG AACCGACGCA
CGCGCAAAAA AGGGAGACGA GAGGCGGTGG ACGCGGGGAT GGCGCGGCGA GTGGAAGCGG
GAGGCCGCCG TTGCGCGCGG GGGAACGGAT CGTGCGGGCG ATGGAATCTA TCGACGCGCC
GGCGCAGGGC GAACGCATCG CGTGGGACGT CGTCGCGAGC GCGGTGAATC CCGAGGGTGG
GAAGTTTAAA TTCACCACGT CGACGCTTAA CTTTGCCATT AAGGAGCTCG GTGAGCGTGG
GAGCTTCGAT CGCGCGCACG CGCTGTATTT GTGGATGTCG CGCAAGCAAG GGCGGTACGC
GCCGAATGAG TTTACGTACG TGTCGCTCGC CAGCGCGGCG AAGACGCTGA CGCAAACGCG
AACGGTGCAA AACTTGTGGC AGCGCGGTAT CGCGGAGAAT GATGAGAGCT TGATTTGTAA
CGAGATCGCG TCTGCGGTGA TCGCCGCACT CAATCGCGTG AGTGATTGGA GCGGGGCGTA
TCAAGTGTTT CGGGACATGG GCGACAAGGG CAAACCGAGA AATTTGTACA CTTACACAGC
CGTGCTCACG GCGTTGAGAG ATGAGGCGAA GCCGGACGAG GCGTTGGCCG TACTCAACGA
GATGGCGCGT GAGCCTGGAG TTCAACCGAC GAGCTTAGCG TTTTCGTTGA CGCTGACGGC
GTTCGATAAT TGTCGACGCT GGATCGAAGG TAACGCGGTC GCAAAACGTA TTAAAAAGTA
CGACGTGCGT CCGGACGCGA CGTTGATGCA TGCCATCATC ACCATGGCAG GACGTGCCGG
CGACATGGCG CACGCGAACG AAGTTTTCGA CGCTATGCGC AACTCAACGA TGATCGTCAC
GACGTACACC TTCAACGCGC TCTTGGGCGG ATACGCGAGG TACGGGGACT GGGAAGGATG
CACTGAAGTG TATGACGAGA TGAAACGCTC GAAAATCCAG CCAGACTCGT ACACTTTCAC
GCAGCTCATA TCTGCTGCCG AGCGAAGCGG AGAGTACATC GCGGCGGACG GCGTTTGGAC
GGAGATGTTG CGCAACCGAA TCATCCCGCA CACCGTCATG TGCGGAGCGT ACATTCACTG
CTTGGGATGC CAAGGTCGAG ACCTCGAAGC GGAGGCCGTG ATGGAGAAAA TGCGCAACTA
TTGGGATGTT CCACGGAACG CTGCGGTGTA CAATGCTCTC ATCGGGGCGC ACGTGCGAAG
CGGCGAAGTC ACGCGAGGAC TCAGCGTGCT CGACGATATG CAACGCATCG ATGGACTTAT
GCCCACCGAA ATCACATTCG CCGTGCTTAT TCGCGCATGC CAGGAAAGTG CGCTACACAA
ACGCGCGGAA GGTTTAGAAG GCATGCGCGC CTCGCTCGCG AACGCTGGAC AACTCATTCA
AGATTTGTCT GGCGCTTCAA CCTCGACGGC GAAAGCATAA CCATTGT
 
Protein sequence
MESIDAPAQG ERIAWDVVAS AVNPEGGKFK FTTSTLNFAI KELGERGSFD RAHALYLWMS 
RKQGRYAPNE FTYVSLASAA KTLTQTRTVQ NLWQRGIAEN DESLICNEIA SAVIAALNRV
SDWSGAYQVF RDMGDKGKPR NLYTYTAVLT ALRDEAKPDE ALAVLNEMAR EPGVQPTSLA
FSLTLTAFDN CRRWIEGNAV AKRIKKYDVR PDATLMHAII TMAGRAGDMA HANEVFDAMR
NSTMIVTTYT FNALLGGYAR YGDWEGCTEV YDEMKRSKIQ PDSYTFTQLI SAAERSGEYI
AADGVWTEML RNRIIPHTVM CGAYIHCLGC QGRDLEAEAV MEKMRNYWDV PRNAAVYNAL
IGAHVRSGEV TRGLSVLDDM QRIDGLMPTE ITFAVLIRAC QESALHKRAE GLEGMRASLA
NAGQLIQDLS GASTSTAKA
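
Consistency check (illustrative)

The lengths reported under Gene Information can be cross-checked against the coordinates and the sequence blocks. The following is a minimal Python sketch, not part of the original record. The helper names (clean_sequence, gc_fraction, span_length) are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which matches the reported gene length of 1547 bp.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record: first gene-sequence line and
# last protein-sequence line.
first_gene_line = (
    "CGCGCGCGCG AGGGAGGGAG GACGAGGGAA GGCGACGACG GAGGTGGATC TGAGGGTGAC"
)
last_protein_line = "NAGQLIQDLS GASTSTAKA"

# Start bp 86899 and End bp 88445 should give the reported gene length.
print(span_length(86899, 88445))

# Each full sequence line holds six blocks of ten residues.
print(len(clean_sequence(first_gene_line)))

# Seven full protein lines plus the final partial line.
print(7 * 60 + len(clean_sequence(last_protein_line)))
```

To check the reported GC content, pass the full gene sequence (all lines joined with clean_sequence) to gc_fraction and compare the result with the 59% listed in the record.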