Gene OSTLU_31572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31572 
Symbol 
ID5001982 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp153023 
End bp154710 
Gene Length1688 bp 
Protein Length536 aa 
Translation table 
GC content60% 
IMG OID640417403 
Productpredicted protein 
Protein accessionXP_001417933 
Protein GI145346927 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.314666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.270485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGCGG TGGATGGGGC GGTGATGCTG CGCGCGGCGG ATACCGGAGA CGCGCCTGAG 
GCGATTTCGG CGCTGCAGCG GATGAAGAAG GCGGGCGAGC GAAACGGCGT GGCGGTGCCG
CTGGCGTTTT ACAACATGAC GCTGCGAGCG TGCAAGCGCT CGCGTCCGCC GGCGAGCGCG
GACGCGACGA GATTGCTGCG CGAGATGCGA GAGCACGGCC CGGGTCCGGA TGCCAAGACG
TATCACGAGG TGATCGCGGC GTACGCGCGT GCGATTGAGT GGAAGCTCGC CGAACAAACG
TTTGAGGAGA TGAAGCGCGA CTTCAGAGGT CGCGGGCCCA CGTGGCACCC GAGCGTGCGC
GTGTACACGT CCCTCATCAG CGCGTACGGC AAGGGCGGAC AATTCGAAAA GGCGAGCGAG
TTATTCGAAA GCTTGTTTGC GAGCTCGCAC GTGCAATTGG ACACGGGTGT GTACAACGCA
TTGCTCTCTG CCGCGGTGAA CTCGGGCCGT TACAAGGACG CCGCCGCCGT GTTTGAGCGC
ATGCAAACGG AAGGTGTCAG GCGAAACGTG ACGACATACA ACGGTATGTT GCAATCGCTC
GGAAGGCAGC GACGCATTCG TGACATGGAA AATATGAGCC AATCCATGCA GCGCGCGGGG
GTCATGCCGA ACGAAACCAC GTACAGCGTG TTAATCACCG CGCACGGCAA TAGCGGCAAC
ATCGATCGGG CTCTCGAGCT CCTGCATCAA GTCATCATTG CCCCGCGTTT GCACGCGACG
GCCGTGATAT TCAACAGCGC GCTCGGGGCG TGTGTCAAGG CGGGTAATCT TGAAGGCACG
CAACGGGTTT TACGAGTGAT GGAGACGGAG GGCGTGCGAT CGACTCTCGT CACGTACAAC
ACGCTCCTGA TGGAGGCGAG CGCAGAGCGT GACTGGGTGC GCGCGACGAA AATATATAAA
GAACTTCTTC TTTCGGGATT CGCGCCGGAC ACCATCACAC TCGATTGCTT GTGCGGTATT
GAAAAGCTTC AGGCGTGTCG CGAGGAAAGG CTTCGCGAGG AAATCAAGCG AGCCGAATTG
GAGGGCATCG ATTTGCCCGA ATTCGAACGC ACGTGCGACG TGAGCGATAG TCCAGTCGGG
AACCTCCCGG TTCTTATACG CGCGCTCGCC GACGATCGCG AGCTCGAAGA AGTTCCAGGA
TGGCGAGGGT TCGTTTCCGA CGCCCTGCTC CGAGTGCTTC ACGTTAACAA CGAGTACGCC
GAGGTGGAGG ACACATTCAA ATACATGCTG ACGAGCGACG TCACGCGCAC CGTGCACACG
TACAACTCGC TATTGATTTC GTATGAGGCT CGTAAAGAGT GGCAAAAGGC GGGCGAGGCG
ATGACGCAAA TGACGAGTGA AGGGATCGTG CCAAACGCGC TCACGTTTGA CGCGCTCATC
GATGTTTGCG AGGAGATGGG TCAATGGGAT CGCGCAACGA CGTGGCTCGA ACAAGCTCAA
GCGGCTGGGC ACTTCCAATG CGAGGACGAT CTCGGTGTTT TAGACCTGCA CCGTATTCGT
TCCGCCGGCA CCGCGCAGTG CGTCTTACGT TGGTGGTTGC GCCGAATGCG TCAACGCGCT
TTGGCACCGC TCGACGTTCG AGCCGCGGGC AAAGGAACGC GTGCGCTCGT ATCAGGGTTG
AAGAATAA
 
Protein sequence
MWAVDGAVML RAADTGDAPE AISALQRMKK AGERNGVAVP LAFYNMTLRA CKRSRPPASA 
DATRLLREMR EHGPGPDAKT YHEVIAAYAR AIEWKLAEQT FEEMKRDFRG RGPTWHPSVR
VYTSLISAYG KGGQFEKASE LFESLFASSH VQLDTGVYNA LLSAAVNSGR YKDAAAVFER
MQTEGVRRNV TTYNGMLQSL GRQRRIRDME NMSQSMQRAG VMPNETTYSV LITAHGNSGN
IDRALELLHQ VIIAPRLHAT AVIFNSALGA CVKAGNLEGT QRVLRVMETE GVRSTLVTYN
TLLMEASAER DWVRATKIYK ELLLSGFAPD TITLDCLCGI EKLQACREER LREEIKRAEL
EGIDLPEFER TCDVSDSPVG NLPVLIRALA DDRELEEVPG WRGFVSDALL RVLHVNNEYA
EVEDTFKYML TSDVTRTVHT YNSLLISYEA RKEWQKAGEA MTQMTSEGIV PNALTFDALI
DVCEEMGQWD RATTWLEQAQ AAGHFQCEDD LGVLDLHRIR SAGTAQNACA RIRVEE