Gene OSTLU_29618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29618 
Symbol 
ID5006682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp534310 
End bp536599 
Gene Length2290 bp 
Protein Length744 aa 
Translation table 
GC content59% 
IMG OID640422103 
Productpredicted protein 
Protein accessionXP_001422624 
Protein GI145356823 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAAGC CGAAGCCGAA GGAGCGGGGG GATCGGGAAC AGCGGAATCT GAACGAACAT 
CACGCGCGAA GGGAACAACA GCGCGCGCGG CACGATGGCG CGGTGAACGG CGCGGGCGGT
GGGAAACGGG GGGGGGCGAG AGATTTGGAG GATTTGATCG CGCGCGTCGA GGACGCCGAG
GACGCTGCGG CGGTGGAGCG CGTGTTGCGG CAATCCGTCT TCAAACCGGG CGCACCGGCG
TACACGACCG TAATCAAGGC GTGTGGGAAA GCGTCGCAGT GGGAAAAAGC GCTCAAGGCG
TACGAGATGA TGACGAAGTT ACACGGCGCC AAGCCGAACA CGATAACGTT TTCGGCGTTG
ATCAACGCGC TGGGCAAGGC AAAACAGTGC GACAGGGCGT TTCAAATCTT CGACGAGATG
AATGATCGAG GAATCGAGCC GAACATTTTC ACTTACTCCG CACTTTTGGG CGCGTGTGCC
AGGGCGAAGC AGTACCAGAG AGCGATGGAC ATCTTTCAAG ACTTGATTGA AAATCATCGC
GACGTCGAAG TCGATCGAAT CACGTACGGC GCCGCGATTC AGTGTTGCGT GCAAGGTCGG
CGAGCGGACA AGGCAATTGA AATTTTCGAG CGAATGCTTG CGTCTGGAAT CAAGGGGAAC
ATCATCACCT ACAACGCCGT GCTGATGGCG TGCGAGAAGA GCGGCGACTC GGATGGGGCG
CAGGAGATGT TTGAACGCAT GAGCGCCGAG CAAGTTCCGA TGGATCGCGC GACGTTTCAC
GCCATGATTG GCGCGTGCGA TCGCGCTCGA CAGCTTTCCA AAGTGATGGA CTACTATCAC
ATGATGCCAG AGCGCGGCGT TACGCCAGAT GCGGGGACTG TGTCGAACGT TTTGATGGCG
TGCAGTAACG CCAAGGACCC TTTAACGGCG ATTAAAGTGT ACCACGACGC TCGAGCGAAG
TTTGACGTCC ATCGCACGCC CGGCATGTTC AACAACCTCA TCGGTGCGTT ACACCGTGGC
AAGCGATATG ACTTGGTTTA CGAACAGCTG TTTCACGATA GTATGACGCA AAGTGCGCTC
TCGGTGTCGA CGTACGTTCA TCTCATGATG GCGTGCGAGC GATTCGGTAA CTGGGAGAAA
TCTTTTCAAC TTTTTGAAGA GTTCAAAAGA CACTCTCCTT CGAGCGTAGA TTCATACGTC
TGGACGCGAG TGTTTTACGC GTGCGGACGA TTTGAAAGTG ACGATCCGGT ATGGAAGAAT
GGAGCCATTC CTGGGTACAC GAGTCCGACG CTCCAACGCG CGCAAGCCAC GATGCGAGCG
CTTTGGGCCG AGTACAAGTC GAAATTGAGC GTGTTTAAAG AAATCAGCGC GGTGGCGGCG
CGAGTCAGCA GAGCTGGTAG CAATCATCCG TCGGCACAAA GTGACGTCGA CGCGTCTCTC
GCTTTGGCGT CGGCGAAGCA TTCAGAGCTC GCGCAGACGC ACGAGGCGGC GCCCGCCGCG
GAGTCAGCCG TCGAACAAGA ATTCGTGCAA TTGATCAACG CATACAGATC CGCCGCGGCG
GCGGCAGCGC GATGCTCTGA TCCGAGCACG GTTCTAGATA TCCTTCGCAC GTGCGAGTAC
GACGGAATTC CACAGGATTC GGTGATTTAC GGCGCTTTCG TCGCCGCGCT CGCCCTCAAC
GGCGACCGCG TGGGGGCGAA TGAAAAGTTC AGAGAGATGT TCACGTTGGG ATTACAGCCG
GGCGTGGCGG TGTACGCTGC GCTCGCGCGA GCTGCTGCAC GAGCGGGTGA CGCTAATACC
GCGCTCAGTC TCGCCGAGGA TGTGAGATCC ATGGTCGGAA GCTCGGTGGG CGACGGCGCC
ATCATCGACG CGGTGGTGGC GGCGTGCGAG GCCGGAGGGG ACTGGACGCG CGGCGCGAGT
TTGTTCACAA TGTGGGCGGC AAATGGCGTC CAGGGGGCGA ACGACGAATT GCGACGTGCA
ATCGCGGCGA GCAGTGGCCT CGATCCGGGG CCCACGAGAC GGGGCACAAG TGTGTACAAC
GCTTTCCGAC GCGAACCGCC GGCTGAGTCC CAAGGCGTCG CCGCCCCGCG AGATGCAAAA
ACGACGACGC TCACGGCGAA AGTTCAACCG TTCGTCCCGC GGGGAAGACC TTCGGTGCTG
CGTGACGACG CAAAATCGAG CGAATCGAAG TCACCGCCAG ATGAAGGAGA AAATCCTCCG
GACGAAGCTT TGTGAATGGG CGCCGCGGCG TCTGACAACC GACGAGTCGA GTCGTTTCAT
TCACCGACGC
 
Protein sequence
MVKPKPKERG DREQRNLNEH HARREQQRAR HDGAVNGAGG GKRGGARDLE DLIARVEDAE 
DAAAVERVLR QSVFKPGAPA YTTVIKACGK ASQWEKALKA YEMMTKLHGA KPNTITFSAL
INALGKAKQC DRAFQIFDEM NDRGIEPNIF TYSALLGACA RAKQYQRAMD IFQDLIENHR
DVEVDRITYG AAIQCCVQGR RADKAIEIFE RMLASGIKGN IITYNAVLMA CEKSGDSDGA
QEMFERMSAE QVPMDRATFH AMIGACDRAR QLSKVMDYYH MMPERGVTPD AGTVSNVLMA
CSNAKDPLTA IKVYHDARAK FDVHRTPGMF NNLIGALHRG KRYDLVYEQL FHDSMTQSAL
SVSTYVHLMM ACERFGNWEK SFQLFEEFKR HSPSSVDSYV WTRVFYACGR FESDDPVWKN
GAIPGYTSPT LQRAQATMRA LWAEYKSKLS VFKEISAVAA RVSRAGSNHP SAQSDVDASL
ALASAKHSEL AQTHEAAPAA ESAVEQEFVQ LINAYRSAAA AAARCSDPST VLDILRTCEY
DGIPQDSVIY GAFVAALALN GDRVGANEKF REMFTLGLQP GVAVYAALAR AAARAGDANT
ALSLAEDVRS MVGSSVGDGA IIDAVVAACE AGGDWTRGAS LFTMWAANGV QGANDELRRA
IAASSGLDPG PTRRGTSVYN AFRREPPAES QGVAAPRDAK TTTLTAKVQP FVPRGRPSVL
RDDAKSSESK SPPDEGENPP DEAL