Gene OSTLU_50201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50201 
Symbol 
ID5003343 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp124919 
End bp128131 
Gene Length3213 bp 
Protein Length1043 aa 
Translation table 
GC content58% 
IMG OID640418764 
Productpredicted protein 
Protein accessionXP_001419289 
Protein GI145349746 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00844273 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCGCGC CTCGACGTAA AATCACGGGC CGAAGCGTGG TTCTCACCAC CAACGCTCTC 
GAGCCGCTCA TCTGCCACTG GGGTGTCGCC AAGGAAGAGC CCGGGGAATG GGTATTGGCT
CCAAAGCTCG TGCATCCCGC CGGTACGGAG GCGGTGTCAC ACATGAGCTG CGAGACGCCG
CTCGAGGAAT TCACCGGTTG CTTTCCGGCG GATCGCGTAT CGTCGACACC GTCCATGGAT
GACGCAGATC TGTTCGACGA GTGCGCTTAC ATGCAAAGAC TCGAGTTCAA TCTTCCGGGT
GATGGCCCAA ACGAGCTCAT GGGGTTGCAT TTCGTCTTTC GCAACGTTGA CGGGACCGCG
TGGTACAAGG ACACGTCCAA CGGCAACGCC AATTATCACG CGTCGTGCAT ACGAGTCAAG
GAAGAAGATC GCGTCGCGGA TGAGCTCGTG GATACCATCA CGCGCGCCGA AGCTGGAGGA
TCGTGGTGGT CGCTCATGCA TCGTTTTAAT CTCGCGCGAT CGCTTTTGGA AAAATATTGC
GTCCAGGGCG AAGACTCGGC CAAGGCGCAA AGCGCGGCGG CTAAAATTTT TGTATGGCTT
CGATACAGCT CCATTCGACA GTTGACCTGG CAACGGAACT ACAACGTCAA ACCACGAGAA
CTGTCGAGCG CGCAATCTCG TCTGACGCAC ATGCTCGCAG AAATTTACGT CACAAAGCCG
CATTTACGCG ATATGGCGCG TCTCATGCTC GGCACCGTGG GCAAGGGCGG CGAGGGTGGA
CAAGGCCAAC AAATTCGCGA CGAAATCTTG AACATTATGC ACCGAAACAA CATTAAGGAG
GTTAAGGGCA TTTGGATGGA GGAGTGGCAT CAAAAGCTAC ACAATAATAC TACTCCAGAC
GATATCATTA TCTGTCAAGC CTACTTGGAC TTTTTACGAT CCGACGGTAA TTTGGGGGCG
TACTGGCATA CTCTCTCCGA AGGCGGCGTC ACCAAGGAAC GGTTGGAATC TTTCGAGCGT
CCCGTGAGAA GTGAACCGAT CTGGCGACCA AATATTAAGG ATAATCTCAT TCGAGACTTC
GAAAACTATT TAAAGATTCT CAAATCCGTG CATAGCGGTG CAGACTTGGC GGAGAGTTAC
GACGCGTGCC GCAGTCGTTT ATCCGACGTC ACGCGAGGCG CAGTCGAGTA CGTCATCGCG
CAACAATCGA GCGGTGATAT CTTTCCCGTC GTGAACGCGT GCTTGGAGGC TCGACACGGT
TTACGTGATG CTGGTTTGGG CGATCCTTCG GATGCGCCAT GGTGTCGCGA GTTGCTGTAT
CTCGACTTGT CCATCGCCGA CATCAGCAAT CGCGCGATTC AGCGCGGCTC GGACGGCGTG
ACGGATACGG AGGGTTTGTT AGAGCTGACG GACATGGTCC TCGAAGATTT GTGTCTGTCT
CTACCATCTA CAAACGATGA CTTGCTGTAT AGTTTGATGA ACTGGCGACG CATCCGCGAC
TTACAGCGCG CTGGCGACGC CGCCTGGGCG CTTCGAGCAA AGGCGACGGT CGACCGCGTT
CGTCTCGCCG TCACCGAACA CGCCGTGGCG ATTTCGGATA GCATGCAACC GGCGGCGCAT
ACCATCGGCA CTCGATGCGA TTGCGATAAA TGGGTGGTCG ATCTCTTTTC AGAAGAAGTC
ATTCGCGGCG GTCCGGCGTT CGCGCTTTCG CTGATGCTTA CAAGACTCGA CCCGTACTTG
CGCCGCGAGG CGAACATGGG CGATTGGCAA ATTATCTCTC CCGCGACGTG CGCCGGAGTT
GTTGCGCACG TGAAAACCCT CGCAGAGGTG ATGAACGAAA CATTCAAAAC CCCAACCGTG
TTGGTTTGCG ATCACGTAGG CGGAGGCGAG GAGATACCTT CCGGCGCAAT CGCCGTGCTC
ACGGGATCCT CTGTCGACGT GCTGTCGCAC AGCGCCGTTC GAGCGCGCAA CGGCGGCGTT
CTGTTCGCCA CTTGCTACGA TCCGACTCTC TTGGACAAGT TTTCTGGGAT GAACAAAAAG
GCAGTCAAGT TACACGTCAC CGCGGACGAG TGCGTGGCGT TCGATGAGAT TGCATTTGAG
AACATTGGCA AGGAAAACTC GGCAGACGGC GCCTCGCACA ACGGTGACGC GCAGCGAATC
AACATTAAAG CCATCGATTT CGCGGGCGAT TTCGCCGTTT CAATGGAAGA CTTCCGCGAG
GATCTCGTTG GCGCGAAAGC GCGCAATACG AAGGCGCTGC GCGACGCCTT GAAGAATGGC
GGGATTCCCG ATTGGATCAA CCTCCCGGTG TCCGTGGCGA TTCCGTTCGG CACGTTTGAG
CACGTCCTCG CGAGACCGGA GAATGAAAAG CAAGCCGAGA CACTCAACAA GCTTTTGAGT
GAAATCGATG ACACTACTGG TGTGACGCTC TCCGCGTCAC TACGATCGTG CCGTCGTTGC
GTTCGCACCA TCGTACCCCC CGCTGGGATG CTCGAAAAAC TCGCCGCCGT CATGCGAAGC
GGCGGGTTAA CCCCTCCGGA GGACGACGAC GCGTGGGAGC TCGCGTGGAA AGCCATCTGT
GACGTGTGGG CATCAAAGTG GAACGAGCGC GCGTTTGTCA GCATGCGTAA TCGCGGTCTC
GATCACAATA ATCTGCGCAT GTCTGTGCTC GTGCAACCCG TCATCAACGC CGATCACGCC
TTCGTGATTC ACACCGTGAA TCCGAGCACG AACGCCGCCG ACGAGCTCTA CGCCGAAGTC
GTCCAAGGCA TGGGCGAAAC GCTCGTCGGG AACTATCCTG GGCGCGCGCT GAGTTTTACG
GTGAAGAAGA CGCCCGACGG CGACGTCTCA CCGCCGGAAA TCGCTGGATT CCCATCCAAG
AACACCGTCC TTCGCGTCCC CCGCGAGACG CTCATCTTCC GCTCCGACTC CAACGGCGAA
GACCTCGAAG GCTACGCCGG CGCTGGCCTG TACGAATCCG TCCCCATGCA CGCCACCGTG
GAACACCACG CCGATTACGC GTCTGATCCG ATGATATGGG ACCACGACGC GAGCCAGGCC
ACGCTCGCCG CCGTCGCCCG CGCCGGCGCC GCCATCGAGC GCGCGCTCGG CGCTCCCCAA
GACGTCGAGG GCGTCGTCCG CGACGGCCGC GTCTTCGTCG TCCAAACCCG TCCGCAGGTC
TAGCCTCCTG TATTTCATCG CCTCGCGCCG TCC
 
Protein sequence
MSAPRRKITG RSVVLTTNAL EPLICHWGVA KEEPGEWVLA PKLVHPAGTE AVSHMSCETP 
LEEFTDLFDE CAYMQRLEFN LPGDGPNELM GLHFVFRNVD GTAWYKDTSN GNANYHASCI
RVKEEDRVAD ELVDTITRAE AGGSWWSLMH RFNLARSLLE KYCVQGEDSA KAQSAAAKIF
VWLRYSSIRQ LTWQRNYNVK PRELSSAQSR LTHMLAEIYV TKPHLRDMAR LMLGTVGKGG
EGGQGQQIRD EILNIMHRNN IKEVKGIWME EWHQKLHNNT TPDDIIICQA YLDFLRSDGN
LGAYWHTLSE GGVTKERLES FERPVRSEPI WRPNIKDNLI RDFENYLKIL KSVHSGADLA
ESYDACRSRL SDVTRGAVEY VIAQQSSGDI FPVVNACLEA RHGLRDAGLG DPSDAPWCRE
LLYLDLSIAD ISNRAIQRGS DGVTDTEGLL ELTDMVLEDL CLSLPSTNDD LLYSLMNWRR
IRDLQRAGDA AWALRAKATV DRVRLAVTEH AVAISDSMQP AAHTIGTRCD CDKWVVDLFS
EEVIRGGPAF ALSLMLTRLD PYLRREANMG DWQIISPATC AGVVAHVKTL AEVMNETFKT
PTVLVCDHVG GGEEIPSGAI AVLTGSSVDV LSHSAVRARN GGVLFATCYD PTLLDKFSGM
NKKAVKLHVT ADECVAFDEI AFENIGKENS ADGASHNGDA QRINIKAIDF AGDFAVSMED
FREDLVGAKA RNTKALRDAL KNGGIPDWIN LPVSVAIPFG TFEHVLARPE NEKQAETLNK
LLSEIDDTTG VTLSASLRSC RRCVRTIVPP AGMLEKLAAV MRSGGLTPPE DDDAWELAWK
AICDVWASKW NERAFVSMRN RGLDHNNLRM SVLVQPVINA DHAFVIHTVN PSTNAADELY
AEVVQGMGET LVGNYPGRAL SFTVKKTPDG DVSPPEIAGF PSKNTVLRVP RETLIFRSDS
NGEDLEGYAG AGLYESVPMH ATVEHHADYA SDPMIWDHDA SQATLAAVAR AGAAIERALG
APQDVEGVVR DGRVFVVQTR PQV