Gene OSTLU_34749 details


Gene Information

Locus tag: OSTLU_34749
Symbol:
ID: 5003770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 343510
End bp: 344631
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table:
GC content: 56%
IMG OID: 640419191
Product: predicted protein
Protein accession: XP_001419564
Protein GI: 145350332
COG category: [S] Function unknown
COG ID: [COG2850] Uncharacterized conserved protein
TIGRFAM ID:
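
The coordinate and length fields above agree with one another, and a short sketch can show how. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon that is not counted in the protein length. The record does not state either convention, and the function name `lengths_from_coords` is hypothetical.

```python
def lengths_from_coords(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not translated into an amino acid.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    codons = gene_length // 3             # complete codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the record
gene_length, protein_length = lengths_from_coords(343510, 344631)
print(gene_length, protein_length)
```

Under these assumptions the result matches the Gene Length and Protein Length fields listed for this record.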


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.533757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCCGA AAAGCGCTGT CTTGGCGAGA CAACCTGCGA GGACGTCGGG AACGAATGTG 
AGACGCGTTC ATCGTCCGTT GCCCGCGGAC TTCAAGAACA CGATCGCCGC GCAGCGCGTC
CCGGCTGTCA TCAGTGGACT CGATATCGGC CAAGCGCCGT GGACGTGGAC GCCAAGTTAT
CTGGCTTCCC TCGACGGCGT TCCAGAGAAG CTTGTGAGCG TTCACGTCAG TCGTGATCCC
AAGCTGGACT TCGTGCGCAA AAACTTCAAA TACGTCGTGA TGCCTTTTGG TGAACTACTG
GCGAAAGTGA ACGATGCGAG CGATGACAAT TTCTACTATT TGCGAAGCAT TGGGGAGAAT
CCGCGAAAAG AGCCGGCGCA CGCGCTTCTA CAATTCCCAT CGTTTGCGCG CGATTTGAAA
CTTCCGAGCG AGTTTTGGGG ATCCGAAGAC AACTACTTCA GCGCCGTCGT TCGCGTGAGC
AGTGGCGATT TACAGCTCTG GACGCATTAT GACGCCATGG ATAACATGTT GATTCAGCTT
CATGGCGAGA AGCGTGTGCT TCTGTTCCCA CCGTCCGTGT CAGGCGACTT ATATCTTGAA
GGTTCGTCAT CCGTCGTCCG CGACGTGGAC GATCACGATC GAGAATCGTT CCCACGATTC
GCGCGCGCTC GAAAAGCGGC GTTGGAAGTC ATCTTACAAC CAGGTGACGT ATTGTACATC
CCCGCGCTTT GGGCGCACCA CGTCACCGCC TTGCACGGCC CGTCGATTGC GCTCAACGTA
TTTTTCCGAC ACCTCCCCAC GAGTGGATAC CCATCGAAAG ATTTGTACGG GAACGCCGAC
CCAATCGCGG CTGCGAGTGC GCTCAAATCA ATAAACTCCG CGATCGAATC CTTGAAAGAG
TTGCCGCTAG ATTACCGTGT ATTTTACGCT GGCGTCGCAG CGGCGAGACT GGAGAGTGAG
CTTGGCGTCG AATCCGCTCG AAGAGCGCTT GCAACGGTGA ACGACGACAC GCCGAAATCG
CGCGGGATGA ATTCGCGCGC AACAAAAGGT ACAGGCGTGG TCGGCACAGT CTTATCTGCT
CTTGCATGCC TGCTCATTAC ACGCCGCGCT TCGCGGAAGT GA
 
Protein sequence
MAPKSAVLAR QPARTSGTNV RRVHRPLPAD FKNTIAAQRV PAVISGLDIG QAPWTWTPSY 
LASLDGVPEK LVSVHVSRDP KLDFVRKNFK YVVMPFGELL AKVNDASDDN FYYLRSIGEN
PRKEPAHALL QFPSFARDLK LPSEFWGSED NYFSAVVRVS SGDLQLWTHY DAMDNMLIQL
HGEKRVLLFP PSVSGDLYLE GSSSVVRDVD DHDRESFPRF ARARKAALEV ILQPGDVLYI
PALWAHHVTA LHGPSIALNV FFRHLPTSGY PSKDLYGNAD PIAAASALKS INSAIESLKE
LPLDYRVFYA GVAAARLESE LGVESARRAL ATVNDDTPKS RGMNSRATKG TGVVGTVLSA
LACLLITRRA SRK