Gene OSTLU_31869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31869 
Symbol 
ID5001995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp670087 
End bp671679 
Gene Length1593 bp 
Protein Length530 aa 
Translation table 
GC content58% 
IMG OID640417416 
Productpredicted protein 
Protein accessionXP_001418081 
Protein GI145347239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.736523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.605369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGAGG CGACGCTCGA TGATTCGCTG TGGAAGTCGC TGTGTGAAGC TCGATGGCGG 
TGGCGGGCGA GCGGGACGTG GGCTCGGAGC GCGCGGTTGC CTTTCGAGTC GGTGTTTGGC
ACGAGCGCCA ACGCCGAAAA GCGCCGAGAT ATTTTCCGCG GGGGGGAGGG CTGGCGACGG
GCTTATAAGG AGCGTTTGCA AACCCGAGGA ATCGCACGAG TGGCGGGTAA GCGCGTGATT
CGAGTTGAAT GCGACGGAGG AGTCATGCAG GCGAGCGCGC TGCAGCGCGT TCTCAACGCC
ACCTTGCCGG GCGACGTCGT TTGCTTGGGC AAAGGCACGT ACGAAGGATC GTTAACCATT
CCTCGCGGGA TCGAGATTGT GGGTGTCGAC AAACGAGAGA ACGTGCTCAT CGTGAGCGAC
GAAACTCCAG CGATGATGAC ATCGACGGCG ATGAACGCTT CCCGTTCCGT CGCGTCCGTG
GTTACCAACG TCACGCTGTT GCGACGGGGC TCTGCGAAAC GCAGCAGCTC GTCTGGATAC
GGCCACCAGG CGTGCGTGTA CGTTTCCGAC GGCTCGCGAT TGCGACTTGA CAGTTGCGAT
ATCGTGAGCG CTGGTGAAGG CGTGGTGGCG ACGGCGCAAG ACTCCGCGGT GCACGTGCAC
GCGTGTAACA TTCACTCAGT GCTGTCGTCG TTTTTAAGCA CGTCGCGACG CGGCAGCTCG
CTCACGGCGT GCAGAATCAC CGCCGCCAAG TCTAGCGTCG AAGACGCGCA CGAAGAGGAA
GTCATCGATG AGATCGAAAG CTTACCTTCG TCTTTGGGGT ACGATCGTTT GTTTGCCGCC
GTCACGGCTT TGTCGGGCCC GGTGGAGATT TGCAACAATC GCATCGTAAA CGGGTTCGCG
CATGGCGTCG TCTTGTTTGA TTGCGCGCAC GGCAATATCC ACGACAACCT CATCGCGAAC
AACGTTGGCG CGGGTATTTC CGTGGGTGTA TCTTCCACGG CAAATATTTC TAACACGATA
GTCGCGAACA ACTCCAGTGT CGGTATCGCA ATGTGCGGTC GGGGCACGAT TCGGCACTCT
GAAGTCCGAG GCAACGCATT CAACGGGATC GATATCGCGC AGCGGTACAC GAACCGCGAC
TACCTTACTG CGAGATTTGA CGCAGGTACT GACGAAGAGC TCGATCTTGA AGAAGAATTT
TCCGCATTTC TCATGGATTT AGACTCGGAT GACTTCGAAA ACGAAAAATC TTCCGAGGAG
ATTGACGTCC TCGTCGAGGG GTGCCATGTT TCCAAAAACG CGAACGACGG CGTGTGCGTG
TCTGGTGGCG CGAATGTTGA CGTGATTCAT TGTGAGATCA ACGGAAATCT GTGCAACATC
GCGATAGATC GTGGAAACGT GCGATGGAGC CGCGTGCTTG TGGAGGGAGA AAGTATGCAC
GCCGACGCGC CAAACGTGCG CGTCGCCGAG TCGCACTCGA CGTTGATCCC GATGCCAACA
AGCATCGAAG GACCGCAGTT CGTCGATGCG ACAGTCATGC CTAAGCTCCG ACGATTCATT
CCGAATCCGT CTCCGCTTAC TGTCGTGCTG TGA
 
Protein sequence
MREATLDDSL WKSLCEARWR WRASGTWARS ARLPFESVFG TSANAEKRRD IFRGGEGWRR 
AYKERLQTRG IARVAGKRVI RVECDGGVMQ ASALQRVLNA TLPGDVVCLG KGTYEGSLTI
PRGIEIVGVD KRENVLIVSD ETPAMMTSTA MNASRSVASV VTNVTLLRRG SAKRSSSSGY
GHQACVYVSD GSRLRLDSCD IVSAGEGVVA TAQDSAVHVH ACNIHSVLSS FLSTSRRGSS
LTACRITAAK SSVEDAHEEE VIDEIESLPS SLGYDRLFAA VTALSGPVEI CNNRIVNGFA
HGVVLFDCAH GNIHDNLIAN NVGAGISVGV SSTANISNTI VANNSSVGIA MCGRGTIRHS
EVRGNAFNGI DIAQRYTNRD YLTARFDAGT DEELDLEEEF SAFLMDLDSD DFENEKSSEE
IDVLVEGCHV SKNANDGVCV SGGANVDVIH CEINGNLCNI AIDRGNVRWS RVLVEGESMH
ADAPNVRVAE SHSTLIPMPT SIEGPQFVDA TVMPKLRRFI PNPSPLTVVL