Gene OSTLU_17852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17852 
Symbol 
ID5004961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp281039 
End bp282190 
Gene Length1152 bp 
Protein Length383 aa 
Translation table 
GC content58% 
IMG OID640420382 
Productpredicted protein 
Protein accessionXP_001420956 
Protein GI145353300 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.00377109 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.168751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGCG TGAGCGCGTC CAAGGAGGAC GTGCACGCGG CGATAAAGAA GGTGGACAAG 
GGATTGTTCC CGAAGGCGTT CTGTAAGATC GTGGAGGACA TCGCTGGGGA CGAGGCGTAC
TGCACGTGCA TGCACGCGGA CGGCGCGGGG ACGAAGACGA GCCTGGCGTA CGCGTACTGG
AGAGAGACGG GAGATTTGGG AGTGTGGCGA GGAATCGCGC AGGATTCGAT CGTGATGAAC
ACGGATGATT TGTTGTGCGT CGGGTGCGTG GATAACATAT TCGTCTCGAG CACGATCGGG
AGGAATAAGG CTTTGATTCC GGGGGAGGTG CTGAGCGCGC TCATCAACGG GACGGAGGAG
GTTTTGGAGA CTTTGCGCGA GTGTGGGGTC GGGGTGAAGT CCACGGGAGG TGAAACCGCG
GATTTGGGTG ATTTAGTGCG CACGGTGGTG GTGGACACCA CGGTCACGGC GCGCATGCGA
AGAGACGCGG TGGTGAGTAA CGACAACATT CGCGCCGGAG ACGTCGTCGT CGGTTTGGCG
TCGTTCGGTC AAGCGACGTA CGAGAGCGAG TACAACGGCG GAATGGGAAG CAACGGATTG
ACGTCTGCTC GACACGACGT GTTTGCGAAG AATTTGGCGG AAAAATATCC GGAAACGTTC
GATCCGAACG TACCCGAATC GTTGGTGTAC AGCGGCAAGT ATCAGCTCAC CGACGTCGAG
CCGGAAACCG GCGTGACGGT TGGCAAGTTG GTGCTCAGTC CGACGCGAAC GTACGCCCCC
GTGGTGAAAG CCGTGCTCGA TGCCATGGAC GTGAGGGATA TTCACGGCAT GGTGCACTGC
AGCGGCGGCG CGCAGTCCAA GGTTGGACAC TTTCTCCTCG ACGGCTTGCG CGTCGTCAAG
GATAACATGT TCCCCATTCC CCCGCTCTTC CGCTTGATCC AGGAGTGCTC CAACACCGAA
TGGAGCGAGA TGTACAAAGT GTTCAACTGC GGACATCGTC TCGAGTTTTA CTGCTCCCCC
GAACACGCGC AAAAGATTAT CGATATTAGC CAGAGCTTTA ACATCGACGC CCGCGTCGTC
GGCAGAGTCG AAGCCAAGGA TGGCAAGTCT GAAGTCGTGG TGAAGAGTGA ATACGGTGAG
TTTACGTATT AA
 
Protein sequence
MRGVSASKED VHAAIKKVDK GLFPKAFCKI VEDIAGDEAY CTCMHADGAG TKTSLAYAYW 
RETGDLGVWR GIAQDSIVMN TDDLLCVGCV DNIFVSSTIG RNKALIPGEV LSALINGTEE
VLETLRECGV GVKSTGGETA DLGDLVRTVV VDTTVTARMR RDAVVSNDNI RAGDVVVGLA
SFGQATYESE YNGGMGSNGL TSARHDVFAK NLAEKYPETF DPNVPESLVY SGKYQLTDVE
PETGVTVGKL VLSPTRTYAP VVKAVLDAMD VRDIHGMVHC SGGAQSKVGH FLLDGLRVVK
DNMFPIPPLF RLIQECSNTE WSEMYKVFNC GHRLEFYCSP EHAQKIIDIS QSFNIDARVV
GRVEAKDGKS EVVVKSEYGE FTY