Gene OSTLU_19257 details

Gene Information

Locus tag: OSTLU_19257
Symbol:
ID: 5006908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 246279
End bp: 247430
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table:
GC content: 58%
IMG OID: 640422329
Product: predicted protein
Protein accession: XP_001422938
Protein GI: 145357463
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID:
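
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is counted in the gene length but not in the protein length. The function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(246279, 247430)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values match the Gene Length and Protein Length fields listed in the record.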


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.000016644
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0048219
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCGGCG TGAGCGCGTC CAAGGAGGAC GTGCACGCGG CGATAAAGAA GGTGGACAAG 
GGATTGTTCC CGAAGGCGTT CTGTAAGATC GTGGAGGACA TCGCTGGGGA CGAGGCGTAC
TGCACGTGCA TGCACGCGGA CGGCGCGGGG ACGAAGACGA GCCTGGCGTA CGCGTACTGG
AGAGAGACGG GAGATTTGGG AGTGTGGCGA GGAATCGCGC AGGATTCGAT CGTGATGAAC
ACGGATGATT TGTTGTGCGT CGGGTGCGTG GATAACATAT TCGTCTCGAG CACGATCGGG
AGGAATAAGG CTTTGATTCC GGGGGAGGTG CTGAGCGCGC TCATCAACGG GACGGAGGAG
GTTTTGGAGA CTTTGCGCGA GTGTGGGGTC GGGGTGAAGT CCACGGGAGG TGAAACCGCG
GATTTGGGTG ATTTAGTGCG CACGGTGGTG GTGGACACCA CGGTCACGGC GCGCATGCGA
AGAGACGCGG TGGTGAGTAA CGACAACATT CGCGCCGGAG ACGTCGTCGT CGGTTTGGCG
TCGTTCGGTC AAGCGACGTA CGAGAGCGAG TACAACGGCG GAATGGGAAG CAACGGATTG
ACGTCTGCTC GACACGACGT GTTTGCGAAG AATTTGGCGG AAAAATATCC GGAAACGTTC
GATCCGAACG TACCCGAATC GTTGGTGTAC AGCGGCAAGT ATCAGCTCAC CGACGTCGAG
CCGGAAACCG GCGTGACGGT TGGCAAGTTG GTGCTCAGTC CGACGCGAAC GTACGCCCCC
GTGGTGAAAG CCGTGCTCGA TGCCATGGAC GTGAGGGATA TTCACGGCAT GGTGCACTGC
AGCGGCGGCG CGCAGTCCAA GGTTGGACAC TTTCTCCTCG ACGGCTTGCG CGTCGTCAAG
GATAACATGT TCCCCATTCC CCCGCTCTTC CGCTTGATCC AGGAGTGCTC CAACACCGAA
TGGAGCGAGA TGTACAAAGT GTTCAACTGC GGACATCGTC TCGAGTTTTA CTGCTCCCCC
GAACACGCGC AAAAGATTAT CGATATTAGC CAGAGCTTTA ACATCGACGC CCGCGTCGTC
GGCAGAGTCG AAGCCAAGGA TGGCAAGTCT GAAGTCGTGG TGAAGAGTGA ATACGGTGAG
TTTACGTATT AA
 
Protein sequence
MRGVSASKED VHAAIKKVDK GLFPKAFCKI VEDIAGDEAY CTCMHADGAG TKTSLAYAYW 
RETGDLGVWR GIAQDSIVMN TDDLLCVGCV DNIFVSSTIG RNKALIPGEV LSALINGTEE
VLETLRECGV GVKSTGGETA DLGDLVRTVV VDTTVTARMR RDAVVSNDNI RAGDVVVGLA
SFGQATYESE YNGGMGSNGL TSARHDVFAK NLAEKYPETF DPNVPESLVY SGKYQLTDVE
PETGVTVGKL VLSPTRTYAP VVKAVLDAMD VRDIHGMVHC SGGAQSKVGH FLLDGLRVVK
DNMFPIPPLF RLIQECSNTE WSEMYKVFNC GHRLEFYCSP EHAQKIIDIS QSFNIDARVV
GRVEAKDGKS EVVVKSEYGE FTY