Gene OSTLU_33376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33376 
Symbol 
ID5003586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp118169 
End bp119797 
Gene Length1629 bp 
Protein Length542 aa 
Translation table 
GC content63% 
IMG OID640419007 
Productpredicted protein 
Protein accessionXP_001419714 
Protein GI145350651 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
[COG0041] Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
[TIGR01162] phosphoribosylaminoimidazole carboxylase, PurE protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0689551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCAA TCGCGGGGGC GCCGATGGGG GTGCGGTTGA AAGCGCTGGA TCCGACGGAG 
CGCGCGCCGG CGTCGATCGC GGCGACGCAG GTGGTGGGGA GTTTTAGGGA TAAGGCGGCG
GTGAAGGCGT TCGCGGAGAC GTGCGATGTG GTGACGGTGG AGATTGAACA CATCGACGTG
GAGGCGCTGC GAGAGCTGAG CGCGGCGGGC GTGGACGTGC AGCCGACGCC GGAGACGTTG
GCGACGATTC AGGATAAGTA TGCGCAGAAG GTACACTTCA CGAACGCCGG GGTCCCGCTC
GGGCCGTACG CGGATTGCCC GAATGAAGCC GCGTTGCAGA GCGCGGCGAG CGAGTTTGGG
TTTCCGCTCA TGTTAAAGTC CAAGCGTTTG GCGTACGACG GGCGCGGGAA CGCGGTGGCG
AAGACCGCCG CGGATTTGGC CGATGCGGTG GCGAAGTTGG GTGGGTTTGA ACAAGGGTTG
TATTGCGAGA AGTGGGTGCC GTTTGAGAAG GAATTGGCGG TGATGGTGGT GCGCGCCAAG
AATGGGGAGA CGCGGGCGTA CCCCGTCGTC GAAACCGTTC ACGAAAACAA TATTTGCGAC
ACGACGACGA CGCCGGCGCC GATTCCGAAT AAAGTTGCCG AAGCGGTGCA AGCGGCGGCG
AAGCGAGCGA TCGGGTCTTT CACGGGTGCG GGGATCTTTG GCGTCGAGCT CTTCCTCCTC
AAGGATGGCT CGATTTTGCT CAACGAGTGC GCGCCGAGAC CGCACAACAG CGGTCACTAC
ACCATCGAAG GCTGCGCGTG CTCGCAGTAC GAAAATCACT TGCGCGCCAT TTTGGGTTGG
CCCTTGGGTG ACACCTCGCT CAAGGTTGGC GGGGCGGTGA TGAAGAATAT TTTGGGAGAC
GGCGACGGCG ACGAGGCCAT GGGCCGGGCG CACCGTCTCA TGGGCGCCGC GCTAGCGACT
CCCGGTGCGA GCATTCACTG GTACGAAAAG CCTGACATGA AGCTCGCGCG CAAGATGGGT
CACCTCACCG TCGTCGGCCC GAGCGCGGCG GTGGCGACGG AGCGTCTGGA CACGCTTTTG
CGCGCCGCGA GCGGGGACAA GACGCCGCCG AAGAAGGCGG CTCAAGTCGG CATCATCATG
GGTTCCGACA GTGACTTGCC CACGATGAGC GCCGCTGCGG AGGTCCTCGA ATCTTTCGGC
ATCGGTTGCG AAGTCACCGT CATTTCGGCG CACAGAACGC CCGAGCGTAT GAACGAGTAC
GCGAGAAGCG CGCACACGCG CGGCTTGCGC GCCATCATCG CCGGCGCCGG CGGCGCCGCG
CACTTACCCG GCATGGTCGC CGCCATGACC CCTCTTCCCG TTATCGGCGT CCCAGTCCCG
CTCAAGTATC TCGACGGCAT GGATTCCTTG CTCTCCATCG TTCAAATGCC CAAAGGCGTT
CCCGTGGCGA CGGTGGCCAT CGGCAACAGC GCCAACGCCG GTCTCATCGC CGCTCGCATC
GTCGCCGCGT TCGAGCCCGA CGTGTGCTCT AAAATGTTAG CGTACCAAGA CGACATGGAG
AACGTCGTGT TGAACAAGGC GAGCAAGCTC GAAGAGCTCG GTTACGGCGC CTATCTCGAC
CAAATGTAA
 
Protein sequence
MLAIAGAPMG VRLKALDPTE RAPASIAATQ VVGSFRDKAA VKAFAETCDV VTVEIEHIDV 
EALRELSAAG VDVQPTPETL ATIQDKYAQK VHFTNAGVPL GPYADCPNEA ALQSAASEFG
FPLMLKSKRL AYDGRGNAVA KTAADLADAV AKLGGFEQGL YCEKWVPFEK ELAVMVVRAK
NGETRAYPVV ETVHENNICD TTTTPAPIPN KVAEAVQAAA KRAIGSFTGA GIFGVELFLL
KDGSILLNEC APRPHNSGHY TIEGCACSQY ENHLRAILGW PLGDTSLKVG GAVMKNILGD
GDGDEAMGRA HRLMGAALAT PGASIHWYEK PDMKLARKMG HLTVVGPSAA VATERLDTLL
RAASGDKTPP KKAAQVGIIM GSDSDLPTMS AAAEVLESFG IGCEVTVISA HRTPERMNEY
ARSAHTRGLR AIIAGAGGAA HLPGMVAAMT PLPVIGVPVP LKYLDGMDSL LSIVQMPKGV
PVATVAIGNS ANAGLIAARI VAAFEPDVCS KMLAYQDDME NVVLNKASKL EELGYGAYLD
QM