Gene OSTLU_14033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_14033 
Symbol 
ID4999406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp1098631 
End bp1100190 
Gene Length1560 bp 
Protein Length519 aa 
Translation table 
GC content62% 
IMG OID640414827 
Productpredicted protein 
Protein accessionXP_001416026 
Protein GI145341871 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0882668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAC TCGCGAGAGG GCTGGCGGGA TTGGGGTACG AAATCGTGTC CACGGGCGGG 
AGCGCGAAGG CGATCGAGGC GAGCGGGACG GCGGTGACGA GCGTGGACGC GGTGACGGGA
TTCCCTGAGA TGCTCGATGG ACGCGTGAAG ACGCTGCATC CGGGCGTGCA CGGAGGTATA
CTGGCGAAAC GTGAGGACGC GTCGCACATG GAGGCGATCG CGAAGCATGG GATCGATACC
ATCGACGTCG TGGCGGTGAA CCTGTACCCG TTTCGGGAAA CCGTGGCGGG CGGAGGAGAT
TTCGCGCAGT GCGTGGAGAA CATCGATATC GGTGGACCGG CGATGATTCG GGCGGCGGCG
AAGAACCACC CGCACGTGTA CGTCGTCGTC GATCCGAACG ATTACGAAAA GTTGATTGAA
CATTTGAAGA GTTCGCCGAG CGCGGCGGAC GACTTGAAGT TCAAGCGCGA GTTGGCGTGG
AAGGCGTTCC AACACTGCGC CTCGTACGAC TCCGTGGTTT CGGAATACTT GTGGAGTCAA
ATCGGCGAGG GTGCCCCAGC GCCCGAGCTT TCGGTGCCGA TGACGCTCAC GGCGGCGCTG
CGATACGGGG AGAACCCGCA CCAACCCGCC GCGGTATACG CGGATGGTTC GCTCGCCGAG
TCTACGGGCG AGGGCGTGGC GCGATCGATC CAGCATCACG GCAAGGAAAT GAGTTACAAC
AACTATCTCG ATGCCGACGC CGCGTACGGG TGCGCGTGCG ATTACCCGAC GAGCGATCCG
ACGTGCGTCA TCGTCAAGCA CACCAACCCG TGCGGCATCG CGAGCGCGAG CGGCGCCAAT
GGCGATTTGC TCGAAGCTTA CCGCATGGCG GTTCGCGCCG ATCCGATCTC CGCGTTCGGT
GGCATCGTCG CCTTCAACTG TACGGTCGAT GCGGACATGG CGCGAGAAAT TCGCGAGTTC
CGCTCTCCCA CCGACGACGA GACGCGCATG TTTTATGAAA TCGTCATCGC TCCGTCCTAC
ACTCCCGAAG GTTTGGAGGT GTTGAAGGGT AAGTCCAAGA CGCTGCGCAT CTTGGAAACC
AAGCCGCGCA CGGGATCAAC GAAGAGCTTG CGACAAGTCG GCGGTGGCTG GCTCGAGCAA
GCCTCGGACT CTCTCGTGCC CGAGGACATT ACGTTTGAAG CCGTCTCTGA CGTCAAACCC
ACGCCAGAGC AACTCGAAGC CTTGAAGTTT GCCTGGCGCG CGGTCAAGCA CGTCAAGTCC
AACGCCATCA CCGTCGCCAC CACCGGTCGA CTTCTCGGCA TGGGCTCCGG CCAGCCCAAC
CGCGTGAACT CTGTTCGCAT CGCCCTCGAA AAGGCGGGCG AAGAGGCGCA GGGCGCCGTT
CTCGCCTCGG ACGCCTTCTT CCCTTTCGCT TGGGGCGATT CCGTGGAGAT CGCGTGTCAG
GCCGGCATCA AAGCCATCGC CCATCCCGGC GGTTCCATGC GCGACCAAGA CGCCGTGGAC
GTGTGCAACA AGTACGGCGT CGCCCTCGTC ACCACCGGTC ATCGACACTT CCGTCACTAG
 
Protein sequence
MNELARGLAG LGYEIVSTGG SAKAIEASGT AVTSVDAVTG FPEMLDGRVK TLHPGVHGGI 
LAKREDASHM EAIAKHGIDT IDVVAVNLYP FRETVAGGGD FAQCVENIDI GGPAMIRAAA
KNHPHVYVVV DPNDYEKLIE HLKSSPSAAD DLKFKRELAW KAFQHCASYD SVVSEYLWSQ
IGEGAPAPEL SVPMTLTAAL RYGENPHQPA AVYADGSLAE STGEGVARSI QHHGKEMSYN
NYLDADAAYG CACDYPTSDP TCVIVKHTNP CGIASASGAN GDLLEAYRMA VRADPISAFG
GIVAFNCTVD ADMAREIREF RSPTDDETRM FYEIVIAPSY TPEGLEVLKG KSKTLRILET
KPRTGSTKSL RQVGGGWLEQ ASDSLVPEDI TFEAVSDVKP TPEQLEALKF AWRAVKHVKS
NAITVATTGR LLGMGSGQPN RVNSVRIALE KAGEEAQGAV LASDAFFPFA WGDSVEIACQ
AGIKAIAHPG GSMRDQDAVD VCNKYGVALV TTGHRHFRH