Gene Information |
Locus tag | OSTLU_33376 |
Symbol | |
ID | 5003586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 118169 |
End bp | 119797 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 63% |
IMG OID | 640419007 |
Product | predicted protein |
Protein accession | XP_001419714 |
Protein GI | 145350651 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase); [COG0041] Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein; [TIGR01162] phosphoribosylaminoimidazole carboxylase, PurE protein |
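
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 118169
end_bp = 119797

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1629, matching "Gene Length"
print(remainder)       # 0, the CDS is a whole number of codons
print(protein_length)  # 542, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1629 bp) and Protein Length (542 aa) fields.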
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0689551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGCAA TCGCGGGGGC GCCGATGGGG GTGCGGTTGA AAGCGCTGGA TCCGACGGAG CGCGCGCCGG CGTCGATCGC GGCGACGCAG GTGGTGGGGA GTTTTAGGGA TAAGGCGGCG GTGAAGGCGT TCGCGGAGAC GTGCGATGTG GTGACGGTGG AGATTGAACA CATCGACGTG GAGGCGCTGC GAGAGCTGAG CGCGGCGGGC GTGGACGTGC AGCCGACGCC GGAGACGTTG GCGACGATTC AGGATAAGTA TGCGCAGAAG GTACACTTCA CGAACGCCGG GGTCCCGCTC GGGCCGTACG CGGATTGCCC GAATGAAGCC GCGTTGCAGA GCGCGGCGAG CGAGTTTGGG TTTCCGCTCA TGTTAAAGTC CAAGCGTTTG GCGTACGACG GGCGCGGGAA CGCGGTGGCG AAGACCGCCG CGGATTTGGC CGATGCGGTG GCGAAGTTGG GTGGGTTTGA ACAAGGGTTG TATTGCGAGA AGTGGGTGCC GTTTGAGAAG GAATTGGCGG TGATGGTGGT GCGCGCCAAG AATGGGGAGA CGCGGGCGTA CCCCGTCGTC GAAACCGTTC ACGAAAACAA TATTTGCGAC ACGACGACGA CGCCGGCGCC GATTCCGAAT AAAGTTGCCG AAGCGGTGCA AGCGGCGGCG AAGCGAGCGA TCGGGTCTTT CACGGGTGCG GGGATCTTTG GCGTCGAGCT CTTCCTCCTC AAGGATGGCT CGATTTTGCT CAACGAGTGC GCGCCGAGAC CGCACAACAG CGGTCACTAC ACCATCGAAG GCTGCGCGTG CTCGCAGTAC GAAAATCACT TGCGCGCCAT TTTGGGTTGG CCCTTGGGTG ACACCTCGCT CAAGGTTGGC GGGGCGGTGA TGAAGAATAT TTTGGGAGAC GGCGACGGCG ACGAGGCCAT GGGCCGGGCG CACCGTCTCA TGGGCGCCGC GCTAGCGACT CCCGGTGCGA GCATTCACTG GTACGAAAAG CCTGACATGA AGCTCGCGCG CAAGATGGGT CACCTCACCG TCGTCGGCCC GAGCGCGGCG GTGGCGACGG AGCGTCTGGA CACGCTTTTG CGCGCCGCGA GCGGGGACAA GACGCCGCCG AAGAAGGCGG CTCAAGTCGG CATCATCATG GGTTCCGACA GTGACTTGCC CACGATGAGC GCCGCTGCGG AGGTCCTCGA ATCTTTCGGC ATCGGTTGCG AAGTCACCGT CATTTCGGCG CACAGAACGC CCGAGCGTAT GAACGAGTAC GCGAGAAGCG CGCACACGCG CGGCTTGCGC GCCATCATCG CCGGCGCCGG CGGCGCCGCG CACTTACCCG GCATGGTCGC CGCCATGACC CCTCTTCCCG TTATCGGCGT CCCAGTCCCG CTCAAGTATC TCGACGGCAT GGATTCCTTG CTCTCCATCG TTCAAATGCC CAAAGGCGTT CCCGTGGCGA CGGTGGCCAT CGGCAACAGC GCCAACGCCG GTCTCATCGC CGCTCGCATC GTCGCCGCGT TCGAGCCCGA CGTGTGCTCT AAAATGTTAG CGTACCAAGA CGACATGGAG AACGTCGTGT TGAACAAGGC GAGCAAGCTC GAAGAGCTCG GTTACGGCGC CTATCTCGAC CAAATGTAA
|
Protein sequence | MLAIAGAPMG VRLKALDPTE RAPASIAATQ VVGSFRDKAA VKAFAETCDV VTVEIEHIDV EALRELSAAG VDVQPTPETL ATIQDKYAQK VHFTNAGVPL GPYADCPNEA ALQSAASEFG FPLMLKSKRL AYDGRGNAVA KTAADLADAV AKLGGFEQGL YCEKWVPFEK ELAVMVVRAK NGETRAYPVV ETVHENNICD TTTTPAPIPN KVAEAVQAAA KRAIGSFTGA GIFGVELFLL KDGSILLNEC APRPHNSGHY TIEGCACSQY ENHLRAILGW PLGDTSLKVG GAVMKNILGD GDGDEAMGRA HRLMGAALAT PGASIHWYEK PDMKLARKMG HLTVVGPSAA VATERLDTLL RAASGDKTPP KKAAQVGIIM GSDSDLPTMS AAAEVLESFG IGCEVTVISA HRTPERMNEY ARSAHTRGLR AIIAGAGGAA HLPGMVAAMT PLPVIGVPVP LKYLDGMDSL LSIVQMPKGV PVATVAIGNS ANAGLIAARI VAAFEPDVCS KMLAYQDDME NVVLNKASKL EELGYGAYLD QM
|
| |
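
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty, and it assumes the gene sequence is already given in coding orientation (it starts with ATG even though the Strand field is "-"). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three and the last sequence blocks are used here rather than the full 1629 bp.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that separates the 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final block of the gene sequence in this record.
gene_start = "ATGCTCGCAA TCGCGGGGGC GCCGATGGGG"
gene_end = "CAAATGTAA"

print(translate(gene_start))  # MLAIAGAPMG, the first 10 residues of the protein
print(translate(gene_end))    # QM, the last 2 residues before the TAA stop
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field; that result is not reproduced here.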