Gene OSTLU_47417 details

Gene Information

Locus tag: OSTLU_47417
Symbol:
ID: 5004938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 494453
End bp: 495674
Gene Length: 1222 bp
Protein Length: 397 aa
Translation table:
GC content: 57%
IMG OID: 640420359
Product: predicted protein
Protein accession: XP_001421167
Protein GI: 145353749
COG category: [C] Energy production and conversion
COG ID: [COG0538] Isocitrate dehydrogenases
TIGRFAM ID: [TIGR00127] isocitrate dehydrogenase, NADP-dependent, eukaryotic type
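
The Gene Length value is consistent with 1-based, inclusive coordinates: End bp minus Start bp, plus one. A minimal Python sketch of that check follows. The function name `span_length` is hypothetical, and the inclusive convention is inferred from the numbers in this record rather than stated by it.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(494453, 495674))  # 1222, matching the Gene Length field
```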


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCGATGGTGT ACGTGCGGGG AGAGGAGATG ACGGCGTACG TCATGGATTT GATTCGCGCG 
AAGTGGATCG AGCCGCGCGT GGACACGACG GCGTGGCGAG AGTTTGATTT GCGAGCGAAG
AATCGAGACG ATACCGAAGA TCAGGTGCTG CGAGACGTGA TCGAGGCCGG GAAGGCGGTG
AAGGCGATAT TTAAGGAACC GACGGTGACG CCGACGGCGG ATCAAGTGAA ACGATTGGGG
TTGCGAAAGA GCTGGGGGTC GCCGAACGGC GCGATGCGAC GCGGATGGAA CGGGATTACC
ATTTCTCGAG ACACCATTCA CATCGATGGC GTGGAATTGG GATATAAGAA ACCGGTGTTC
TTCGAGCGAC ACGCCGTGGG CGGGGAGTAC GCGGCGGGGT ACAAGAATGT GGGCAAAGGG
ACTTTGGTCA CGACGTTTAC GCCGAGCGAA GGCCCAGACG CGGGGAAACC TGTCGAGGTC
GATTCGCGCA CGATCACCGA CAACGAAGCG GCGGTGGTGA CGTACCACAA TCCGTACGAT
AACGTGCACG AGCTCGCGCG TTTCTTCTTC GGTCGATGCC TCGAAGCGAA GATTACGCCG
TACGTCGTGA CGAAGAAGAC GGTGTTCAAA TGGCAAGAGC CGTTTTGGCA AATCATGAAG
AAAGTTTTCG ACGAAGAGTA CAAGTCCAAG TTCGTCGACG CTGGGGTGAT GAAGTCCGGT
GACGAGCTCG TGCACTTACT GTCCGACGCG GCGACGATGA AGCTCGTGCA ATGGCGCCAA
GGCGGATTCG GCATGGCGGC GCACAACTAC GACGGCGACG TTTTGACGGA TGAGTTAGCC
CAGATCCACA AGTCCCCGGG TTTCATCACG AGTAACTTGG TCGGCGTCGA TGAAAACGGT
ACGCTTATCA AAGAATTTGA AGCCTCGCAC GGCACCGTCG CCGACATGGA CGAAGCGCGT
CTGCGCGGCG AAGAGACCTC TCTCAATCCT CTCGGCATGG TTGAAGGTTT AATCGGCGCC
ATGAACCACG CCGCCGACGT TCACAACGTC GACAAAGAGC GTACGTTAGC GTTTACCGCT
AAAATGCGCG CCGTGATTCA CCAACTCTTC CGCGAAGGCA AAGGCACGCG CGATTTGAGC
GGCCCCTCTG GTTTAACCAC CGAACAATTC GTCGACGCCG TCGCCGAACG TCTTTGATTT
CGATGGCGTC TCTCACACCC TT
 
Protein sequence
MVYVRGEEMT AYVMDLIRAK WIEPRVDTTA WREFDLRAKN RDDTEDQVLR DVIEAGKAVK 
AIFKEPTVTP TADQVKRLGL RKSWGSPNGA MRRGWNGITI SRDTIHIDGV ELGYKKPVFF
ERHAVGGEYA AGYKNVGKGT LVTTFTPSEG PDAGKPVEVD SRTITDNEAA VVTYHNPYDN
VHELARFFFG RCLEAKITPY VVTKKTVFKW QEPFWQIMKK VFDEEYKSKF VDAGVMKSGD
ELVHLLSDAA TMKLVQWRQG GFGMAAHNYD GDVLTDELAQ IHKSPGFITS NLVGVDENGT
LIKEFEASHG TVADMDEARL RGEETSLNPL GMVEGLIGAM NHAADVHNVD KERTLAFTAK
MRAVIHQLFR EGKGTRDLSG PSGLTTEQFV DAVAERL
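
The gene sequence shown above appears to begin three bases upstream of the first ATG, and translating from that ATG reproduces the start of the protein sequence. The sketch below illustrates this on the first 30 bases only. It assumes the standard genetic code (the Translation table field in this record is empty), and the names `CODON_TABLE`, `translate`, and `gene_prefix` are hypothetical.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, the conventional layout for this amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above, spaces removed.
gene_prefix = "CCGATGGTGTACGTGCGGGGAGAGGAGATG"
start = gene_prefix.find("ATG")    # offset of the first ATG
print(start)                       # 3
print(translate(gene_prefix[start:]))  # MVYVRGEEM
```

The output matches the first nine residues of the protein sequence (MVYVRGEEM...). Applying the same function to the full gene sequence from that offset would be the natural next check.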