Gene OSTLU_89603 details


Gene Information

Locus tag: OSTLU_89603
Symbol:
ID: 5006420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009373
Strand:
Start bp: 7762
End bp: 8808
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table:
GC content: 71%
IMG OID: 640421841
Product: predicted protein
Protein accession: XP_001422410
Protein GI: 145356381
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
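
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(7762, 8808)
print(length, protein_length_aa(length))  # prints "1047 348"
```

Both results match the Gene Length and Protein Length values reported in the record.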


Plasmid Coverage information

Num covering plasmid clones: 76
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCGC CGAACGCGTC GGCCGCCGAC GCGCTGACCG AGCTCGAACT GCGCGTGTAC 
GACCGGCAGA TTCGCGTGTG GGGCGTGGAG ACGCAGCGCC GACTCGGTCG CGCGTCCGTG
CTCGCGTGCG CGGGGGCGAC GACGACGCGC GCGACGACGA CGACGCGCGT CGGCGCGCTC
GCGGAGACGC TCAAGAACGT CGCGCTCGCG GGCGTCGGAC GCGCGGTGAT CAGGGACGAC
GCGGGCGAGC GCGCGGAGGC GTCGCGCGGC GAGGATGGGA ATTTTTTAAA CGCGGCGTCG
ACGCGCGACG ACGACGCGGA CGACGTCTCG GTCTCGCGCG CCGAGGCGAT GGCGACGACG
CTGCGAGAGA TGAACGCGTT CGGTGAATTC GAGGCGTCGA CGCCGAACGG GCGCGCGCTC
GCGGACGACG CGGAAGCCTT GGACGGGATC GAGGGGTTCG ACGCCGTCGT CGTCGCGGAG
ATGGGATTGG AGCGCGCGAT GCGCGTGAAC GAGGCGTGCA GGCGACACGG GAAGCCGTTT
TTCGCCGCGT TTAGCGGGGC GTCAGCGGCG TGGTTCTTCG CCGATCTCGG CGACGCGTTC
GAGTACGCGG AGGGAGACGA AGTAAAAATC GCGCCTCGAG GCGCGACGCT GCGACGAGCG
CTCGACGCCG CCGAGGCGGA TTTCGGGCGC GTTAAGCGGC GGTCGCCGCG CATGCCGCTC
GCCGTGCGCG TCGTCGCCGA GTTCGAGCGC GCGCACGGGC GCGCGCCGAC GATGGAGGAT
TGGGACGCCC TGGACGCGCT GCGCGTCGAG TTGCCGACGC GATTCGGCGC GAGCGCCGAC
TGCGTCGACG CCGAGCACGT GCGCGCTTTG GTGTCGGGAG AGCGCGAATT TCCCGCGATA
AACGCCATCG TCGGCGGGGT GCTGGCGCAA GAGATTTTGA AATCCATCAG CCGCAAGGGC
GCGCCGTGCG TCAATCTGTT CACGTTCGAC GTCGCGAGCG GGCAAGGCGC GACGTACGAC
TTGGGCGGCG GCGAAACGGC GCGCTAG
 
Protein sequence
MPAPNASAAD ALTELELRVY DRQIRVWGVE TQRRLGRASV LACAGATTTR ATTTTRVGAL 
AETLKNVALA GVGRAVIRDD AGERAEASRG EDGNFLNAAS TRDDDADDVS VSRAEAMATT
LREMNAFGEF EASTPNGRAL ADDAEALDGI EGFDAVVVAE MGLERAMRVN EACRRHGKPF
FAAFSGASAA WFFADLGDAF EYAEGDEVKI APRGATLRRA LDAAEADFGR VKRRSPRMPL
AVRVVAEFER AHGRAPTMED WDALDALRVE LPTRFGASAD CVDAEHVRAL VSGEREFPAI
NAIVGGVLAQ EILKSISRKG APCVNLFTFD VASGQGATYD LGGGETAR
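
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. The record leaves the Translation table field empty, so the standard genetic code is an assumption here; the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The example uses only the first 30 nucleotides of the gene sequence listed above.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, in TCAG codon order (assumed table).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30_nt = "ATGCCCGCGC CGAACGCGTC GGCCGCCGAC"
print(translate(first_30_nt))  # prints "MPAPNASAAD"
```

The output agrees with the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content reported in the record.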