Gene OSTLU_46799 details


Gene Information

Locus tag: OSTLU_46799
Symbol:
ID: 5004140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 110121
End bp: 111239
Gene Length: 1119 bp
Protein Length: 311 aa
Translation table:
GC content: 62%
IMG OID: 640419561
Product: predicted protein
Protein accession: XP_001419904
Protein GI: 145351058
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
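
The gene length listed above is consistent with the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption; the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper name:

```python
def inclusive_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based, inclusive coordinates.

    The 1-based inclusive convention is an assumption, not something the
    record states explicitly.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
print(inclusive_span_length(110121, 111239))  # 1119
```

Under this convention the result matches the listed gene length of 1119 bp.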


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.970391
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ACGCGACCCT ACGCCGCGGC CGCGGACGCC GCGCGCGCGA CGGCGCCGCG GCGCGAGCTC 
GGCGCGGTGT ACAACGATTG GACCAAGGAC GAGGTGCGGG CGCTGTACGC GCGACCGCTG
CTGGAGCTGG TGTTCGACGC GGCGAAGACG CATCGCATGC ACCACGACCC GAGGCAGGTG
CAGCAGTGCA CGCTGCTGAG CATCAAGACG GGAGGGTGCC CGGAGACGTG TAATTACTGC
GCGCAGAGCT CGTCGTGGAA GGGGGAGACG AAATTGAAGG CGGAAAAACT GATGGGCGTC
GAAGAGGTGA TCGAGGCGGC GAAGAGGGCG AAGGAGGCGG GGAGCACGAG GTTTTGCATG
GGGACGGCGT GGCGAGGGCC GAGTCAAGTC GGCGCGGGAC AGTTTGAACG CGTCCTGGAG
ATGACGAAGG AGGTGAGGGA CATGGGGATG GAGGTGTGCG CGACGCTGGG GATGTTGACC
CCGGAACAAG CGCTGAAGCT CAAGGATGCG GGGTTGACGG CGTATAATCA TAACTTGGAT
ACGAGTCCGG AATATTACGA CAAGGTGACC TCGAGCCGCA AGTACGAGGA TCGGTTGAAC
ACGATCGCCG CCGTGCGCGA GGCTGGTATT TCCGTCTGCT GCGGTGGAAT TCTTGGTTTG
GGCGAAGAGG AGTCGGATCG GGCGAGTCTG ATGACGGTGT TGGCGACGCT TCCCGAGCAT
CCGGAGAGCG TTCCCATCAA CGCGCTGGTG CCCGTGGAGG GAACGCCGTT CAAGGACATG
ACTCCGCCCA GCGGTCTCGA GATGGTGCGC GCCATCGCCG TGGCGCGCAT TTTAATGCCC
GCCACCGTCG TTCGATTGAG CGCCGGGCGT GTGAACATGA GCCCGGAGAC GCAAGCGTTG
TGCTTCATGG CTGGCGCCAA CAGCGTCTTC ACGGGCGATA AACTCTTGAC CACGCCGAAT
AACGAAAAGA GCGAAGATTC TTTCTTGTTC GAAGAGCTCG GTCTCGAGGG CCGTCCGGCT
TTCGTGCCGT ACGCAGCGGG CGCGGCTTCG AGCGATGGAA GCGAGTGGAA ACACATGAAA
CACGAGTTGT AATTTTCTGA ATTTCTATCT GAGAGTTCA
 
Protein sequence
MHHDPRQVQQ CTLLSIKTGG CPETCNYCAQ SSSWKGETKL KAEKLMGVEE VIEAAKRAKE 
AGSTRFCMGT AWRGPSQVGA GQFERVLEMT KEVRDMGMEV CATLGMLTPE QALKLKDAGL
TAYNHNLDTS PEYYDKVTSS RKYEDRLNTI AAVREAGISV CCGGILGLGE EESDRASLMT
VLATLPEHPE SVPINALVPV EGTPFKDMTP PSGLEMVRAI AVARILMPAT VVRLSAGRVN
MSPETQALCF MAGANSVFTG DKLLTTPNNE KSEDSFLFEE LGLEGRPAFV PYAAGAASSD
GSEWKHMKHE L
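
The sequence blocks above are printed in groups of ten with line breaks, so they need cleaning before any check against the listed length, GC content, or protein. The following is a minimal sketch of such a check. The function names are hypothetical, and the standard genetic code (NCBI translation table 1) is assumed because the record's translation table field is empty.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1). ASSUMPTION: the record
# leaves "Translation table" blank, so the standard code is used here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(block: str) -> str:
    """Drop spaces, line breaks and other non-letters; return uppercase text."""
    return "".join(ch for ch in block if ch.isalpha()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of G and C bases in the cleaned sequence (0.0 if empty)."""
    seq = clean_sequence(block)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_first_orf(block: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the sequence.

    Unknown codons are rendered as 'X'. Returns '' if no ATG is present.
    """
    seq = clean_sequence(block)
    start = seq.find("ATG")
    if start == -1:
        return ""
    residues = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Thirty bases copied from the gene sequence above, spanning the first ATG.
fragment = "CATCGCATGC ACCACGACCC GAGGCAGGTG"
print(len(clean_sequence(fragment)))   # 30
print(translate_first_orf(fragment))   # MHHDPRQV
```

The translated fragment matches the first eight residues of the protein sequence. Passing the full gene sequence block to `clean_sequence` and `gc_fraction` allows the same comparison against the listed gene length and GC content.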