Gene OSTLU_1214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_1214 
Symbol 
ID5004001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp286930 
End bp288012 
Gene Length1083 bp 
Protein Length361 aa 
Translation table 
GC content54% 
IMG OID640419422 
Productpredicted protein 
Protein accessionXP_001419962 
Protein GI145351179 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCAAAGAAC TCGATTTCAA GGGCAAGCTG TACGTGGCGC CGTTGACGAC GGTTGGGAAC 
TTGCCATTTC GGCGAGTTTG CACCGATCTC GGCGCCGACA TTACCGTATC AGAGATGGCG
ATGGCGAGTA ATTTGCTCAA GGGGGATCGT AAAGAGTGGG CGCTTCTTCG TCGTCACCCA
AGCGAGAAGT GCTATGGTAT ACAAGTTTGC GGTGGATATC CAGATCTGAT GGCAAGGTGT
GCGGAATTGA TCGACAACGA AGTCTCGTGT GATTTCATCG ACGTCAATAT GGGATGCCCA
ATCGACGGCG TTTGCGCCAA GGGTGCCGGT AGCAGTCTCA TGCGCGATAC TGATCGATTG
AAAAACGTTG TACGGACAAT GGCGGCGGTT TCTTCGACCC CCGTAACAAT CAAGCTCCGC
ATGGGCTACT TTGACGACCC CTCGAAGTAC GTTGCGCACG ACATCATCCC GCGAGCGAAA
GCTTGGGGAG CGTTTGCGGC AACTCTACAC GGGCGCACTC GTGAGCAACG TTACTCGCGC
CTCGCGGATT GGTCTTACAT TCATCGTTGC GCCGACGTGG CGGCAAAGAG CGAGTTCACA
CTCATCGGTA ACGGAGATGT GTACACGTAC GAAGATTACA ACGCCCAAGT CGCCGACAAC
AAAGTGGCGA CGTGTATGAT CGGTCGCGGT GCCATCATCA AGCCCTGGCT CATGACTGAA
ATCAAAGAGC AGCGTCACTG GGACATAAGC GCCAATGAGC GATTAGATTT GTTCAAGGAC
TTTTGCCAAT ATGGGCTCGA ACACTGGGGC AGCGACTCGA TGGGAGTGGA GAAGACGCGC
CGCTATCTCC TCGAGTGGAT GAGTTACACC CATCGATACG TCCCAATAGG ACTATTGGAG
CAAAACGTCG TTCCGAAACT CCACTTGCGT CCGATGCGTT ACGTCGGACG ATCGGACCTC
GAAACCAAAC TCGCGAGCGA CAGACTCGAA GATTGGCTCG AGTTGAGTGA AATTTGCGGG
CTTGGCAAAC CCGACGCGTC GTTCAAGTTT GTTCCAAAAC ACGCTTCGAA TAGCTATACA
AAA
 
Protein sequence
RKELDFKGKL YVAPLTTVGN LPFRRVCTDL GADITVSEMA MASNLLKGDR KEWALLRRHP 
SEKCYGIQVC GGYPDLMARC AELIDNEVSC DFIDVNMGCP IDGVCAKGAG SSLMRDTDRL
KNVVRTMAAV SSTPVTIKLR MGYFDDPSKY VAHDIIPRAK AWGAFAATLH GRTREQRYSR
LADWSYIHRC ADVAAKSEFT LIGNGDVYTY EDYNAQVADN KVATCMIGRG AIIKPWLMTE
IKEQRHWDIS ANERLDLFKD FCQYGLEHWG SDSMGVEKTR RYLLEWMSYT HRYVPIGLLE
QNVVPKLHLR PMRYVGRSDL ETKLASDRLE DWLELSEICG LGKPDASFKF VPKHASNSYT
K