Gene OSTLU_33669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33669 
Symbol 
ID5003858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp608032 
End bp609078 
Gene Length1047 bp 
Protein Length348 aa 
Translation table 
GC content62% 
IMG OID640419279 
Productpredicted protein 
Protein accessionXP_001419853 
Protein GI145350946 
COG category[L] Replication, recombination and repair 
COG ID[COG3145] Alkylated DNA repair protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.421608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0171071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG CGCGAGACGC GACGGGCGGC GAAGGCGCCA GAGGGCGGTG TCAGAAGCTC 
GCGAAGGCGT TGCGGGGACA GCTGGCGAAG AAAGCGTACG ACGCGCTGCG CGCGCACGCG
CGGGCGTTTC AGCGCCGGGA GGTGACGACG TCGGAGTTCG CGAAAGTATT GGTGGAGTGC
GCGCGGACGG GGGGGGTGAC GAGGGCGACG GCGCGAGAGG TGATCGCGAC GACGCCGTCG
GCGTTGGACC GGAGGCGGCT GCGAGAGTGC GTCGGACGGG ATTTAGACGA CGACGCGGCG
GTGAAATCGA CGCGTGAACG AGAGAAAAAG CCGCACGGAT TTAAGACGGA GCGTCTCGGA
CCAGGATTGG TGTGCTTGAG GAAGTTTTTG AGCGTGGAGG CGCAAATGTG GTTGGCGAGC
GAATCGTTCG CGCTCGGCGA ATCAGGATCC GACGACGCCG CGCGCGGGCA AGGTTTCTTC
GCGAAGATGG GAGATGGGAC GTTCAAGCTG AATCAAGGTA GTCGTGGGCG GATGATCCTC
GAACCCGATG CGTTTCCAGA CGGGATTTTG ACGCAGATGT GCGAGGACGC GGTGGCGGCG
GCGTGCGCCG CGGACGCCGA GATGCCGACA AACATGAACC CGACGACGTG CCTGGTAAAC
TTTTACAAAG ACGGCGCCGA GTTCAAGTGG CACAAAGATA GTGAAGATCC AAAGCTCGTA
AAATCGCGCA CGGGTCCGCC CATCGTGAGT TTCTCCGTAG GGCTGAGCGG CGACTTTGGG
TACAAATATT CGTTTGACGA TCCCGAGCAC AAAGTCGTGC GCCTGAACTC GGGCGACGTC
TTGCTCTTCG GCGGCCCTTC GCGCATGATC GTGCACAGCG TGTTAAACGT GTACCCGGGA
TCGATGCCCG GTCACTTGCG TGGGAAAATG CTCAACGGTC GCTTGAACGT CACTGTGCGA
GACATCGGTT GCGGCGTCAT CGACGCCAGC CAATTCCCGG CGTACAGAGT CTCCTACGAC
GGCGTCCAGG CCGACGGCAA CGTCTGA
 
Protein sequence
MDDARDATGG EGARGRCQKL AKALRGQLAK KAYDALRAHA RAFQRREVTT SEFAKVLVEC 
ARTGGVTRAT AREVIATTPS ALDRRRLREC VGRDLDDDAA VKSTREREKK PHGFKTERLG
PGLVCLRKFL SVEAQMWLAS ESFALGESGS DDAARGQGFF AKMGDGTFKL NQGSRGRMIL
EPDAFPDGIL TQMCEDAVAA ACAADAEMPT NMNPTTCLVN FYKDGAEFKW HKDSEDPKLV
KSRTGPPIVS FSVGLSGDFG YKYSFDDPEH KVVRLNSGDV LLFGGPSRMI VHSVLNVYPG
SMPGHLRGKM LNGRLNVTVR DIGCGVIDAS QFPAYRVSYD GVQADGNV