Gene OSTLU_26042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26042 
Symbol 
ID5004226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp132627 
End bp133907 
Gene Length1281 bp 
Protein Length426 aa 
Translation table 
GC content63% 
IMG OID640419647 
Productpredicted protein 
Protein accessionXP_001420089 
Protein GI145351448 
COG category[L] Replication, recombination and repair 
COG ID[COG3145] Alkylated DNA repair protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0723901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCCG CGCGCGGCCA CCATCCCGCG CGATGTACCG CGCGCGCGCA CCGCATCGCG 
CGACGACGAC GCGTCGTCGC GCGTCGCGCC GACGCGAGCG ATCGCGAAAC CGCCGCCGCG
CCGAGCGCCG CGCAGCTCGT CAACGACGCG AAAAGTGCGC TGGATATTTT AACCGCCGCG
GATTCGTTGC CGCTGCCGAC GGACGCGTCG CTCGCGCCGC ACGAGTCCCA GCTGCACCAC
CGAAGGAAAC GCAAAAAGAC GTGCTCGCGG GCGCTCCAAA GGCTGGCGAA AATGCTGGTC
GGGACGCGAC GGGAGGACGC GCGGCGGGAG GCGACGAGCG CGACGCAGTT CGCGAGGTTG
GTCGCGGGCG CGCTGTGGAT CGACGACGAC GACGAGACGA GGAACGACCC GGAGGCTGGT
GTGTTGTTTA CCGAAACCGC GCGCGCGCTC GGGAGCTTGG CGCCGTTTGA GATGGAGGAA
GCGGCGAGAG TGAATTTTTA CGCGACGGCG GCGGCGGCGA CGCTGCCGCC GCGCTGCGCG
ACGGTGGTCG CGTGGGCGCT CGCGCGGTGC GGGAGCGCGG TGCCGAACGA AGTCGACATC
GCTATGCGTG GGGTTCCGTT TCGTTTTCAA CCCTATCTAA CGGCGGGATT GATTGATTTG
GAGACTTTGA AACGCGAAGT GCCGTTCAAG CGCGAACAGT TGACGACTCG GGACGGCAGG
CGCGTGGACG AGCGCCGCGA GACGTGTTGG ATGGGCGAAG AACACGTTGG TTCGTACGCG
TACAGTGGGA AAATTATGCA ACCCGTGCCG ATGTGCCCGG CGGTGGCGAG AGTGCGCGAT
GCATTGGAAG AGAAGACGGG CGAGAGGTTC GATTGCTGCT TGATTAATTT GTACCCGAGC
GAAACGGCGG CGTGCGCGTA TCATACCGAT CCGTTCATGG GCATCGGGTA CGCCACGGAT
AGCATCATCG TCTCCGTAGG TGAGACCAGG CGATTTAGTT TTAGGCCTCT AGGTTCGACC
GACGCGGAGT CGCATTGGAT CCGAACGCTC GATGGCGACG CGATTTGGAT GTTCGCGAAT
TGTCAAGACG ACTTCGAGCA TTGCGTGATG ACAGCAGAGG GCGACGGTAA CGACGCGCCT
CGCGCGAGCA TAGTTTTCAA GCGAAGTCTG AAAAGAAAAT CAGCGGCGGA GGCGAGAGCG
AAGAAGAAGA AGAAGAAACC TCCACCGTCG TCGAGCGGAG GAGCCGGAGG AGGTAGGAAA
CAGCCTGCGA AGAGACGTTA G
 
Protein sequence
MRAARGHHPA RCTARAHRIA RRRRVVARRA DASDRETAAA PSAAQLVNDA KSALDILTAA 
DSLPLPTDAS LAPHESQLHH RRKRKKTCSR ALQRLAKMLV GTRREDARRE ATSATQFARL
VAGALWIDDD DETRNDPEAG VLFTETARAL GSLAPFEMEE AARVNFYATA AAATLPPRCA
TVVAWALARC GSAVPNEVDI AMRGVPFRFQ PYLTAGLIDL ETLKREVPFK REQLTTRDGR
RVDERRETCW MGEEHVGSYA YSGKIMQPVP MCPAVARVRD ALEEKTGERF DCCLINLYPS
ETAACAYHTD PFMGIGYATD SIIVSVGETR RFSFRPLGST DAESHWIRTL DGDAIWMFAN
CQDDFEHCVM TAEGDGNDAP RASIVFKRSL KRKSAAEARA KKKKKKPPPS SSGGAGGGRK
QPAKRR