Gene OSTLU_33041 details


Gene Information

Locus tag: OSTLU_33041
Symbol:
ID: 5003082
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 296700
End bp: 298110
Gene Length: 1411 bp
Protein Length: 343 aa
Translation table:
GC content: 64%
IMG OID: 640418503
Product: predicted protein
Protein accession: XP_001419122
Protein GI: 145349400
COG category: [L] Replication, recombination and repair
COG ID: [COG0468] RecA/RadA recombinase
TIGRFAM ID: [TIGR02239] DNA repair protein RAD51
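
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive (which is consistent with the listed gene length) and that the gene span runs from the start codon through the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record above.
gene_len = span_length(296700, 298110)
cds_len = coding_length(343)

print(gene_len)            # 1411, matching the listed gene length
print(cds_len)             # 1032 nt of coding sequence including the stop codon
print(gene_len - cds_len)  # 379 nt left over, under the assumptions stated above
```

Under those assumptions, the leftover 379 nt would be non-coding sequence within the span, which fits the record's statement that the gene is spliced.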


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0526282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCGT GCGCCGTCGA CGACCGCGCG CTCGCGCGAG ACGACGACGT CGCGCGCGAC 
GAACCGATCG CGACGACGAC GCCGATCGCC GCGCTCGAGG TGCGACGACG CGCGACGACG
CGCGCGAACG ACGACGCGAA CGACGACGCG ATGCGATTCG AACGAGTCGT TTGACGACGA
CGACGACGAC GACGACGACG ACGACGACGA CGCGCGACGC GCGACGAGGG ACGCGACGGA
CGACGCGCGA CGGCGCGTTC GCGCGGATGA CGACTGACGA CGAACGAACG GCGACGCCGA
ACGGACTCGG ACGACGCAGG AATCTGGGAT CGCGGCGTCC GACGTGAGCA AGCTGCGCGA
CGCGGGCGTG CACACGGTGG AGGGACTCGC GGCGGCGTCG AGGAAGCACC TGCAATCGAT
CAAGGGGCTG TCGGAACAGA AGGTGGAGAA GCTGAAGCAA GCGGGTGCGA GCGAGGCGCG
AGGCGAGGCG ACGCGAGCGA ACGCGCGAGG ACGCGGGGGG GGCGCGACGC GCGAGGGCGA
CGCGAGAGGT GCCTCGTTTG ATGGATTGAA AGCGAGCACC CGAGGAGACT GACGAGAGAC
GCGACGACGC GCGAGCGACG CAGCGAACGC GATCGTGCCC GCGGGATTCA CGACGGCGAA
GATGATCGAT CAGCAGCGTC AGGATACGAT ATATATCACG ACGGGTTCGG CCAAGGTGGA
CGAATTGTTG CAGGGGGGGA TCGAAAGCGG GAGCGTGACG GAGATTTACG GCGAGTTCAG
GACGGGGAAG ACGCAGTTGA TGCACACGCT CGCGGTGACG AGCCAGATGC CGATCGAGCA
CGGTGGTGGC GAGGGTAAGT GTCTGTACAT CGATACCGAA GGCACGTTCC GTCCGCAGCG
GTTGATTCAG ATCGCCGAGA GATTCAACAT GGATCCCTCG GCGGTGCTGG ATAACGTGGC
GTACGCCAAG GCGCACAACG TCGAACATCA GAGCGAACTT TTGCTCGCCG CCGCAGGAAT
GATGGCGGAG ACGCGATTCT CGCTCATGAT TATCGACTCC GTCACCAATC TCTACCGCAC
CGAATACGAA GGTCGCGGTG AACTGAGCGC GCGTCAGATG CACCTCGGGA AATTTTTGCG
CCAACTCGCC CGTCTCGCGG ATGAGTTCGG CGTCGCCGTC ATCGTATCGA ACCAAGTCGT
CGCTAACCCT GAAGGCGGAC CATTCGCGGG AGCGAATGCG CTCAAACCCA TCGGCGGGAA
CATCATGGCG CACGCGAGCA CGACGCGATT AGCCCTTCGC AAGGGGCGCG GAGAGAACAG
AGTGATGAAA GTCGTGTGTT CTCCCGTGCT GCCGGAATCG GAGGCGCAAT TTTCCATCAG
CGAGTTTGGT ATCGAAGACG CCAAGGATTA A
 
Protein sequence
MSACAVDDRA LARDDDVARD EPIATTTPIA ALEESGIAAS DVSKLRDAGV HTVEGLAAAS 
RKHLQSIKGL SEQKVEKLKQ AANAIVPAGF TTAKMIDQQR QDTIYITTGS AKVDELLQGG
IESGSVTEIY GEFRTGKTQL MHTLAVTSQM PIEHGGGEGK CLYIDTEGTF RPQRLIQIAE
RFNMDPSAVL DNVAYAKAHN VEHQSELLLA AAGMMAETRF SLMIIDSVTN LYRTEYEGRG
ELSARQMHLG KFLRQLARLA DEFGVAVIVS NQVVANPEGG PFAGANALKP IGGNIMAHAS
TTRLALRKGR GENRVMKVVC SPVLPESEAQ FSISEFGIED AKD
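
The sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be removed before a length or GC content can be computed. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the short input string is a made-up example rather than the gene sequence itself.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Small made-up example in the same grouped layout as the record.
example = clean_sequence("ATGCGGCCAT\nATGC")
print(len(example))
print(round(gc_fraction(example) * 100))
```

Pasting the full gene sequence block into `clean_sequence` would allow its length and GC percentage to be compared with the Gene Length and GC content fields in the record.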