Gene OSTLU_16969 details

Gene Information

Locus tag: OSTLU_16969
Symbol:
ID: 5004128
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 75702
End bp: 76909
Gene Length: 1208 bp
Protein Length: 371 aa
Translation table:
GC content: 57%
IMG OID: 640419549
Product: predicted protein
Protein accession: XP_001420068
Protein GI: 145351399
COG category: [L] Replication, recombination and repair
COG ID: [COG0708] Exonuclease III
TIGRFAM ID: [TIGR00195] exodeoxyribonuclease III
            [TIGR00633] exodeoxyribonuclease III (xth)
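
The listed gene length follows from the start and end coordinates if both are read as 1-based and inclusive; that reading is an assumption, though it matches the 1208 bp shown here. The sketch below also estimates how much of the span is non-coding, assuming the span runs from the start codon through the stop codon (consistent with "Is gene spliced: Yes", but not stated in the record). The function name `gene_length` is illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
length_bp = gene_length(75702, 76909)

# Assumption: coding length = (amino acids + 1 stop codon) * 3 nucleotides.
protein_aa = 371
coding_bp = (protein_aa + 1) * 3

# Whatever remains would be non-coding (e.g. intronic) sequence in the span.
non_coding_bp = length_bp - coding_bp

print(length_bp, coding_bp, non_coding_bp)  # 1208 1116 92
```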


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGG CGTTCACGGT CGAGGACGCG TCCGAGCCGC TGTCGTTCGC GTCGTGGAAC 
GCGAACGGAT TGCTCAACAG AATCAGAGAC AAGAGCGACC CGAACGGACG ACGCACGCGT
GCGCTGCTCG CGCTGAGCGA AAACGTGATG CGAAAACGAC CGGACGTGAT AGCGTTGCAG
GAGGTGTGGC TGAAGTGCGA GGAGATGGGA AAGGGAACGG GCAAATGGTG CGAGATTCGA
TTCGATGTGC TTTGATTCGT CGGCGACGCG CGAGACGCGC GAGACGAAGG ACTGACGGTG
ATTCGATCGA AACGCGCAGG GATCACGCGA AGATGGGGGC TGAGGATAAA AAGTTACGAG
ACGAGAAGGC ATTCATCGAG GCACTGATGG GGCGACGGCC GTTTAAGAAT TACGACGCGT
ATTATTGTTT GGCGAACGCG CGGCGCGCCG GCGTCATCAC GATGGTGAAG AAATGTTTGG
CGAAACCGAT TCGAGTCGCC AGGACGTTGG CGCTCGATGG GCGGGACGTG ACGCCCGATG
AAATCGACAC GAACGAAGGG CGAGTGCTGA TGCTCGAATA CGAAAAGATG ATTGTGTTGA
ACACGTACGT GCCGCACAAC GGGAGCAACG CCGAGCGGTA CGAAAAGCGC GCGCTGTGGG
ACTTTCGAGT TCAAAGGTTT CTGGAAAACT ACAGAGGGAA GAAGGATGTG GTGTGGATGG
GTGATTTGAA CGTGGCACAC CAGGATCATG ACGTCGGACC GAGTCCACGG CTCTTCGAGG
GGGTGGGCGG TTTTACGCTC CCGGAACGCC GACGATTCAC CGACATTTTA GCCGCGACGG
ATATGGTGGA CACTTATCGC GCCTTTAACG GCGATCGTTT GACGTACACT TGGCGTTCTA
CGCGAGGTCA AGGTTTGGAC GGTTGGCAGG GCATGCGATT AGACTACTTC GTCGTGCCGC
GTAAGTTGGT CGCGCGCATC AAGTCGTGCG AAACCTCCAC CGATCGATTC GACGACACCA
CGGCACAATC GATGCCTATG TCTTGTTTCA TGGATTCCGA TCACTGCATG ATTCACCTTT
CTCTGCACAA GCGAGAAGAC GACGACGACG AAGGCGAAAA CGAAGACGAA GACGAAGAAG
AGAACGCGCG CCGCGCGAAG CAGCAGAAAC TCGACCGAGA CGCCGACGTG ATTCTCATCT
CCGACTGA
 
Protein sequence
MTEAFTVEDA SEPLSFASWN ANGLLNRIRD KSDPNGRRTR ALLALSENVM RKRPDVIALQ 
EVWLKCEEMG KGTGKWDHAK MGAEDKKLRD EKAFIEALMG RRPFKNYDAY YCLANARRAG
VITMVKKCLA KPIRVARTLA LDGRDVTPDE IDTNEGRVLM LEYEKMIVLN TYVPHNGSNA
ERYEKRALWD FRVQRFLENY RGKKDVVWMG DLNVAHQDHD VGPSPRLFEG VGGFTLPERR
RFTDILAATD MVDTYRAFNG DRLTYTWRST RGQGLDGWQG MRLDYFVVPR KLVARIKSCE
TSTDRFDDTT AQSMPMSCFM DSDHCMIHLS LHKREDDDDE GENEDEDEEE NARRAKQQKL
DRDADVILIS D
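
The sequences above are printed in blocks of ten characters separated by spaces. A minimal sketch of how such text could be joined into one contiguous string, and how a GC percentage could be computed from it, is given below. The helper names `join_blocks` and `gc_percent` are hypothetical, and nothing here claims to reproduce the exact method behind the 57% figure in the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = join_blocks(
    "ATGACGGAGG CGTTCACGGT CGAGGACGCG TCCGAGCCGC TGTCGTTCGC GTCGTGGAAC"
)
print(len(first_line))       # 60
print(gc_percent("ATGC"))    # 50.0
```

Passing the full gene sequence through `join_blocks` and then `gc_percent` would give a value to compare against the listed GC content.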