Gene OSTLU_31440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31440 
Symbol 
ID5001581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp876828 
End bp878252 
Gene Length1425 bp 
Protein Length474 aa 
Translation table 
GC content60% 
IMG OID640417002 
Productpredicted protein 
Protein accessionXP_001417629 
Protein GI145346298 
COG category[L] Replication, recombination and repair 
COG ID[COG0708] Exonuclease III 
TIGRFAM ID[TIGR00633] exodeoxyribonuclease III (xth) 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCGC TGTTCGACGC GCTCGGAAGC GATTTGAAGG TGATTTGTTT GCAGGAAACA 
AAGCTGAGCT CGAGCGGCGA CGTCGAGCGG CTGGAACGCG TCGAAGGTTG GGACGGCGCC
CACGCGGTGT GCGAATCCTC CAATCACCGC GTGGGATACA GCGGCGTGGC GGTGTACTGG
CGCTCGAACG AGATTTGCCC GACGTCGATC GAGCGGGGAG TGTGCGCGAA AGGCGACGCG
GGCGAGTCGA CCATGTGGGC GGGAGAAATC GCGCCGTTCG CGGACGACGC GACGCGCGCG
AAAGAGATCG ACGGCGAGGG CAGGGCGTTG TGGGTCGATT TCGGTGAGTT CGTTCTGTGC
ACGGTGTACG TGCCGGCCGT TTTCGGCGAT CCAGCGGTCG ATGAGAAAAC AGCCGAGCGC
GCGGCGTTCA AGCGCGATTT TTTGAGCGCG CTCGAGGCGA GATACAAGAG TCTGCGCGAA
CGCGGTCGAA ATGTGATTTT ATGCGGTGAT TGGAACATCG CACCGTCGTG GAAACTCGAT
CGCGCGGATG AAGACCCGAA CGCGGTGGAA CCTCGAAATC CATCGCGCGA TTGGCTCGCG
GCACAACTTG CCGGGGACGC GATGGTGGAC GTGTTTCGCG AATTTTTTCC GACGCTCGGC
GATGCGTTCA CGTGTTGGAA CGTTGCGAGT GGAGCGCAGT TGTCCAATTA TGGATCACGA
ATCGATTATT TCTTGTGCGA TCGAGCGGTG ACGTTGAAGC GCGTCCGAGG CGTCGGTGTG
GCACAAAAAT TTGAAGGGAG CGATCACGCG CCAGTCTATC TTGAGCTCGA GGAATCGATG
TGGAGGCGGA GAGATTCGCA ACAAACGCCT CCGTCTTTGG CGATTTCGAT GCTTTACCCA
GGTCGACAAA CCACGGTAGA TTCAATATTC GCACGCGCAT CGTCGACGAG CAACGCGACG
CCGGAATTTC TTAACGCGGC GTCTCAGTCG CGAGCGAAGC CGACGCGCCC AAGCGCCCGC
GCCCAATCCC GCGCGGGCGT CTCAGACGCG CCCAAGCGCA AGCCCGAAGC GACGTTGAAA
GATTTCTTCG TCGTCAAGTC CAAAAAGAAA GAGCCGGATG ACCGAAACGA ACGCCAACTA
GACACAGTAG AACAACCTAT AGCACCCACG GCGAACGCAT TCGAATCGCG CGAGACGAAA
GTGAGCTCCG AAGAAGCGCG AGGCGCGTGG ATGAACACGT TCGCCAAAAT GGCGCCTCCC
AAGTGCAAGC ACGGCGAAAC GTGCAAAGTA CGCACGGTGA AGAAGAAGGA GAGCCCACAC
TGTGGACGCG TGTTCTTTTG CTGCCCGCGC CCGGCCGGCG CGCGCACCAA TCCCGATTGT
GACTGCGGTT TCTTCCTCTG GCGAGAGCAT CGCGCGCCGA AGTAG
 
Protein sequence
MKALFDALGS DLKVICLQET KLSSSGDVER LERVEGWDGA HAVCESSNHR VGYSGVAVYW 
RSNEICPTSI ERGVCAKGDA GESTMWAGEI APFADDATRA KEIDGEGRAL WVDFGEFVLC
TVYVPAVFGD PAVDEKTAER AAFKRDFLSA LEARYKSLRE RGRNVILCGD WNIAPSWKLD
RADEDPNAVE PRNPSRDWLA AQLAGDAMVD VFREFFPTLG DAFTCWNVAS GAQLSNYGSR
IDYFLCDRAV TLKRVRGVGV AQKFEGSDHA PVYLELEESM WRRRDSQQTP PSLAISMLYP
GRQTTVDSIF ARASSTSNAT PEFLNAASQS RAKPTRPSAR AQSRAGVSDA PKRKPEATLK
DFFVVKSKKK EPDDRNERQL DTVEQPIAPT ANAFESRETK VSSEEARGAW MNTFAKMAPP
KCKHGETCKV RTVKKKESPH CGRVFFCCPR PAGARTNPDC DCGFFLWREH RAPK