Gene OSTLU_19106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19106 
Symbol 
ID5006811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp445418 
End bp447331 
Gene Length1914 bp 
Protein Length637 aa 
Translation table 
GC content59% 
IMG OID640422232 
Productpredicted protein 
Protein accessionXP_001422754 
Protein GI145357087 
COG category[L] Replication, recombination and repair 
COG ID[COG5535] DNA repair protein RAD4 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.169649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCTTAG TGGCTCGCGG TAGGTTGGTG CGGGCGGCGG CGTCGAGCGA GATTTTGCGA 
GCGCTCGTGA CGTCGTGCGC GCCGCGGCGG TTGGTGGACG CGTGCACGAC TACGACGGCG
ACGGTTGAGG TGAGTGCGCT GGCGAGATTA GTAGATTGGT TTGCCGATGC GACAGCACCG
CCGATCGCGA TCGATGGTGA TGACGTTTTA GGCGCGAAAA AGGCGGTGAA GGACGCTCGA
TGGCGTCGCT TAGCGTTTCG AGCGGCGATG CATCGCGCGA GCGCCGGCGC GACAGGCAGC
GCGTGTGGTA TCGCCGCAAG ACTTGCGTCG CTTTGGGCGC TTCGAGGTCG CACTGAAATA
AGTGAAGAAG AGTGCGCGGC GCTGTTCGCG GCGTTGTGTC AAGGTTTGGG TTTGATGTGT
CGCGTAGTGT CGTCGCTCGA GCCAGTGCCG GTTCGCGCGA GCGCAGCGAA ACTCGAATCC
ATGGGCGTGC TGCATACACC TAAGCCACTG CCGTTAGAGA CTGTGCGACG TCGCGAAGGA
CTTGTCCGAC ACTGGTGCGA GGTGTTGTGC GCGCGACACG ATGCGGAATC AAATGACAAA
GGAAACGCTC GCTGGGTGAG CGTTGTGCCG ACGACGCGAG CATCGGTGGA CGCTCCAGAG
ATTATTTTTG GCAATCGCAA ACGTGGCACG ACTGCGGATG CGACGTCGTC TATGCCGTAC
GTCGTGGCGT TTTACGCCGA TTCGGGCGCG CGTGATGTCA CACGCAAGTA CTCCGCGGCG
TTTTCACAGG CATTGCACCA CCGCACGCCA GATTGGAAGT GGTGGGAGAA AATCACAGAG
CACGTCGAAA GAATCCATCG CGACGCGATC GCGCGCGATG CCTCACCCGA GCTGCGGAAA
GTCGTAGAAA CGGCTGATGC CACAGAGTTA TTCGAGATGG ACGTGAGAAG CTCGAAAGAG
CGCGTGCCCG GGACGATGAC TGAGATTAAA AATCATCCGC TCTGGGTCGT CGAGCGCTTT
TTGTCGCGAT CACAGTGCAT TCACCCTCGA CATCCCGTGA AAGGTCTCAT CGCCGGCGAA
CCAGTATTTC CGAGATCGTG TGTGAAGGAG TTGAAGAGCG CCGAGCGCTG GAAGAGCGAG
TGCAGAAGAC GCGTCATCGA CACGTTGATG AACTCGCCCG TGCGCAAGAT TCACAGCAGA
GCGTCTCAGG CGCGCGTCAA GGCTCTCACG CGCGCGCGCG AAGGCTGGTT CATGACGCAA
GCGGAAGGAT CAAAAGAACG CTTAGACTCA GAAGAGTGGC GCGTCTCGAT GTCCGAGCAC
GACGATTGTC CGGACGATCC GCAGAGAATT CCCGGCGACG TTGCGCTCTA CGGCGAATGG
CAAACCGAAC CTTGGACGCC TCCAGCCGCC GTGGGCGGTC TTGTGCCGAA GAACGATCGC
GGAAACGTCG ATTTGTACGG TAATGCATTA CCACCACCGG GAACCGTGCA CGTGAACTTG
CCGCGAATCG CAAAAACCGC CAAATCGATG AGTATTGACT ATGCCCCCGC GCTCGTCGGT
TTCGAGTACA AAGCCGGCGG CAAGACGCTC CCCGTCTTCA ACGGTATCGT CGTCTGCGAA
GAATTCAAAG ACGACTTACT CTCGAAACAC GAAGAAGCCG AAGAAACGCG CCGACTCGCG
ATCGAGGCCA AGGTATACAA AGAAGCCTGC CTGCACTGGC GTCTCCTCCT CGGCGCGATT
TGGACAAGAG CCGCGCTTCG CGAGGAATTC CAAGACGGCG AGGTTTTCCA AGATCCCACC
GCTCGACGCA TCGCCGCCGC TCGCGCGATG CACGAATCTG ATAATCATCC AACTGCAGCC
GTCGCGGTCG AACCCTTAGA ACTCGGCGCC GCGGCGTACG TAGAAGAATT GTAA
 
Protein sequence
MCLVARGRLV RAAASSEILR ALVTSCAPRR LVDACTTTTA TVEVSALARL VDWFADATAP 
PIAIDGDDVL GAKKAVKDAR WRRLAFRAAM HRASAGATGS ACGIAARLAS LWALRGRTEI
SEEECAALFA ALCQGLGLMC RVVSSLEPVP VRASAAKLES MGVLHTPKPL PLETVRRREG
LVRHWCEVLC ARHDAESNDK GNARWVSVVP TTRASVDAPE IIFGNRKRGT TADATSSMPY
VVAFYADSGA RDVTRKYSAA FSQALHHRTP DWKWWEKITE HVERIHRDAI ARDASPELRK
VVETADATEL FEMDVRSSKE RVPGTMTEIK NHPLWVVERF LSRSQCIHPR HPVKGLIAGE
PVFPRSCVKE LKSAERWKSE CRRRVIDTLM NSPVRKIHSR ASQARVKALT RAREGWFMTQ
AEGSKERLDS EEWRVSMSEH DDCPDDPQRI PGDVALYGEW QTEPWTPPAA VGGLVPKNDR
GNVDLYGNAL PPPGTVHVNL PRIAKTAKSM SIDYAPALVG FEYKAGGKTL PVFNGIVVCE
EFKDDLLSKH EEAEETRRLA IEAKVYKEAC LHWRLLLGAI WTRAALREEF QDGEVFQDPT
ARRIAAARAM HESDNHPTAA VAVEPLELGA AAYVEEL