Gene OSTLU_37849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37849 
Symbol 
ID5005921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp253969 
End bp255378 
Gene Length1410 bp 
Protein Length469 aa 
Translation table 
GC content62% 
IMG OID640421342 
Productpredicted protein 
Protein accessionXP_001421892 
Protein GI145355280 
COG category[L] Replication, recombination and repair 
COG ID[COG1041] Predicted DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.00619482 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00164206 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGCGGAG TGTCGAACGT GGAGGACTTG GCGTGGACGC GCGCCGACGG CGGCGACTGC 
GAAGAGACGC CGTTTTGGTA CGTCGAGCTC CCGGACGAGC GCACAGCGCG CGCGATCGCA
TCGCGCGCGC TGTTAGTCAA AGCGATTTTA GAGGCGTGGG GTTCGGCGCC GGACGAGGGT
GGGCTGCGGG ACGCGGTCGC GGCGTACGAC GAGTCGCGAA AGACGCCGTA TTTAGCGCCG
GGGACGACGT TTAAGGTGGA GGTGGAGGAT TTCGGCGTCA GGCGCGGGTC GAAAGACATC
TTAAAGCGCG TCGGCGACTT GGGATTGCCG TTTCTAGGGA AGGCGGATCT TAAGAATCCG
GAACATTTAT TTTGGAGCGT GGTGAGCGAC ACGAAGGAGA CGCCGAGTTT ACCGCAAACG
CCGCGGCATT GCTTCTTCGG GCGCGTCGTC GGTCAGAGCG ATCGCTCGAC GTTGAAAAGG
TACGACTTGA AGCAACGTTC GTATTTAGGG CCGACGAGCA TGGACGCGGA GATGGCGCTT
TTGATGGCGA ATTTCGCGCA GGCGCGTCCT GGCGGCGTGA TATTAGATCC GTTTTGCGGC
ACGGGATCAA TGCTCGTCGC CGCGGCGCAT TACGGTGCGA TGACGATGGG CATAGACATA
GACATTCGCG TCATCAAGCA TGGGAAATCG GCGCGCAAGA GCGGCTCGAA GTTTGGCGTA
AAAGCGAGCG ATGGTTCGTC GGTGGACGTG TGGACGAATT TCGCGCAGTA CGGTTTGCAA
CCGCCGGTGG CGTTGTTTGT CGGCGATTTG CACGCGTTGC CGACGCGACG GTTTGGTTTA
GAGGGTACGC TCCAAGGTAT CGTCGCCGAC CCTCCGTACG GCGTCCGCGC CGGCGGACGC
AAAAGCGGTG GGCGCAAACC GCTTCCCGAA GACTACGCCA TCCCGGAGGA GATGCGAGAA
ACGCACATTC CGAGCACCGC GCCCTATCCA TTCGCCGAGA TGAACGACGA TTTGATGGAG
CTCGCCGCTC GGTTTCTCTC CATCGGCGGC CGTCTCACGT TCTTCCTCCC CGGTTCCACC
GAAGACGCCG AACGAGAGAT TCGCGACCTC CCCGCGCACC CGGCGCTTCG CCTGCGATGG
CACTCTCTAG AAACCTTCAA CCAAATCTGG GGTCGCCGTC TCGTGACGTA CGAAAAAATA
CACCCCTACG ACGTTGAAGT CGCGCGAAAG GCGCGCGAAG ACGCCGTCGC CGCCCGCGCG
GCGAGCGACG AGCCGGATTT GATCGAACGG ATGCGCGCGT TGGTGTACGA CCAAGTCCCC
GCCGAGGCGA AGCGTCGCAA GCGATACGAG AAATTCCACG GCGTGCCTCC GCCAGACGCG
CTCACCGAGC GCGCGAGCGC AGAGACGTAA
 
Protein sequence
MCGVSNVEDL AWTRADGGDC EETPFWYVEL PDERTARAIA SRALLVKAIL EAWGSAPDEG 
GLRDAVAAYD ESRKTPYLAP GTTFKVEVED FGVRRGSKDI LKRVGDLGLP FLGKADLKNP
EHLFWSVVSD TKETPSLPQT PRHCFFGRVV GQSDRSTLKR YDLKQRSYLG PTSMDAEMAL
LMANFAQARP GGVILDPFCG TGSMLVAAAH YGAMTMGIDI DIRVIKHGKS ARKSGSKFGV
KASDGSSVDV WTNFAQYGLQ PPVALFVGDL HALPTRRFGL EGTLQGIVAD PPYGVRAGGR
KSGGRKPLPE DYAIPEEMRE THIPSTAPYP FAEMNDDLME LAARFLSIGG RLTFFLPGST
EDAEREIRDL PAHPALRLRW HSLETFNQIW GRRLVTYEKI HPYDVEVARK AREDAVAARA
ASDEPDLIER MRALVYDQVP AEAKRRKRYE KFHGVPPPDA LTERASAET