Gene OSTLU_18607 details


Gene Information

Locus tag: OSTLU_18607
Symbol:
ID: 5006096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 391843
End bp: 394185
Gene Length: 2343 bp
Protein Length: 780 aa
Translation table:
GC content: 56%
IMG OID: 640421517
Product: predicted protein
Protein accession: XP_001422056
Protein GI: 145355619
COG category: [A] RNA processing and modification
              [D] Cell cycle control, cell division, chromosome partitioning
              [L] Replication, recombination and repair
COG ID: [COG5049] 5'-3' exonuclease
TIGRFAM ID:
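
The start, end, and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the coding sequence length includes the terminal stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the last codon is a stop codon that contributes no residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


length = gene_length_bp(391843, 394185)
print(length)                     # 2343
print(protein_length_aa(length))  # 780
```

The result does not depend on the strand, which the record leaves blank.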


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCATCA CTGGGTATAA CAAATACCTC CAACGCGAAT TCAACGGCGC GTTCCTGCGC 
GGAGGACGAC ACAAACGGCG GCCGAAGCGG TACGATCACG TGTACGTGGA CGTGAACAAT
CTTCTGCACG TCGCGGCGCA CAACACGAAC AGCGAGCGGT CGTTTTTTAA AAAGTTGTTC
ACGCTGCTGG ATAACAGGTT GACGAAGACG AACCCGAGGC ACAGCGTGAC GTTGGCGCTG
GATGGACCGG CACCGATGGC GAAGACGATC ACGCAGAGAC GACGGAGGAT TCGACTGAGT
GCGGGGGCGG CGACGCCGCT GAGCGATGAT ATGAGCAAGC TGTTGAAGAT TGGAATCACG
CCGGGGAGCG TGTTGGCGCT GAAAATTGAC AGGGCGTTGG AGTATTACGT GGCGCGGAGG
ATGTTGCGAC GCGATCACGC GGGTTCGCCG GCTGATAACG TGCTGTACGA GATATCGAGT
ATGCGCGTGG CGGGAGAGGG GGAGATTAAA CTCGTCAAGT CGATTCAACA GCGATTGCAG
AACCCACGGT TTCAAGGGCA CTCGCACTGC ATCGTGACGG AGGACAGCGA CGCGTTGTTG
CTCGCGATGA CGCTCTTCGG GCAAGGTCAA AAGACGCGAT ACGCGTCAAA CGAGGAGTTT
CAGGTGTACG TTTTGAGTGG AAACGTAGTT TTCAGCGCGC GGCTGTTTGA TCAGTTGCTG
CTGCAATCGT TACCGAAGGG TGCGTCGCTC GACAGCGCCA GACGCGACTT CATCGGGTTG
TCGGCGATGA TGGGGAACGA TTACATCACC GGCAGCAAGC TCGGGGCGAA GACGAGCTGG
AAAGCATACT TGGAAATGCG AGGTACTTAT CTCTATCGCG ATGATCCGCT CTTTCCGATG
CCTGCAAATC AAGAGCTAAG CGCACAAGCA AAGCCGGATG GCGCCGGATA TAAAAAGAAG
CAAAAGAGCG CGACGGGAAT TCAAACTTCG GTGAATTGGG CGTTCTTAAA GCAACTGTCC
TTGAAACTTG CCGATACATC GTATGCGGCA AAATCGGCAA GTAATGCGCT GGCAAGTTCT
TCTAACCCCG CTCAGAATGA CGTCAAGAAG AAGCGCGTGT ACGACTACTT GTACGGCATC
GAGTGGATGC TCAACATGTA CTACCAAGGA GAATGTACCG ACTTTAGCTT TTACACGTAT
ACGCAAGGAC CGGATATGAT GGATTTTGCG TCAATTGGTG ACGAGTATGA CGTCTCATGC
GACCCGCTCC GCGATCTCAA GCGCGCGGAG GCGAGTTTTT ACAATTTGCG ACCGATCACG
CCCTTGGCGT ACTCGCTCGC CGTGATTCCC CGAGGCGGTA GGGCGCAGAT ATCGAAAAAT
GTTCGCCAGC TCGTCGACCC TGGATCGCCC ATACGGGAAC TGTTTGCGCT GGACTACTGT
CCGCAGTGCA TCAATCATCG CATTCACGTC TCCCCGATGG AGAACGCGTT GCAAAACTCG
CTCACGGCTG CGGATCCCGG CTTCGCCGAG GTAGTGCCAT CCAACGCACA ATTTTCAAAG
TACTCTAGAT ACACTGACGA CGAAGGATAC ATCATTCACC CCGACACTGG AGAGTATATG
TCCATGGATG AAATGCGTCA AGAGGTGAAG GAGCTCAATC GCATGCATTT GCATCACTTG
CACACCGCCA AGCAACACGT GCACACGGAT CCCATTTGCT TGCCCACGCT CGAAGCGGCG
GTGGCGCGAG CGAGCGCGGA CAATCTGCTC ACCGAAGACG AAGAGATGCT GCGGACGTTG
GCTTCCCCTG TATTGTTTTG GCGTCACGCC GTGTACGACC CTAAGGATTT GGACACGCGC
GACTTCGCGA GTGAAGAAGA GCTCACCGAG TGGCGCTCGA AAACCATCCC TGACTCTCAA
TTCGAGATTC TCGACAAGCG CAGCGTTTAC GAGCTCAGAA AGTTTGATGG TGACGTCGGT
GATGTTATGC GGCGCTGGGG TTGCGATAAC GACGCGTTCG CGCGATTCGA CAGCGAGATC
GCCGAAGGAC GAACTCACTC GCGACAGCAC ACCGTCGGCA AGACTCGAAG CGCGTTGGAC
GAGCGACTGC AAAAGTGGCG TAGCGAGCGT CGTGGCGTTG GTCGTGTAGA TGACGCGAAA
TCTACGAGCG ATATCAGCAT CAAAACGAAC GATGCCAACG GCGCGGCGTC GTCTTTGGCA
CCGAACCGCG CTCCTAAACC GCGCCCAGGC GGCGCAGGCA GCAAGCGTCG AAATTTCAGC
AAATCTCCTC GCGCGCCCTC AGCTTTCGCT CGCGTCGCCA CGGCGCACTC GCGTGTATTC
TAA
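
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any computation. The sketch below shows one way this could be done in Python, together with a simple GC fraction of the kind the GC content field reports. The function names are hypothetical, and how the database computes or rounds its own figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGGGCATCA CTGGGTATAA CAAATACCTC CAACGCGAAT TCAACGGCGC GTTCCTGCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this line only
```

Applying gc_fraction to the full cleaned sequence gives a value that can be compared with the reported 56%.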
 
Protein sequence
MGITGYNKYL QREFNGAFLR GGRHKRRPKR YDHVYVDVNN LLHVAAHNTN SERSFFKKLF 
TLLDNRLTKT NPRHSVTLAL DGPAPMAKTI TQRRRRIRLS AGAATPLSDD MSKLLKIGIT
PGSVLALKID RALEYYVARR MLRRDHAGSP ADNVLYEISS MRVAGEGEIK LVKSIQQRLQ
NPRFQGHSHC IVTEDSDALL LAMTLFGQGQ KTRYASNEEF QVYVLSGNVV FSARLFDQLL
LQSLPKGASL DSARRDFIGL SAMMGNDYIT GSKLGAKTSW KAYLEMRGTY LYRDDPLFPM
PANQELSAQA KPDGAGYKKK QKSATGIQTS VNWAFLKQLS LKLADTSYAA KSASNALASS
SNPAQNDVKK KRVYDYLYGI EWMLNMYYQG ECTDFSFYTY TQGPDMMDFA SIGDEYDVSC
DPLRDLKRAE ASFYNLRPIT PLAYSLAVIP RGGRAQISKN VRQLVDPGSP IRELFALDYC
PQCINHRIHV SPMENALQNS LTAADPGFAE VVPSNAQFSK YSRYTDDEGY IIHPDTGEYM
SMDEMRQEVK ELNRMHLHHL HTAKQHVHTD PICLPTLEAA VARASADNLL TEDEEMLRTL
ASPVLFWRHA VYDPKDLDTR DFASEEELTE WRSKTIPDSQ FEILDKRSVY ELRKFDGDVG
DVMRRWGCDN DAFARFDSEI AEGRTHSRQH TVGKTRSALD ERLQKWRSER RGVGRVDDAK
STSDISIKTN DANGAASSLA PNRAPKPRPG GAGSKRRNFS KSPRAPSAFA RVATAHSRVF
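
The protein sequence can be checked against the gene sequence by translating codon by codon. The record leaves the translation table field blank, so the sketch below assumes the standard genetic code (NCBI translation table 1). The names translate and CODON_TABLE are hypothetical, and unknown codons are reported as X.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first protein block.
print(translate("ATGGGCATCA CTGGGTATAA CAAATACCTC"))  # MGITGYNKYL
```

Translating the full gene sequence under the same assumption should give a 780-residue string, which can be compared with the protein sequence above once its spacing has been stripped.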