Gene OSTLU_35452 details


Gene Information

Locus tag: OSTLU_35452
Symbol:
ID: 5002695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 763644
End bp: 766673
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table:
GC content: 58%
IMG OID: 640418116
Product: predicted protein
Protein accession: XP_001418796
Protein GI: 145348729
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID: [TIGR01408] ubiquitin-activating enzyme E1
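
The coordinate and length fields in this record can be cross-checked against one another. The sketch below is only an illustration, and it assumes the start and end positions are 1-based and inclusive, which the record does not state. It also relies on the gene being unspliced, as listed, and on the coding sequence ending in a single stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 763644
end_bp = 766673
gene_length_bp = 3030
protein_length_aa = 1009

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the codon count.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the listed 3030 bp and 1009 aa agree with the coordinates: 3030 bp is 1010 codons, one of which is the stop codon.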


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATCG ACGAGGACCT GCACTCGCGA CAGCTCGCGG TGTACGGACG AGAGAGCTTT 
CGAAAACTCG CGAGCGCGCG GGTGTTGGTG ATCGGGGCGC GAGGTTTAGG GTGCGAGATC
GCGAAGAATG TCGTCTTGGC GGGCGTGCGC GCGGTGAGCG TGTGCGACTC GGGGGCGTGC
GAGGCGGCGG ACGCGAGCGC GCAGTTTTAC GTGGACGAAG CGAGCGTGAA GGCAAACGTG
ACGCGCGCGC GGGCGAGCGT GGGGAAGTTG CAGGAGCTGA ATCCGGCGGT GGAGGTGAAC
TGCGTGGAGA CTTGCGACGA AGACGCCGTC AAGGCGCACT CCGTGGTGGT GTGCGCCGGG
GAGACGTCCG AGGCGGAGGC GGTGGCGATT AACGCCATGT GCCGGGCGAA TAACGTGGCG
TTTATTAAGA CGGACGTGCG AGGGGTGTTC GGAAACGTGT TTTGCGACTT TGGTGACGCG
TTCAACGTCT TGGACGTGGA CGGCGAGGAG GCGCTGTCGT GCATCGTGGC GAGCGTGTCA
AACGATTCTC CGGCGCTCGT CACGTGCATC GAAGACGAAC GCGTGGAGTT GCAAGACGGC
CAGCGAGTGA CGTTTAGCGA GGTGCGAGGG ATGACTGAAC TGAACGGGCT GTCGGTCGTG
GTTAAAAACG TGAAGAAGCA TTCGTTTGAG CTCGACTTGG ACACGAGCGC GTTTTCGCCC
TACGTCGGCG GCGGCATCGC GACGCAAGTC AAAGAAACGA AGACGCTCAA GTTTGCCTCG
TACGCGGACT CGCTCGAGTC CCCAGGGGAC TTTTTGTTGA GCGATTTCGC CAAGATGGAA
CGCTCGCCCC AGTTGCACTT GGCCTTCGGC GCGCTCGACG CGTACGTCGC CAAGCACGGC
GCGTCGCCGA CGCCGGGTTC CGACTCGGAT GCTGAAAAAT TCGTCGCCGA AGCGGAAGCG
TTGAACGCGA CGCGTAAAGC GGTGGATGAA GTTGACAAGG ACTTGTTAAA GACGTTTTCG
AAGACGTGCC GAGGTCACGT CTCGCCCATG GCGGCGATGT TTGGCGGCAT CGTCGGCCAA
GAAGTCGTCA AGGCGTGCAC GGGCAAGTTC CACCCGTTGT TCCAATGGTT TTACTTTGAT
TCCGTCGAAA GCTTGCCCGA GACGTTGACC GAAGAAGACC TCGCGCCGCG AGGTGATCGC
TACGACGGTC AAGTCATGTG CTTCGGGACG AAGATGCAGG ACAAAATTCT CAGTCAAAAG
ATTTTCCTCG TCGGCGCCGG CGCGCTCGGT TGCGAGTTTT TGAAAAACTT TGCGTGCATG
GGGTTGTCGT GCGGTCCGAG CGGTGGTGTG ACGGTGACGG ACGACGACGT TATTGAGAAG
TCAAACTTGT CCCGTCAATT CTTGTTCCGC GACTGGAACA TCGGTCAAGG CAAGAGCGTG
TGCGCCTCGA ACGCGGCCAA GGTGATCAAT CCGAACCTCA ACGTCACCGC GCTCGAGAAC
CGCGTGAGCC CGGACACGGA GGACGTTTTC GACGATGGAT TCTGGGAAGG CCTGGATGTG
GTCGTGAACG CTCTGGATAA CGTAAACGCG CGGTTGTACG TAGACAGTCG ATGCGTGTAC
TTCCAAAAGC CGCTGCTCGA GAGCGGGACT CTCGGCACGA AGTGCAACAC TCAAATGGTC
ATTCCGAACA TGACAGAGAA CTATGGTGCT TCTCGTGACC CTCCGGAGAA GAGCGCGCCG
ATGTGCACGC TGCACTCGTT CCCGCACAAC ATCGATCACT GCTTGACGTG GGCGCGAAGC
GAATTCGAAG GTGCATTCGA GAAGGCTCCC GCCGAGGCCA ACTCTTATTT GTCCAAGCCA
GAGGAATACG CCGCGGCGGC GCTGTCGAAC CCCGATGCTT CCGCGCGAGA GAATGTCGAA
AAGGTTGCGC AAGTGTTGTT GAAGACGGCA TGCTCCACGT ATGACGAATG CATCGCTTGG
GCGCGCACGC AGTTCCAAGA GCAATTCCAC GACAAGATTT TACAGCTCAC GTTTACGTTT
CCCGAAGACG CCGTCACGTC GACGGGTTCA CCTTTCTGGA GCGCACCGAA GCGTTTCCCG
CGACCAGTCA TATTTTCCAC CTCGGACGCT TCGCACATGA CGCTCATCCG CGCCATGGCG
AACCTCAAAG CGGAGCTCTC TGGGATCGCG CGACCGGCGG CGGGAGTCAA CGACGACGCC
GCGCTCGTGC AGCTCGTCGA CAAGGTGGCC GTCGCTCCTT TCGAACCGAA GAAGGGCATC
AAGATCGAGA CCGACCCCAA GGCGAACACC GCCGCTTCGA GCATTCCTGA AGGTATCGAC
GACGAGGCTG TGATCAAGGA CGTGTTGGCC AAGCTCGAAA CGAAGCGAGC GGGCTTGGGA
GGAGATTACA GACTCAACGT CATCGAGTTT GAGAAGGACG ACGACACAAA CTTTCACATG
GACGCCATCG CTGGTCTTTC CAACATGCGT GCGCGCAACT ATGACATCGG TGAGGTCGAT
AAACTCAAAG CAAAGTTTAT CGCGGGGAGA ATTATTCCAG CCATCGCGAC GACGACGGCG
ATGGCGACGG GTTTGGTGTG CCTCGAATTG TACAAGGTGT TCAAAGGCGC GAAGATTGAG
GCGTATCGCA ACACGTTCGC CAACCTCGCG CTCCCGCTGT TCGCCATGGC GGAGCCCATC
GCGGCCAAGC AAGACAAATT CAAAGACTTG TCGTGGAGCA TGTGGGACCG ATGGATCTTG
GAGGGCGATT TCACGGTTCA ACAAGTCTTG GACCACTTCG AGGCCAAGGG CCTGATCGCG
TACTCCATGT CCGTCGGCGC GAGTTTGGTT TATAACAATA TTTTCCCCAA ACACAAGGAG
CGTTTGAACC AAAAACTCAG CGAGTTGGTG CAAACCGTGG CGAAGATGGA AATTCCCGCC
AAGCGTCGAC ACTTCGACAT CGTCGTCGCG TGCGAAGACG ACGAAGGCGA AGACGTCGAC
ATCCCGATGG TGTCCATTCG CTTTAGATGA
 
Protein sequence
MEIDEDLHSR QLAVYGRESF RKLASARVLV IGARGLGCEI AKNVVLAGVR AVSVCDSGAC 
EAADASAQFY VDEASVKANV TRARASVGKL QELNPAVEVN CVETCDEDAV KAHSVVVCAG
ETSEAEAVAI NAMCRANNVA FIKTDVRGVF GNVFCDFGDA FNVLDVDGEE ALSCIVASVS
NDSPALVTCI EDERVELQDG QRVTFSEVRG MTELNGLSVV VKNVKKHSFE LDLDTSAFSP
YVGGGIATQV KETKTLKFAS YADSLESPGD FLLSDFAKME RSPQLHLAFG ALDAYVAKHG
ASPTPGSDSD AEKFVAEAEA LNATRKAVDE VDKDLLKTFS KTCRGHVSPM AAMFGGIVGQ
EVVKACTGKF HPLFQWFYFD SVESLPETLT EEDLAPRGDR YDGQVMCFGT KMQDKILSQK
IFLVGAGALG CEFLKNFACM GLSCGPSGGV TVTDDDVIEK SNLSRQFLFR DWNIGQGKSV
CASNAAKVIN PNLNVTALEN RVSPDTEDVF DDGFWEGLDV VVNALDNVNA RLYVDSRCVY
FQKPLLESGT LGTKCNTQMV IPNMTENYGA SRDPPEKSAP MCTLHSFPHN IDHCLTWARS
EFEGAFEKAP AEANSYLSKP EEYAAAALSN PDASARENVE KVAQVLLKTA CSTYDECIAW
ARTQFQEQFH DKILQLTFTF PEDAVTSTGS PFWSAPKRFP RPVIFSTSDA SHMTLIRAMA
NLKAELSGIA RPAAGVNDDA ALVQLVDKVA VAPFEPKKGI KIETDPKANT AASSIPEGID
DEAVIKDVLA KLETKRAGLG GDYRLNVIEF EKDDDTNFHM DAIAGLSNMR ARNYDIGEVD
KLKAKFIAGR IIPAIATTTA MATGLVCLEL YKVFKGAKIE AYRNTFANLA LPLFAMAEPI
AAKQDKFKDL SWSMWDRWIL EGDFTVQQVL DHFEAKGLIA YSMSVGASLV YNNIFPKHKE
RLNQKLSELV QTVAKMEIPA KRRHFDIVVA CEDDEGEDVD IPMVSIRFR
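
Both sequences above are displayed in blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, together with a GC-percentage helper of the kind that could be compared with the GC content field. The function names `join_blocks` and `gc_percent` are hypothetical, and how the database itself computes or rounds GC content is not stated in this record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = join_blocks(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, used only as a short demo input.
first_line = "ATGGAGATCG ACGAGGACCT GCACTCGCGA CAGCTCGCGG TGTACGGACG AGAGAGCTTT"

print(len(join_blocks(first_line)))  # 60 bases per display line
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` gives a value that can be set against the 58% listed in the record.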