Gene OSTLU_30829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30829 
Symbol 
ID5000752 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp813378 
End bp814825 
Gene Length1448 bp 
Protein Length433 aa 
Translation table 
GC content58% 
IMG OID640416173 
Productpredicted protein 
Protein accessionXP_001416771 
Protein GI145344503 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.461496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCTCGACGT TCCAAAATTC CTCTCGACGC CCCCGCCGTC GAGCCTCCCG CCGAGGCGAT 
GTCTCGCCTC GACGTCGATT TCTGCGTCGA CGAACGCGCG CGCTGGCGCT CCATCGACGC
GCTTTTACTT CGCCGTGGGA GATTCACCGG TCCAGATTTT GAGCCGGGAG AAGACGTCGC
GCGCGTGCTC CGCGAGCACG TCCGCGTGCT CGTCGTCGGC GCCGGAGGAC TCGGGTGTGA
ACTGTTGAAA GGGCTCGGTG CGTGACGACG GCGCGAGAGA CGCGCGGACG TTGGTGGTGA
TGAAGGACGC GCGACTGACG AGACGAACGC GACGACGCCG CGCAGCGCTG AGCGGATTCA
CGACTCTGGA CGTGATCGAT ATGGACACGA TCGACGTGAC GAATTTAAAT AGACAGTTTT
TGTTTCGCGC GGAGGACGTG GGGAAGAGTA AGGCGGAGAC GGCGGCGAGA CGAGTGCGGG
AACGCGTGCG CGGGTGCGCG GTGAACGCGC ATCACGGACG AATAGAAGAG AAAGAGGATG
GGTGGTATAA ACAGTTTGAT ATCATCGCTC TGGGATTGGA TTCTCTGGAA GCGCGGGCAT
ACATCAACGC GGTGTGCTGT GGGTTCTTGG ATTACGACGA GGATGGGAAC GTGGATCCGG
CGACGATTAA ACCGCTCGTG GACGGCGGTA CGGAGGGATT CAAGGGACAC GCGCGCGTCA
TCGTTCCGGG AATGACACCG TGTTTCAATT GCACAATGTG GCTGTTCCCT CCGCAAACGA
CGTTTCCGTT GTGCACGCTG GCAGAGACGC CGAGGAACGC AGCGCACTGC ATTGAATACG
CAAAATTAAT TCAGTGGCCG GCGGAGCGAT ACGGGGAGAC GTTTGACGCG GATGTCGTCG
AGCACATGAC GTGGGTGTAC ACGAAAGCGC TCAAACGCGC CGAGACATTT GGTATTCCAG
GCGTAACGTA CGCTCACACG CAAGGTGTGA CGAAGAACAT CATTCCGGCG ATTCCTAGCA
CGAACGCAAT CATAGCCGCG GCGTGCGTCA TCGAAACGTT GAAAATGGCG ACGATGTGCG
CCAAGGGAAT GAACAATTAC ATGATGTACG TGGGCACGGA TGGTGTGTAC TCGCACACTG
TGGAGTACGA ACGCGATCCA TCGTGCGTGG TGTGCTCACC CGGGATCGCT CACGCGTTGA
ACGCGAACGC GACACTCGAA GAATTCATGG CTTCCATCGT CGCCGCGTAT CCAGATTCTC
TCGCCGAACC GAGCGTGAGT TTCGGCGGGA AAAATCTGTA CTTGCGCGGC GTGCTCGAGT
CCGAATTCGC GGAAAACTTG AATAAACCTA TGATTGAGCT CATGAATGGG CGCAAAGAAG
GCTTAGTCGT GGTGAATGAC AAGAAAATGA AGAAGACGTC GATGCGGTTG CGACTGTCGT
TAAAATGA
 
Protein sequence
MSRLDVDFCV DERARWRSID ALLLRRGRFT GPDFEPGEDV ARVLREHVRV LVVGAGGLGC 
ELLKGLALSG FTTLDVIDMD TIDVTNLNRQ FLFRAEDVGK SKAETAARRV RERVRGCAVN
AHHGRIEEKE DGWYKQFDII ALGLDSLEAR AYINAVCCGF LDYDEDGNVD PATIKPLVDG
GTEGFKGHAR VIVPGMTPCF NCTMWLFPPQ TTFPLCTLAE TPRNAAHCIE YAKLIQWPAE
RYGETFDADV VEHMTWVYTK ALKRAETFGI PGVTYAHTQG VTKNIIPAIP STNAIIAAAC
VIETLKMATM CAKGMNNYMM YVGTDGVYSH TVEYERDPSC VVCSPGIAHA LNANATLEEF
MASIVAAYPD SLAEPSVSFG GKNLYLRGVL ESEFAENLNK PMIELMNGRK EGLVVVNDKK
MKKTSMRLRL SLK