Gene OSTLU_51122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_51122 
Symbol 
ID5004623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp218219 
End bp220779 
Gene Length2561 bp 
Protein Length841 aa 
Translation table 
GC content60% 
IMG OID640420044 
Productpredicted protein 
Protein accessionXP_001420784 
Protein GI145352925 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily
[TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit
[TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0345053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAGA CGCTCGCCGA CGTCGCGCGT CCTCGCGCGT GGCGCGCACG CGCGCGCGTC 
CAACGCGCGC ATCACCGTCC TCGCGCAGCC GTCGCGCGCG TCGCCGTCCT CGACGACGTC
GCCGCCTACG CCTTGGTGAC AAAGTCGGCT GATGATTTAC TCGACGACGT CCGACGCCTC
AACCGGGACG CGTCGTCGTC GTCGCCGCGA ACGACGAATG CAAACGTCGT GACGTACTCG
CCCAAGGTGT TCGTTCCGCT CACGCGCGCG TGTCGGGACT CGTGCGGATA CTGCGCGTTC
GTCGACTACG AACCGAGCGC GGCTGGAAAG CGCGTGTACA TGACGCTCGA GGAAATCGTC
GACGTGGCGC GACGAGGCGC GGCGGCGGGG GCGACGGAGT GTTTGTTGAC GTTTGGTGAT
CGACCCGAGG CGACGCGGGA GGATGCGCGA GAGGGATTGA GGGAGTTGGG ATGCGCGAGC
ACGGCGGAGT ACGCGGCAAA GGCGTGCGAG GCGGTGTTGC GAGAGACGGG GTTGTTGCCG
CACGTAAACG CGGGTGTGTT GACGAGAGAC GAGTTGAGGA TGCTACGACG CGTGAGCGCG
TCGCAAGGGT TGATGTTGGA GACGACGAGC GAGCGGTTAT TGGGGCCGGG AATGGCGCAC
GACGGGTGTG AGACAAAGCG ACCGAAGACG CGCCTGCGGT GCATCGAGCT CGCGGGAGAG
GAGCGCATCC CGTTCACGTC TGGATTATTG ATTGGGATCG GCGAGACTCG CGAAGAGCGT
ATCGATGCAC TTCTGGCGCT TCGCGATGTA CATGCCAAGC ACGGACACAT TCAAGAGCTC
ATCATACAGA ATTTCTTATC GAAACCCGGC ACCGCGATGG CTGATTTTCC AAATCCTCCG
CTGGAAGAGT TGACGTGGAC GGTGAGCGCA GCTCGCCTAA TTTTTGGCGC AGACATGATT
ATACAGGCGC CACCGAATCT TACACCAGGC GAAGAGGCTG GCTGGCGCGC CCTTTTGCGC
GCCGGTGCGA ATGATTGGGG AGGAATCTCG CCGGGCGTCA CGCCGGACCA CGTCAACGCC
GAGGCGCCAT GGCCGCACAT AGAAGAGCTC GCCACCGTGT GCGCCGATGA AGGTTTCGCG
CTCGTCCCGA GACTGCCAGT GCACCCTAAG TACTTGAGGG TAGACGATGA TCGAGTGAGC
GTCGGGGGAT CCGCAGTTTG GCTTGACGAC AAAGTTTCGC CGTATCTTCG CAAACTCGCC
GACAGCGAGT TTCTCGTTCG CGGTACGACA TGGTCGCCAG GACGTCCGGA TGATGAAAAG
AAAGAGTTTG TGGATATCGT CGGCGTGAAT GGCTCTGTTC CTTGTCGTGG TACCAAGAGG
CGTATATCGA GCGAAGTTCT GGCCGCCATA GCCGCCATAG TGGACGGAAA CTATGAGTTG
GACGACATCG TGACGTGCTT ACAAGCGAGA GGCGCCGATT TCGACAAGGT GTGCGAGGCC
GCGAATACTT TGCGAGAGCA GCAGTGTGGT GATACCGTTA CGTTTGTGAA CAATAGAAAC
ATTAACTATA CGAATATCTG CACGTTGGCG TGCACGTTTT GTTCGTTTTC CAAGGGAAAG
GCTGCAGAAG AACTTCGCGG TTCGCCGTAC CTGCTCGACT TGGACGAAGT CGCAAGGCGA
ACGGCCGAGG CTTGGGAGCG TGGTGCGAGC GAAGTCTGCA TGCAAGGCGG CATTCATCCC
TCGTTCACAG GCGAAGATTA TATGGCTTTT ATCGGAGCTG CGAAACGAGG GGCACCGGAC
ATACACATTC ACGCCTTTTC ACCGCTCGAA ATCGCTCACG GAGCGCAGAC TCTCGGTCTG
AGCGCTCGCG AATACTTGCG TAAACTCAAG GATGCAGGGC TGGGATCGCT CCCAGGTACT
GCTGCCGAGG TTTTGGACGA CCAAGTTCGC GAAACACTCT GTCCAGATAA ACTCACCGCG
AAAGAATGGC TCGATGTCGT CGAAGACGCT CACTTTGTGG GCGTGCCAAC GACGAGCACC
ATCATGTTCG GTCACATTGA CGCCGACGGC CCGCGCGCGT GGGCGCGACA TCTCGTCTCC
ATTCGCGATT TGCATCTCAA GACGGGTGGA TTCACGGAGT TCGTACCACT ACCTTTCGTG
CATTTCGAGG CGCCGACGTA TCGTTTCGGC GCGTCTCGGA AAGGTCCAAC GCTGCGCGAG
TGCATCCTGA TGCACGCCGT CGCGCGTCTC GTGCTGGGAC CGGCGGGAAT CACGAACATT
CAGGCGAGCT GGGTAAAAAT GGGTCCCGAG CTCGCCTCAC TTCTCCTGCA CGCTGGATGC
AACGATATGG GCGGTACACT CATGAATGAA TCCATCACTC GCGCCGCTGG TGCGACGTTT
GGGCAAGAAA TCGACGCGCG CGAAATGCGT CGAATCATCG AAGCCGCCGG CCGCGTTCCG
CTTCAACGCA CCACCTTGTA CGCTCACGCA CCCCAACATC GCGTCGAGCA CCCATCGATC
GCGTAGGACG CGATCGTCGT AGATTGTATG TAGAGTCACT A
 
Protein sequence
MRETLADVAR PRAWRARARV QRAHHRPRAA VARVAVLDDV AAYALVTKSA DDLLDDVRRL 
NRDASSSSPR TTNANVVTYS PKVFVPLTRA CRDSCGYCAF VDYEPSAAGK RVYMTLEEIV
DVARRGAAAG ATECLLTFGD RPEATREDAR EGLRELGCAS TAEYAAKACE AVLRETGLLP
HVNAGVLTRD ELRMLRRVSA SQGLMLETTS ERLLGPGMAH DGCETKRPKT RLRCIELAGE
ERIPFTSGLL IGIGETREER IDALLALRDV HAKHGHIQEL IIQNFLSKPG TAMADFPNPP
LEELTWTVSA ARLIFGADMI IQAPPNLTPG EEAGWRALLR AGANDWGGIS PGVTPDHVNA
EAPWPHIEEL ATVCADEGFA LVPRLPVHPK YLRVDDDRVS VGGSAVWLDD KVSPYLRKLA
DSEFLVRGTT WSPGRPDDEK KEFVDIVGVN GSVPCRGTKR RISSEVLAAI AAIVDGNYEL
DDIVTCLQAR GADFDKVCEA ANTLREQQCG DTVTFVNNRN INYTNICTLA CTFCSFSKGK
AAEELRGSPY LLDLDEVARR TAEAWERGAS EVCMQGGIHP SFTGEDYMAF IGAAKRGAPD
IHIHAFSPLE IAHGAQTLGL SAREYLRKLK DAGLGSLPGT AAEVLDDQVR ETLCPDKLTA
KEWLDVVEDA HFVGVPTTST IMFGHIDADG PRAWARHLVS IRDLHLKTGG FTEFVPLPFV
HFEAPTYRFG ASRKGPTLRE CILMHAVARL VLGPAGITNI QASWVKMGPE LASLLLHAGC
NDMGGTLMNE SITRAAGATF GQEIDAREMR RIIEAAGRVP LQRTTLYAHA PQHRVEHPSI
A