Gene OSTLU_52106 details


Gene Information

Locus tag: OSTLU_52106
Symbol:
ID: 5006979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 202897
End bp: 204870
Gene Length: 1974 bp
Protein Length: 626 aa
Translation table:
GC content: 59%
IMG OID: 640422400
Product: predicted protein
Protein accession: XP_001422838
Protein GI: 145357260
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4284] UDP-glucose pyrophosphorylase
TIGRFAM ID:
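
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the coding region is one codon per residue plus a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 202897
end_bp = 204870
protein_length_aa = 626

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding region = one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)                     # 1974
print(coding_length_bp)                   # 1881
print(gene_length_bp - coding_length_bp)  # 93
```

Under these assumptions the reported gene length of 1974 bp matches the coordinates, and the listed gene sequence extends 93 bp beyond the stop codon of the 626-residue protein.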


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.222243
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGACG CGCGCGCGAG CGCGACGGCC GTCGTCGACG CGCACGTCGC GCGCGGCGCG 
CTGACGACGG ACGACGCGCG GACGCTGCGC GAAACGATCG CGCTGGGACA AGCGCATCTG
ATCGCGGATT GGCCGGCGCC GGGCGTGGAC GACGAGAGGA AGCGCGCGTT CGTCGAGGAA
GTGCGACGGG CGGATCGGGG GTACCCGGGG GGGGTGGCGA AGTACGTGTC GAACGCGCGC
GAGCTGCTGA GGGCGTCGAA GGAGGGGAAG AATCCGTTCG AGGGATGGAC GCCGAGCGTG
CCCACGGGGA AGACGGTGGA GTACGGATCG GCGGCGCACG AGATTCTGGA GAAGATTGGG
ATGCGGGAGA CGGCGGAGAC GTGCTTCGTG CTCGTCGCGG GGGGGTTGGG AGAGCGATTG
GGGTACTCGG GGATCAAGGT CGCGCTGCCG GTGGAGCGGG CGACGAACGC GTGCTATTTG
GAGTTGTACG TGAAGAATAT CTTGGCGATG GAGAAACGCG CGGAGGGTGC GGAGGGTGCG
ACGAACGCGG GTGGGTGCGG GTGCTTCGGC GGTGGCGGCG CGAAGGCGAA ATCGTCCACG
AAGATTCCGT TGGCGATCAT GACGTCGGAG GACACGCACG CGCTGACGCT CGATTTGCTC
GAACGCAACG ATTACTTTGG CGCGTCTCGC GATCAAATCA CGCTCATGAA GCAAGAAAAG
GTGCCGTGCT TGATGGATAA CGATGCACGT TTGGCGGTGA AAGACGACGA TCCTTACAAG
CTCGCGCTCA AGCCGCACGG CCACGGTGAC GTGCACTCTC TCCTGCACAC GAGCGGGTTG
TTATCAAAGT GGATGAGCCA AGGCAAGAAG TGGGTCGTCT TCTTTCAAGA CACGAACTCG
CTCGTGTTCC GCGTCATCCC TGGTGCGCTC GGGGTGTCGA AGACGATGAA TCTTGAGTTC
AATTCTTTGT GCGTTCCGCG AAAAGCCAAG GAAGCGGTCG GTGCGATTTC GTTGCTAACT
CACGAGGATG GACGCAAGAT GACCATCAAC GTCGAATACA ACCAGCTCGA TCCGCTTTTG
CGAGCCACTA CGAATCCCGA AGGCGACGTC AACGATGCCA CGGGCTTCTC CCCATTCCCG
GGTAACATCA ATCAGCTCAT CGTGAGTCTT CCAGAGTACG CAAAACAACT CAAGAAGACT
GGCGGCGCGA TCGAAGAATT CGTCAATCCC AAGTACAAGG ATGAGACAAA GACGGCTTTC
AAATCGCCGA CGCGATTGGA GTGCATGATG CAAGACTATC CCAAGAGCCT CGGATCGAAG
GCTAAGGTTG GTTTCACCGT CTTTGCGAAC TGGATTGGCT ACAGCCCGGT GAAGAACTCT
CCGGCGGACG GTTTGGCCAA GTTCAAATCT AACGGCCCGA CGCACACGGC GACGAGCGGC
GAGTTTGAGT TTTACGAATC GTGCGCAAAC TTGTTGCGTT TGGCCGGTGC CGACGTCCCC
GCCGCCGCCG TCGACGCTGA ATTCAACGGT ATGAAGCTTC CCATGGGTCC ACGTGTTGTG
CTCGGTCCGG ATGTCGCCAC ATCTTTTGAT GAACTCAAGT CGAAAGTCGG CGCCGTCAAG
TTGGGCGCGA AGAGCGCGCT CGTCGTCGAA GGCTCTGGCG TCAATTTGAA AAACGTCGAA
GTGGATGGTG CGCTCGTCAT CAAGGCGTGC GAGGGCGCGG AAGTCATCGT CGATGGTTTG
AAAGTGACGA ACAAGGGTTG GCAGTGGAAG CCGACCGGCA AAGGTGCGCC CGAAGTCGAC
GCGCTCGCGG GATTCGTCGT GAAGAAAAAC GAAACGGCCG AGTACGTCTT CGACAAGCCT
GGCAAGTACA CGCTCCCGTA AGCGTTTCTC TTCTCCCTCG TCCTAGTAAA AACCACCAGC
GACGCAAAAA CACGCCATTT GAATTGAAAT AACAACACGA ATGAATACTT TGTG
 
Protein sequence
MDDARASATA VVDAHVARGA LTTDDARTLR ETIALGQAHL IADWPAPGVD DERKRAFVEE 
VRRADRGYPG GVAKYVSNAR ELLRASKEGK NPFEGWTPSV PTGKTVEYGS AAHEILEKIG
MRETAETCFV LVAGGLGERL GYSGIKVALP VERATNACYL ELYVKNILAM EKRAEGAEGA
TNAGGCGCFG GGGAKAKSST KIPLAIMTSE DTHALTLDLL ERNDYFGASR DQITLMKQEK
VPCLMDNDAR LAVKDDDPYK LALKPHGHGD VHSLLHTSGL LSKWMSQGKK WVVFFQDTNS
LVFRVIPGAL GVSKTMNLEF NSLCVPRKAK EAVGAISLLT HEDGRKMTIN VEYNQLDPLL
RATTNPEGDV NDATGFSPFP GNINQLIVSL PEYAKQLKKT GGAIEEFVNP KYKDETKTAF
KSPTRLECMM QDYPKSLGSK AKVGFTVFAN WIGYSPVKNS PADGLAKFKS NGPTHTATSG
EFEFYESCAN LLRLAGADVP AAAVDAEFNG MKLPMGPRVV LGPDVATSFD ELKSKVGAVK
LGAKSALVVE GSGVNLKNVE VDGALVIKAC EGAEVIVDGL KVTNKGWQWK PTGKGAPEVD
ALAGFVVKKN ETAEYVFDKP GKYTLP
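
The sequences above are printed in 10-character blocks, so they need to be joined before any analysis. The following sketch shows one possible way to do that, compute a GC fraction, and translate a DNA string. It assumes the standard genetic code (the record's translation table field is empty), and the helper names `clean_blocks`, `gc_fraction`, and `translate` are hypothetical. Only two short excerpts of the gene sequence are used: its first line and the in-frame start of its last coding line.

```python
from itertools import product

# Assumption: standard genetic code; the record leaves the
# translation table field empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_blocks(text):
    """Join a sequence printed in spaced blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in this record.
first_line = clean_blocks(
    "ATGGACGACG CGCGCGCGAG CGCGACGGCC GTCGTCGACG CGCACGTCGC GCGCGGCGCG"
)
print(len(first_line))        # 60
print(translate(first_line))  # MDDARASATAVVDAHVARGA

# Start of the last coding line, which begins on a codon boundary.
tail = clean_blocks("GGCAAGTACA CGCTCCCGTA AGCGTTTCTC")
print(translate(tail))        # GKYTLP

# GC fraction of the excerpt only, not of the whole gene.
print(round(gc_fraction(first_line), 2))
```

The two translated excerpts agree with the first 20 and last 6 residues of the protein sequence listed above. Running `gc_fraction` on the full joined gene sequence would be the way to compare against the GC content field.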