Gene OSTLU_34345 details


Gene Information

Locus tag: OSTLU_34345
Symbol:
ID: 5000996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 546997
End bp: 548073
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table:
GC content: 65%
IMG OID: 640416417
Product: predicted protein
Protein accession: XP_001416692
Protein GI: 145344338
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID:
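
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 546997
end_bp = 548073
listed_gene_length = 1077      # bp
listed_protein_length = 358    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "No") and its last
# codon is a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length:", gene_length, "bp")
print("leftover bases after whole codons:", remainder)
print("protein length:", protein_length, "aa")
print("matches record:", gene_length == listed_gene_length
      and protein_length == listed_protein_length)
```

Under these assumptions the computed lengths agree with the listed Gene Length and Protein Length.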


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.190727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCGCG TGTTCGTCCC GGGACGCGTG TGTCTGCTCG GCGAGCACTC GGATTGGGCC 
GGCGCGCGCG CGGCGCGCGA TGGCCCGGGA GCGTGCGTCG TCGTCGGCAC GCGCGAGGGC
GTCAGCGCGC GGTGCGACGT CGGTGAAGGG CCGCGGTTTC GCGTGGTTGG CGCGGACGGC
GACGCGTTCG AGTGCGATGT GAGCGTCGAT GACGCGCTCG AGCGAGAGGC TTCGAGCGGA
GGGTACTGGT CGTACGTCGC CGGGACCGCG CTGGAGGTTT TGAGGCGATT TCCGCGGTGT
CGCGAGCGCG GGCTCGTCGT GGAGACGCTC GAGACGACGC TGCCGACGAG GAAAGGCTTG
AGCTCGTCGG CGTGCGTCTG CGTCCTCGTG GCGCGGTGTT TCGGCGTGGC GTACGAGTTG
GATCTCGAGC TGAAGGATGA GATGGAGTTG GCGTATCGAG GGGAGGCGGT GCACACGCCG
AGCAAGTGTG GAGCGATGGA TCAGGCGTGC GCGTACGGGA GCGAGCGCGT CGTGGCGCTC
ACATTCGACG GCGAAGACGT GGACGTCAGA GCGTGCGAGG TTGACGGTGA AATACACATC
GTCGTGTGTG ATTTGGCGGC ATCGAAGAGT ACGGTGCGGA TTTTAGCCGA TTTGCAAGGA
GCTTTCGACC GAGGGGACGA GGCGCTGCGC TCGGCGCTCG GTGCCCGTAA TCGAGCGCTC
GTGGCCGAAG GATTAGACGC GATCAAACGC GGCGATGCGC GGGCTTTAGG CGCGGTGTAT
ACGCGCGCAC AGACCACATT TGACGAAGCT GCGATCCACA TCTGCCCATC CGAGCTCACG
GCGCCTCGTT TGCGCGAGAC GCTCGCCGCC GTCGCCCACG ACGTTCCGGA AACTGTCTTT
GGCGCGAAAG GCGTCGGAAG CCAAGGCGAC GGTGCCGCGC AATTCGTGGC GGTATCTGAA
GCAGCGGCTA AAACACTCCG ACAGTACTTG CACGACTTTT CGGGCGGTCG GTTTAAAGTA
TTCGACGTCG TTCTGCGAGA CGAAGAGCGC CACGCACGAA CTAGCACACA CAAATAG
 
Protein sequence
MPRVFVPGRV CLLGEHSDWA GARAARDGPG ACVVVGTREG VSARCDVGEG PRFRVVGADG 
DAFECDVSVD DALEREASSG GYWSYVAGTA LEVLRRFPRC RERGLVVETL ETTLPTRKGL
SSSACVCVLV ARCFGVAYEL DLELKDEMEL AYRGEAVHTP SKCGAMDQAC AYGSERVVAL
TFDGEDVDVR ACEVDGEIHI VVCDLAASKS TVRILADLQG AFDRGDEALR SALGARNRAL
VAEGLDAIKR GDARALGAVY TRAQTTFDEA AIHICPSELT APRLRETLAA VAHDVPETVF
GAKGVGSQGD GAAQFVAVSE AAAKTLRQYL HDFSGGRFKV FDVVLRDEER HARTSTHK
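
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be processed, the code below strips the spacing, translates the first line of the gene sequence, and computes its GC fraction. The Translation table field is empty in this record, so the standard genetic code is used here as an assumption; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code; the record leaves "Translation table" blank.
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the listing.
first_line = "ATGCCGCGCG TGTTCGTCCC GGGACGCGTG TGTCTGCTCG GCGAGCACTC GGATTGGGCC"
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

The translated fragment can be compared by eye with the first two blocks of the protein sequence. Pasting the full gene sequence into `first_line` would extend the same check to the whole record.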