Gene OSTLU_18471 details

Gene Information

Locus tag: OSTLU_18471
Symbol:
ID: 5006012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 95793
End bp: 96881
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table:
GC content: 68%
IMG OID: 640421433
Product: predicted protein
Protein accession: XP_001421838
Protein GI: 145355166
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
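
The start and end coordinates, gene length, and protein length in this record agree with each other if the coordinates are 1-based and inclusive and the unspliced CDS ends in a single stop codon. The following is a minimal sketch of that arithmetic, plus a GC-percentage helper for sequences printed in space-separated blocks of ten. The function names are illustrative and not part of any database API, and the coordinate convention is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record above.
length_bp = gene_length(95793, 96881)
print(length_bp, protein_length(length_bp))
```

To check the listed GC content, `gc_percent` can be applied to the gene sequence as printed, since the spaces and line breaks between blocks are stripped before counting.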


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.23754
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCCG ACGCGCGCGC GGGCGAAGGC GCGAGACTGT TGGCGGCGTT CGCGACGCGC 
GTCGTCCCGC GATTGGCGCG CGCGCGCGCG CACAAGGGGT CGCACGGCGG GAAGATCGCC
GTCGTCGGTG GGAGCGAACT GTACGCGGGC GCGCCGTACT TCGCGAGCGC GGCGGCGATG
CGCGCGGGGT GCGATCTGTG TCACGTGTTC ACGCACGCGA AATGCGCGCC GGTGATGAAG
GGGTACGGGC CGGACCTGAT CGTGCACGAG GCGTGGTCGC GGGACGCGCG CGAGGCGACG
CGCGGAGCGA AGACGGAGAC AGAGACGGAG AACGAACGAT CGATCGATCT CGTCGAGGCG
TTCGGGAGGT TTAGGATCGA TAACGCGGTG ATCGGACCGG GATTGGGGCG CGGGGCGGCG
CTGGAGGCGG TGGAGGCGCT GAGAGAGGTC GCGGCGGCGT GCGTCGTGGA CGCCGACGGG
TTGAAGGCGC TGGAACCGAC GAGCGCGGAC GAGGACGGCG CGGAGGCGGC GCGAGGGAGA
AATCCGACGG CGCTGGCGAC GCCAAATAAG ATGGAACTGT GGCGATTGGT GCGAAAGGCG
TCGGGGGCGT TCGAGGGGGG GGTGACGACG ATGGATTTGA GCGCGCGCGA GGACAGGGAG
AAAATAGCGA GCGCTCTGCG ACGGTACGCC GGCTATAATT TCCTCGTCAA GGGCGAAGAC
GATTACTTAT TCATTCAACA CTGGGACGTC GCGCCGTCGG TGTGCGACAG CGAGCGCGCG
GCGAGCGGCG ACGCGTCGAT CGTTCGGCTC CGTTTCGACG GCGTCGGCTC GCCGAAACGT
TCCGGCGGTC AGGGCGACAT TCTCGCCGGC GTCCTCGCGG TTTTTCTCCT CTGGTCCCAG
CGCACGGACG CCACGACCGC GAGCAATCGT TTAGACGATT ACGTCGCCGC CGTCGGCGCG
GCGTGTTTCC TCGTGAAAGC CGCCTCGAGC GCGGCGTATC GCGAGTACGG TCGCGGCGCG
CACGCGCAAG ACGTCCTCGC GCGCGTCGCC TCGACGTTCA TGGCGCATTT AGAACCCGAT
CTCTCCTAG
 
Protein sequence
MRADARAGEG ARLLAAFATR VVPRLARARA HKGSHGGKIA VVGGSELYAG APYFASAAAM 
RAGCDLCHVF THAKCAPVMK GYGPDLIVHE AWSRDAREAT RGAKTETETE NERSIDLVEA
FGRFRIDNAV IGPGLGRGAA LEAVEALREV AAACVVDADG LKALEPTSAD EDGAEAARGR
NPTALATPNK MELWRLVRKA SGAFEGGVTT MDLSAREDRE KIASALRRYA GYNFLVKGED
DYLFIQHWDV APSVCDSERA ASGDASIVRL RFDGVGSPKR SGGQGDILAG VLAVFLLWSQ
RTDATTASNR LDDYVAAVGA ACFLVKAASS AAYREYGRGA HAQDVLARVA STFMAHLEPD
LS