Gene OSTLU_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3043 
Symbol 
ID5004548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp307167 
End bp308390 
Gene Length1224 bp 
Protein Length408 aa 
Translation table 
GC content66% 
IMG OID640419969 
Productpredicted protein 
Protein accessionXP_001420472 
Protein GI145352265 
COG category[G] Carbohydrate transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5371] Golgi nucleoside diphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.064968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACACGCTACG CCGTCGTCAT CGACGCCGGC TCCACGGGCA CGCGCGTGCA CGTGTTCACG 
TTCTCTCGAT CGGCGTTCGC GAGCGGCGGC GAGGCGCTGC GAGACGAGAC GTTTCGCTCG
ATCGAACCGG GACTGAAGAG CTACGGCGGC GACGCCGAGG CGGCGGCGAC GTCGATCGAG
GCGCTGATCG ACGTGGCGAA AGGAGTCGTG CCGGAGAGCG CGAGACGAGA AACACCGTTT
AGTGTGCGCG CGACGGCGGG ATTGAGACTG ATGCCCGAAG GGCGGGAGGC GGCGGACGCC
ATCGTGGAGG CGGTGCGACG AAAAATCGCG AACGCGGGAT TTCATCCGTC GTCGGCGTCG
TTCGTGAGCA TCATGGACGG CGAAGACGAG GGCGCGCACG CGTGGGTGAG TGTAAATTAT
TTGCTAGGGA ATCTCGGCGG GGCGCCGGAG AAGACTGTGA CGGTGGTAGA TTTAGGCGGC
GGGAGTACGC AAATCGCGTA CGCGGTGGGC GGGGGCGCGG CGAAGGACGC GCCGAAAGGG
TACGTGCGCG ACATCGAGGC GGCGTCGACG ACGTATAGGA CATACGTGCA TTCGTTTAAG
GGCTACGGTA TCGTCGCCGT ACGGCCGAAG ATATTTAGCG TGGGGAAGAA TAAAGACGGT
TCGCATCCGT GCTTACCGAA CGCGTTCGCG GATTCGTGCG AAAAAGATTG CTACGGGCTC
GAGCCTGGGG AGACGTACGC CGCCATCGGA TCCAGCGACG GCAGCGACTT TACACGGTGT
CTGCTCGCCA CGACGCAGGC GCTCGAGGGA AATTGCGCGA AAGCACCGTG TTCGTTCGCC
GGCGCTTGGA CGACGCCGCG CAAAACGCCC CTCTTCGTCA TGTCCTTCAT CGTCGAACGC
GCGATTCAAG GCGGCGCGGT GCCGCCGCCG AGGCGCCCGA CCGATATCGC GACCATGACA
CCGCGCGACG TGAAGCGAGC CGCCCTTCGC GCGTGCTCCA CGCCCGCCGC CGAGCTCGAG
GCTCGCTTCC CCGTCGCCGC GCGCGACGCC GTCGACGTCA ACTACCTCTG CCTCGACCTC
GTCTACGTGT ACGCCCTCCT CACCGTCGGT CACGGCGCCG CGGACGACGA GACGATTCGC
GCGCTCGACA AGATTCGTTA CCGGCGTCGA GACGTCGAGG CGAGCTGGGC GTTGGGCGAC
GGCATCGCCG CCGCCGCCGC CGCG
 
Protein sequence
TRYAVVIDAG STGTRVHVFT FSRSAFASGG EALRDETFRS IEPGLKSYGG DAEAAATSIE 
ALIDVAKGVV PESARRETPF SVRATAGLRL MPEGREAADA IVEAVRRKIA NAGFHPSSAS
FVSIMDGEDE GAHAWVSVNY LLGNLGGAPE KTVTVVDLGG GSTQIAYAVG GGAAKDAPKG
YVRDIEAAST TYRTYVHSFK GYGIVAVRPK IFSVGKNKDG SHPCLPNAFA DSCEKDCYGL
EPGETYAAIG SSDGSDFTRC LLATTQALEG NCAKAPCSFA GAWTTPRKTP LFVMSFIVER
AIQGGAVPPP RRPTDIATMT PRDVKRAALR ACSTPAAELE ARFPVAARDA VDVNYLCLDL
VYVYALLTVG HGAADDETIR ALDKIRYRRR DVEASWALGD GIAAAAAA