Gene OSTLU_32539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32539 
Symbol 
ID5002958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp214649 
End bp215952 
Gene Length1304 bp 
Protein Length218 aa 
Translation table 
GC content66% 
IMG OID640418379 
Productpredicted protein 
Protein accessionXP_001418872 
Protein GI145348882 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0125] Thymidylate kinase 
TIGRFAM ID[TIGR00041] thymidylate kinase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.743007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.343685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCG CGGGAAAAGT GGCCGCGCGA GGCGCGTTCG TGCTGTTCGA AGGCGCGGAT 
CGATGCGGGA AGTCGACGCA AGCGATGCGG TTGGTGCACA CGCTCGAGGC GCGGGGGGTC
GAGGCGGAGC TGTGGCGGTA CCCCGATCGC GCGACGGCGA TGGGGAAGAT GATCGATCAG
TACTTGCGGT CGAAGAGCGA GATGGAGGAT GGGGCGATAC ACCTGCTGTT CGCGGCGAAT
CGATGGGAGA AGAAGGCGCT CATGGAACGC AAGCTCGCGA GCGGCGTGAC GTTGGTGTGC
GATAGGTACT CGTACAGCGG GTGCGCGTTT ACGGCGGCGA AGGGGGTGGA TGGATTGGAT
TTAGAGTGGT GTCGCGCCCC TGAAGTCGGA TTACCGCGTC CGGACGCGTT GATGTATTTA
GAGTTATCGC TCGAGGACGC GGCGAAGCGC GGCGGCTTCG GCGAGGAGCG ATACGAGACG
ACGGAGATGC AACGCGCCGT CAAGGCGAGT TTCGAAGCGA TGCGAGAAGA TTGGTGGGAC
GTGATCGACG CCAATCGCGA ACCGGACGTG ATTCAGGACG AAGTCTTGCG CATCGCGCTG
AGCGCGGTGG AGAAATGCCG AGCCGGACGC GAGTTGAAGC GGCTGTGGCA GTCGTAGCGC
GCGTGCAACA ACCAAACCAA CCAACTCTGC GCGTTTCCTC ACCCTCGCGC GCGCTCGCAT
GGCTCCGAGC GCCGACGTCG ACGCGTTAGA CGCGCGCGTG ACCGACGAGT TCGCGCGCGC
GTTCGGAGGC GACGCGGTGG CGGACATCGC GGCGCACGGC GAAGAAAAAC TCGCGCGTTG
GCTCGACCTC GTCTTGACAA AGTCGTCGAG CGCGGATGAC GACGACGACG CGAAGAAGGC
TCGAGAGCGC GTCGTCGACG CGACGTACGC GGCGCTGCAA GCGCGCGGCG CGTGGCCGCG
GGAGAGCTGG CGAGATGCGT ACGTCCTGGC GCAGCTTCGA CGGTGCGCGG CGGCGCTGCG
GGACCGGGAC GGCGACGGCG AGACGCGAGC GGAGAGCGCG CGGGAGGCGA TGCTGGCGGT
GGACATGGCC TTGATCGTGG GTGCGCCCGT GGATATGGTG GTTGATTTCG TGCGCGCGTG
CGAGCGGGCG TTGGGGATGG ACGGACGCCG TGAGGTGAGC GCCGGGCACG CGCGAAGCGC
GTCGACGAGC GAGTGGTTGT TCCCGAGCGC GGCGCCGAAA GGCGACCGAG GCGATCATGC
CGCGCAGACG AGGCTCGCGC GAGTGCATCA TCGCGAAATG GACT
 
Protein sequence
MSAAGKVAAR GAFVLFEGAD RCGKSTQAMR LVHTLEARGV EAELWRYPDR ATAMGKMIDQ 
YLRSKSEMED GAIHLLFAAN RWEKKALMER KLASGVTLVC DRYSYSGCAF TAAKGVDGLD
LEWCRAPEVG LPRPDALMYL ELSLEDAAKR GGFGEERYET TEMQRAVKAS FEAMREDWWD
VIDANREPDV IQDEVLRIAL SAVEKCRAGR ELKRLWQS