Gene OSTLU_17535 details


Gene Information

Locus tag: OSTLU_17535
Symbol:
ID: 5004685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 145125
End bp: 146561
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table:
GC content: 62%
IMG OID: 640420106
Product: predicted protein
Protein accession: XP_001420600
Protein GI: 145352542
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
        [COG0352] Thiamine monophosphate synthase
TIGRFAM ID: [TIGR00097] phosphomethylpyrimidine kinase
            [TIGR00693] thiamine-phosphate pyrophosphorylase
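
The start and end coordinates, gene length, and protein length listed above can be checked against one another. The sketch below assumes the coordinates are 1-based and inclusive, that the gene is unspliced (the record says so), and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(145125, 146561)
print(length_bp)                  # 1437
print(protein_length(length_bp))  # 478
```

Both values agree with the Gene Length and Protein Length fields of the record.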


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.274913
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCGCG GTAAAGTCCT CGTCGTCGCC GGCTCCGACT CGGGCGGCGG CGCCGGCGTC 
CAAGCCGACG TCAAGGCGAT CCTCGCCCAC GGCGCCTTCG CCACGACCGC CATCACCGCG
CTCACCGCTC AAAACACCAC GGGCGTGCAC GGCGTCCACG CCGCACCGTT GGAGTTCATC
GAGGCGCAGA TCGAGGCCGT GGTGACGGAT CTACCGCCAG ACGCGTGCAA GACGGGGATG
CTGGCCAACG CGGCGACGAC GCGCGCCGTC GCCGACGCGA TCGAACGACA TCGATTGACC
AACGTCGTCG TGGACACGGT GATGCTGGCG AAAGGCGGCG CGAGCCTGCT CGAAGCGGAG
GCGCTGGAGG TGATGCGGGA TCGATTGGCG CCGCTCGCGA CGGTGATCAC GCCAAACGTC
CCGGAGGCGG CGGCGCTGCT AAACTTGAGC GAGGAAGAGT TCGTGATGGA GACGATGGCG
ACGCGGGCGA AGGAGTTAGG GAAGTTAGGG TGTCAGTGGG TGTTACTTAA GGGTGGGCAC
GTCAAGGATG ACGCGGAGAT GTCGGTGGAT TATTTGTACG AGGCGAACAC GGGGAGGACG
ACGACGTTCT CGAGCGCGAG AATAGATACG AGGCACACTC ACGGGACCGG GTGCACGCTC
GCGAGCTCGA TCGCGGCGTC GTTGGCGCAA AGGTATGACG TTCCTACGGC GGTGCATCGA
GCGAAGAGGT ACATCTCGGA GGCGATTCGA ACGAGTCCGG GGTACGGCGC GGGACACGGG
CCGTTGAATC ATTTGCCGTT TCACGCCGGC GCGGCGGCGC GTGGAAAGCG GTTCGATCCG
CGATGTTTGA AACTTTATCT CGTCAGCAGT GAGGCGTTGA CCATGGATAA GCTTCGACAG
GCGCTCGAGG CGGGAGTGAC GATTGTGCAG ATGCGCGATA AGGATCCCTC GACGAGGGCC
TTGATCGAAC GCGCCAAGGC GATGAAGGCG GCGTGCGATG AATACGGCGT CCCGTTCATC
GTCAACGATC GTGTCGACGT CGCCATCGCG TGTGACGCCG ATGGTGTGCA CTTGGGTCAG
TCAGACATGA CTTGCGCGGA GGCTCGACAA ATTCTTGGTC CGAATAAATG GATCGGAGTG
AGCTGTCGAG AGGTGTCTTT GGCGCGCCAA GCAAGCGCCG ACGACGCCGA TTACATAGGG
TGCGGCGCGT GTTTCGGCAC TAATTCCAAG GGTGACGCTA AAGTCATCGG CTTGGATGGT
GTCGGGAAAG TTATCGCGGT AGCTCGCGAG CTTTCCTTAC CAGTCGTCGC CATCGGGGGC
GTCTCGCTCG AAAACGCCGC GTCTGTTCGA GCGACGGGCG CCGACGGAAT AGCCGTCATT
TCCGCCGTCG CCAACGCGGC GGACGTGAAA AAGGCGGTCC ATAAGTTGCT GCATTAG
 
Protein sequence
MVRGKVLVVA GSDSGGGAGV QADVKAILAH GAFATTAITA LTAQNTTGVH GVHAAPLEFI 
EAQIEAVVTD LPPDACKTGM LANAATTRAV ADAIERHRLT NVVVDTVMLA KGGASLLEAE
ALEVMRDRLA PLATVITPNV PEAAALLNLS EEEFVMETMA TRAKELGKLG CQWVLLKGGH
VKDDAEMSVD YLYEANTGRT TTFSSARIDT RHTHGTGCTL ASSIAASLAQ RYDVPTAVHR
AKRYISEAIR TSPGYGAGHG PLNHLPFHAG AAARGKRFDP RCLKLYLVSS EALTMDKLRQ
ALEAGVTIVQ MRDKDPSTRA LIERAKAMKA ACDEYGVPFI VNDRVDVAIA CDADGVHLGQ
SDMTCAEARQ ILGPNKWIGV SCREVSLARQ ASADDADYIG CGACFGTNSK GDAKVIGLDG
VGKVIAVARE LSLPVVAIGG VSLENAASVR ATGADGIAVI SAVANAADVK KAVHKLLH
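
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that layout, computes a GC percentage, and translates a nucleotide string. The Translation table field of this record is empty, so the use of the standard genetic code here is an assumption. The helper names are hypothetical.

```python
import re

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return re.sub(r"\s+", "", text).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGGTTCGCG GTAAAGTCCT CGTCGTCGCC")
print(translate(first_blocks))  # MVRGKVLVVA
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.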