Gene OSTLU_39912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39912 
Symbol 
ID4999547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp133130 
End bp134632 
Gene Length1503 bp 
Protein Length500 aa 
Translation table 
GC content55% 
IMG OID640414968 
Productpredicted protein 
Protein accessionXP_001415398 
Protein GI145340576 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.694161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTGC GCGCGGAGGA CGACGATGAT GATCCGTTCA TCGCGCAGTT GAGTCAGATC 
GCGCACGAAC CGCGGGTGCA CGTGGGGGAG GAGAACGCGG CGGCGACGGC GTATTGCGCG
GGGAGACGGT CGCGAGGAAA GACGTTTAAC GATCCCGTGC ACGGACACAT GTACTTTAAT
CCGAAGCTGT GCGATGTGAT CGATACGCCG CAAATGCAAC GGTTGCGAGA GTTGAAGCAG
TTGGGGACGT CGTATTACGT GTTTCCGGGA GCGGGGCATA ACAGGTTCGA GCACTCGTTG
GGAACGTGTC ACCTGGCGAA CACGGTGTTC GAGTCCATCA AGCGCAGCGC GCCCAGGCAC
GGGTTAGGGC TGACTGTGGA GGATAAGTTA TGCGTGCAAC TCGCGGGGCT GTGTCACGAC
ATGGGCCACG GGCCGTTTTC GCACGTGTTC GATAACGAGT TTTTGCCGTT GAGACACGGT
TGGGATCCGA AAGTCGTGGC GCCGTGGAAT CACGAGCGCA TGGGGGTGGA CATGTTTTCT
TGGTGCTTAG ACGATAACCA CATTGATTTA GAGCCTCAAG TCGTGCGGCG CGTGTGCGAT
TTCATCACGA GCAACGAGGA AGGAGCGAAG GAGAAGCGAT TTTTGTTTGA CATCATCGCC
AACAAACAAA ACGGCATCGA CGTGGACAAG TTCGAGTACC TGTTGCGAGA TTCTTACCAG
GCCGGCGTGC GCATGAGCGT GGATACGATG CGATTGACGT CGCACATGAA GGTGATCGAT
GACAGGATTT GCTTCAAGTC GAGTGAGGCG AACAACGTGT ACGCGTTGTT CCACTCTCGA
GCGTCCATGC ACCAGAGCGT GTACACGCAC AAAAAGGCCA AGGCGGTGGA ATATATGGTG
GTCGATGCAT TAGTCGAAGC CGACATCGCG TGGAACGGGC GAATTAGCAA CTCCATTTGG
AGCGTTGAGG ATTTCATCGC GATGGACGAC ACGCTGCTCA AACAGATTGA ATTTTGTGAC
GATCCCGCGC TCGCCAAGGC ACGAGACATC GTGCGACGCA TCCGTCGTCG CGAGTTGTAC
CGATTCGTGA ATGAATACAC CGTGCCCGAG GATCAAGTGG TGGATTTCAA GCCGGTCGAG
GCGAAAGACA TCACGTCATG CCAAGGAACG AACAACATCC CGGGCGGTTT GAAACCAGAC
GACATCATTG TGCAGTGCCT GAAGATTGAT TACGGCCAAA AAGGACACAA AGATCCCGTG
GAGAACGTCA GGTTTTTCCA CTACTGGGAC GACGAAACCT CGTGCAGCAT CGCCAAAGAG
CAAATCAGTT CGCTCTTGCC GCGAAATTTC GTCCATCGCG TCGTTCGCGT CTTCAGTCGC
CGCCGCGAAC CAGAGTACAT CGAAGCCACC GCGCAAGCCT TCTCGAATTT CCAGCGCCGT
CAGCTCGGCA AAGAGGCGCA AATCACCCCG GTGAAGCGCC AGAGATTTTC GAACGATAGC
TAA
 
Protein sequence
MELRAEDDDD DPFIAQLSQI AHEPRVHVGE ENAAATAYCA GRRSRGKTFN DPVHGHMYFN 
PKLCDVIDTP QMQRLRELKQ LGTSYYVFPG AGHNRFEHSL GTCHLANTVF ESIKRSAPRH
GLGLTVEDKL CVQLAGLCHD MGHGPFSHVF DNEFLPLRHG WDPKVVAPWN HERMGVDMFS
WCLDDNHIDL EPQVVRRVCD FITSNEEGAK EKRFLFDIIA NKQNGIDVDK FEYLLRDSYQ
AGVRMSVDTM RLTSHMKVID DRICFKSSEA NNVYALFHSR ASMHQSVYTH KKAKAVEYMV
VDALVEADIA WNGRISNSIW SVEDFIAMDD TLLKQIEFCD DPALAKARDI VRRIRRRELY
RFVNEYTVPE DQVVDFKPVE AKDITSCQGT NNIPGGLKPD DIIVQCLKID YGQKGHKDPV
ENVRFFHYWD DETSCSIAKE QISSLLPRNF VHRVVRVFSR RREPEYIEAT AQAFSNFQRR
QLGKEAQITP VKRQRFSNDS