Gene OSTLU_44068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_44068 
Symbol 
ID5004423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp502172 
End bp503407 
Gene Length1236 bp 
Protein Length411 aa 
Translation table 
GC content56% 
IMG OID640419844 
Productpredicted protein 
Protein accessionXP_001420362 
Protein GI145352030 
COG category[R] General function prediction only 
COG ID[COG0319] Predicted metal-dependent hydrolase
[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR00043] metalloprotein, YbeY/UPF0054 family
[TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily
[TIGR01484] HAD-superfamily hydrolase, subfamily IIB 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00247631 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCACT GCGAGTTGAG CGTGGCGCTG TGCTCGGATG AGTACATTCG AAGCTTGAAC 
GCGGGGTATA GAGAGAAAGA TAGCGCGACG GATGTTTTGA GCTTTCCTGC GGAGAGTTTC
GGGCCGATGG CAGTGCTCGG GGACGTCATC GTGAGCGTGG ACACGGCGAG CGCCCAGGCG
CGGGAGGTGG GGCATTCTTT GCGGGATGAG TGCCGAGTTT TGCTCGTGCA CGGGACTTTA
CATTTATTAG GTATGGATCA CGAAGTCAGC GAAAGCGAGG CGGAGGTAAT GGCGGCGGCG
GAGCAAGAGG TCTTGAAGGC GCTCGGATGG AAAGTCACCG GGCTCACGAG ACGCGCGTCG
GGAGAATCGA CGTCCGACTC TTCTTCGACG ACGCTCACGA CACAGAGAAG TGTGCTCGTG
ACGGATTTAG ACGGCACGCT ATTAAATGAA AATAGTGTCA TCACGCCTCG AGTCGCCGAT
GCTTTGCGCC GGGCGATGGC GTCGGGAGTC GAAGTTGTCG TCGCCACGGG CAAGGCGAGA
CCGGCGGCGA TTAAAGCCGC CGCCACGCAA GGATTAGACG GCATTATCGT CGGTAAGAAC
ACACCTGGGG TGTTCTTACA AGGTCTAGAA GTGTACGGTC GAGGCGGTGC TCTGGTCTAT
GAAGCGAAAA TGCCCGAAGA CGTCACGAGA GATGCCTTCA TGATGATGGA TGACGTCGTG
CACGACGGAT TGGCGCTCAC GGCGTTTTGT GGCGACAATT GCGCGACGCT TGCGCCGAGC
GTACTCTTGG ACGAGCTCCA CCACACCTAT CACGAACCAG CCAGCGAAAT CGCCGGATCG
GTGGATGAGA TACTATCCAA TAACACCGTT CGTAAACTAT TATTAATGGG ACCGAGCAAA
GAGAGCATTG ACGGCGTTCG ATCGATTTGG GAAGCCGCAT TCAGGGGTCG AGCGGAGGTC
ACACAAGCGG TGGCGGATAT GCTAGAAATA TTACCCCTTG GGAACGATAA GTCAAAGGGC
GTTCGAGCCG TGTTGAAATC CATGGATGTG AATCCGATGA CGGACGTTGT CGCCATCGGC
GACGGCGAAA ACGATGCCGA AATGCTTCGA TTCGTCGGTT GCGGCGTCGC CATGGCGAAC
GCCACGGAAA AGACAAAAAG CGGTGCCGCA CACGTCCTCG ATGCTTCAAA CACGCAAGAC
GGTGTCGCGG AAGCGATTGA TAGGTTTGTT TTGTAA
 
Protein sequence
MSHCELSVAL CSDEYIRSLN AGYREKDSAT DVLSFPAESF GPMAVLGDVI VSVDTASAQA 
REVGHSLRDE CRVLLVHGTL HLLGMDHEVS ESEAEVMAAA EQEVLKALGW KVTGLTRRAS
GESTSDSSST TLTTQRSVLV TDLDGTLLNE NSVITPRVAD ALRRAMASGV EVVVATGKAR
PAAIKAAATQ GLDGIIVGKN TPGVFLQGLE VYGRGGALVY EAKMPEDVTR DAFMMMDDVV
HDGLALTAFC GDNCATLAPS VLLDELHHTY HEPASEIAGS VDEILSNNTV RKLLLMGPSK
ESIDGVRSIW EAAFRGRAEV TQAVADMLEI LPLGNDKSKG VRAVLKSMDV NPMTDVVAIG
DGENDAEMLR FVGCGVAMAN ATEKTKSGAA HVLDASNTQD GVAEAIDRFV L