Gene OSTLU_3954 details


Gene Information

Locus tag: OSTLU_3954
Symbol:
ID: 5001175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 344598
End bp: 345629
Gene Length: 1032 bp
Protein Length: 344 aa
Translation table:
GC content: 53%
IMG OID: 640416596
Product: predicted protein
Protein accession: XP_001417481
Protein GI: 145345991
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5533] Ubiquitin C-terminal hydrolase
TIGRFAM ID:
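
The coordinate and length fields above are consistent with each other, and a short check makes the arithmetic explicit. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon (the record does not state the convention, but this one reproduces the listed gene length). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of complete codons; rejects lengths that are not a multiple of 3."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return length_bp // 3


# Values copied from the record above.
length = gene_length(344598, 345629)
print(length)               # 1032
print(codon_count(length))  # 344
```

Because 1032 bp corresponds to exactly 344 codons, which equals the listed protein length, the annotated span appears to contain no stop codon.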


Plasmid Coverage information

Num covering plasmid clones: 288
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAGACGCTCG GCGACGCGTT CCCACCGTCG GCAAAGTTTT TTGGATTGGA AAACTTTGGC 
AACACGTGCT ACTGTAACTC GGTGCTCCAG GCGCTGTACG CGTGCGATGA GTTTCGAGAA
CGATTGATCG AACATCACGC GGCGGCGAAC GATGGGACGT CGACGAGCGG ACGAGGAAAG
GAGACGCCCG ACAGCATGCT GGCGGCGCTC GGGGATCTGT TTCGAGAGAT TTCGGGACAA
ACGAAACGCA CGGGATACGT CAGCCCGAGG GCGTTCATCG AACGGTTGAG GAAGGATAAC
GTGTTGTTTC GAGGACACAT GCATCAGGAT GCGCACGAGT TTTTAAACTT CTTGCTGAAT
GAGTGTTGCG AAAATTTACA GACGAAGTTG AAGCGAGACG GCGCGTGGGA ACCGGGGAAG
AAGACGTGGA TACACGATGT GTTCGAGGGG AAACTGGCGA ATCAGACGCG GTGTTTATGG
TGTGAGAACA CGACGAATAG AGAGGAGTGC TTTTTGGACC TGTCGGTTGA TGTCGAGCAG
AACACTTCCA TCACGGCGTG CTTGAATAAT TTCAGCGCCA AGGAGTTGTT GGACAAAAAC
GACAAGTTTC AGTGCGATCG ATGCGGTGGG TTACACGAGG CGCAGAAGCG AATGCTGATT
CATGAAGCGC CGAAAGTATT GTCGTTACAC TTGAAGCGGT TCAAGTACAT CGAGGCGCTC
GGCAGGCACG CGAAACTGAA TCATCGCGTG GTGTTCCCTT CCGAATTGAA AATTCCCAAC
TTGATAGACG AAGCGGAGAA TCCCGATGCG AGTTATAAGC TTTTCGCCGT CGTCGTTCAC
ATCGGCTCCG GGCCTAATCA CGGACACTAC GTGTGTTTCG CCAAGAATAA TCATCGCTGG
TTCTTGTACG ATGACGATTG CGTTGAAGTC GTGGATGAAG AGCAGCTTCA ACAAGTCTTT
GGCTCGACGA CGGATGGCGG CTCCGCGGGG AGCGAGCACG GATACATTTT GTTCTACGCC
CGATCTGAAG GT
 
Protein sequence
KTLGDAFPPS AKFFGLENFG NTCYCNSVLQ ALYACDEFRE RLIEHHAAAN DGTSTSGRGK 
ETPDSMLAAL GDLFREISGQ TKRTGYVSPR AFIERLRKDN VLFRGHMHQD AHEFLNFLLN
ECCENLQTKL KRDGAWEPGK KTWIHDVFEG KLANQTRCLW CENTTNREEC FLDLSVDVEQ
NTSITACLNN FSAKELLDKN DKFQCDRCGG LHEAQKRMLI HEAPKVLSLH LKRFKYIEAL
GRHAKLNHRV VFPSELKIPN LIDEAENPDA SYKLFAVVVH IGSGPNHGHY VCFAKNNHRW
FLYDDDCVEV VDEEQLQQVF GSTTDGGSAG SEHGYILFYA RSEG
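
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1), since the Translation table field in the record is empty. The helper names `clean`, `gc_fraction` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; unknown codons become 'X'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First three blocks and the final twelve bases of the gene sequence above.
head = "AAGACGCTCG GCGACGCGTT CCCACCGTCG"
tail = "CGATCTGAAG GT"
print(translate(head))  # KTLGDAFPPS
print(translate(tail))  # RSEG
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above. Running `gc_fraction` on the full gene sequence gives a value that can be compared with the 53% in the GC content field.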