Gene OSTLU_39993 details

Gene Information

Locus tag: OSTLU_39993
Symbol:
ID: 4999679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 554540
End bp: 556606
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table:
GC content: 53%
IMG OID: 640415100
Product: predicted protein
Protein accession: XP_001415532
Protein GI: 145340853
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5560] Ubiquitin C-terminal hydrolase
TIGRFAM ID:
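
The start, end, gene length, and protein length values listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 554540
end_bp = 556606
gene_length_bp = 2067
protein_length_aa = 688

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene is unspliced (as the record states) and ends in one stop codon.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 2067 689 688
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.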


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000426118
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGACA CGACTTTTGG ATGCGATGTG AGTTTGACGC CAAAGAACGA CGTGACGACT 
TGCGCTGAGG AGGCAAAAGT AGGAACGAGG GGCAAAGCCG GACTGAGCAA TCTCGGGAAT
ACGTGCTTTA TGAATAGCGC ATTGCAGTGT CTCAGTCACT CTTCGCTTTT GACGGATTAC
TTTCTGTCCG ATAAATACGA AGTGGACATT AATACGGACA ATCCCATCGG TATGGGCGGG
GAGCTGGCGA AGGAGTACGC GAATTTGATA GGCGCACTTT GGCGGGACGG CGCGCTCACG
GTCACGCCGC GCAAGTTCAA GTCTTCCTTG GCTCGTTTCG CGCCTCAGTT CAGTGGATAT
ATGCAACAAG ATGCACAAGA GCTGTTGGCG TTTTTACTAG ATGGCTTGCA CGAAGATTTA
AATCGCGTCA AGAACAAACC GTACGCCACA GAGCGAGACG CGGAGGGAAG AAGCGACGAG
GACGTTGCAA ACGAATCTTG GGAAGCACAC ACGGCTCGCA ACAATTCTTG CATCGTCGAT
ACTTTTCAGG GTCAGTATCG ATCGAAGTTG GTTTGCCCGT CATGCAGCAA CAAATCGGTC
AAGTTCGATC CGTTCATGTA CCTTTCCATT CCGGTCCCGT CGGCTCGTGA GCGCATGATT
AAAGTCACAC TCGTATCGTA CGGCGACGAA CTTTCTGCAA TCACGTACGG TTTGAAACTG
CCTAAGAATG GTGAGATTGC CATGTTACTT AGTGCTCTAT GCGAAGCCGC TGATATCGAT
ACGATGGACG AACGTGTGGT ACTTTGCGAA GTGTACAATC ATCGAATGGA AAAGACTCTT
TCGAACATGT CCTACTCACT CACAGACATC AGAGAACGCG ACGTGATATA CGCGCACCGC
TTACCGGCGA TAAAAGACAA CGACAACGTG GAAACAGTCG ACACCGTTCT TGTGCACAGA
AAAGAGTTGA ATCAAAACAA GACGCCGTAT TCGCACGTTA GCTCCGTGGC GGCGACGACG
ATGGTCCGAT TTGGCTTTCC GTGGATCGTT CCAGTGAGCG TGCCAAAAGG CACGAAGGCT
GGTCCCGACC ACGCGAGATT CGTAGAGAAG GAGGTGGAAA AGTTTAGCGC AAAGTTTGCA
CACACAAATT CGATGGAGAA ATCGTGTTCG CCTACCACGA GCGGCGATAC AGAAGGAAGT
GCGTCTCGAC GCGCCGTCGC AGAAAAGGAC TCTCGACTCT TCAAAATGAA GTACACGAAT
AAGAGCGCCT CGGCATCGTT CCACGAGTAT GGTTCAACGA CCGCATCGAC GTCGGACTCG
CACGAAATGC AGTACACCAT CGCCTCCTCC ATGCATTGCG TTGCGATAGA CTGGTCTTCG
AAGGCTCTGA GCCAGTTCTT TGACGAAGAA TTGTTGGAGC GTGAAATCGA GGAGCACCCA
AGCGTGACGG AGAACGCGAT TGAGAACAGC GGGACTCAAG GCACACCACT AGCGTCGTGC
ATCGAGTCTT TCATTCAAGA AGAGCCGCTC GGTAAGGATG ACATGTGGTA TTGCAAGCAG
TGCAAGGACC ACGTCCAGGC GATGAAAAAG CTCGATTTGT GGCGCATGCC GCCGATTCTC
GTCATGCACC TCAAGCGTTT CAGCTACAGT CGAACGTGGC GAGATAAGAT CGATACGTTG
ATTGATTTTC CACTCAACAC GCTTGACATG ACGCCTTACG TACTCCCGAA CGCTTCCAGT
GGGCCGGCGC CGATTTATGA CTTGTACGCG GTGGTGAACC ACTTTGGCGG CATGGGCGGC
GGTCATTACA CCGCGTACAC GAGACACGCC GAGGAGGGCA CGTGGCACTT GTACGACGAT
AGTCGTTGTA CCGCAGTAGA CGTCGGTGCG GCGCTGAACA ACAGCGCGGC TTACGTCTTG
TTCTACAAGC GCCGCGACGT CCCGATGCGC CAAGCCATGT CTCGCGCCGG CTCGCTGTGC
AACATGGCCG CCATGGACAG CGTCGCGAAC ACGCGTACCC CATGCGACGA CGACGACGAC
GAACCTAGAG AAATGGAACT CAACTAG
 
Protein sequence
MEDTTFGCDV SLTPKNDVTT CAEEAKVGTR GKAGLSNLGN TCFMNSALQC LSHSSLLTDY 
FLSDKYEVDI NTDNPIGMGG ELAKEYANLI GALWRDGALT VTPRKFKSSL ARFAPQFSGY
MQQDAQELLA FLLDGLHEDL NRVKNKPYAT ERDAEGRSDE DVANESWEAH TARNNSCIVD
TFQGQYRSKL VCPSCSNKSV KFDPFMYLSI PVPSARERMI KVTLVSYGDE LSAITYGLKL
PKNGEIAMLL SALCEAADID TMDERVVLCE VYNHRMEKTL SNMSYSLTDI RERDVIYAHR
LPAIKDNDNV ETVDTVLVHR KELNQNKTPY SHVSSVAATT MVRFGFPWIV PVSVPKGTKA
GPDHARFVEK EVEKFSAKFA HTNSMEKSCS PTTSGDTEGS ASRRAVAEKD SRLFKMKYTN
KSASASFHEY GSTTASTSDS HEMQYTIASS MHCVAIDWSS KALSQFFDEE LLEREIEEHP
SVTENAIENS GTQGTPLASC IESFIQEEPL GKDDMWYCKQ CKDHVQAMKK LDLWRMPPIL
VMHLKRFSYS RTWRDKIDTL IDFPLNTLDM TPYVLPNASS GPAPIYDLYA VVNHFGGMGG
GHYTAYTRHA EEGTWHLYDD SRCTAVDVGA ALNNSAAYVL FYKRRDVPMR QAMSRAGSLC
NMAAMDSVAN TRTPCDDDDD EPREMELN
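
As a quick consistency check between the two sequences above, the first and last lines of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method. The record leaves the translation table field empty, so the standard genetic code is an assumption here, and the names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Assumption: standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate in frame 0, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and final 27 bases, copied from the gene sequence above.
gene_start = "ATGGAAGACA CGACTTTTGG ATGCGATGTG"
gene_end = "GAACCTAGAG AAATGGAACT CAACTAG"

print(translate(gene_start))  # prints: MEDTTFGCDV
print(translate(gene_end))    # prints: EPREMELN
```

Both results agree with the first ten and last eight residues of the listed protein sequence. The final fragment stays in frame because the 2040 bases preceding it are a multiple of three.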