Gene OSTLU_52147 details


Gene Information

Locus tag: OSTLU_52147
Symbol:
ID: 5006846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 21198
End bp: 22465
Gene Length: 1268 bp
Protein Length: 406 aa
Translation table:
GC content: 47%
IMG OID: 640422267
Product: predicted protein
Protein accession: XP_001422878
Protein GI: 145357341
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5021] Ubiquitin-protein ligase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAGC AACGCCCGCT GTTGCTTTCC GGACCCATGA AAATCCTGTT GAGCAATCCA 
CGCTTGCTGG ACTTCTCCGT CAAACGTGCG GAGATTCGGA CACGAATCAA GAAACTTCGC
GAACGCCTAG GACATAATCG TCCAGAGGCG CGAACGTTAC ACATTAGGCG TGATCGAATA
CTCGAAGACT CGTTTAGACA ACTCAACAGC CGGAGTATCG AAGAAATTCG AGGCAAAATC
AGCATCGTTT TCGTGGGCGA AGAAGGCATG GACGGTGGCG GTTTGATAAA GGAGTGGTTC
ACCATCTTGG CACGAGAAGT TTTCAATCCA AACATCGCTC TCTTCGAGTT GTCTCACGAC
AAGGGATGCT ATCAGCCGAA TCAAAACAGT GTGGTCCATC CGGATTATCT CAGCTATTTT
AGATTCGTCG GTAGACTCGT CGGTAAGGCT TTGTTCGACG ACATTCTCCT CAACGCATAC
TTCACGCGTC CGATTTACAA GCACCTTCTC GGTCAGCAGC TCACATACGA AGACATGGAA
GGTGTAGATC CAGATTATTA CAAGAGCTTG AAATGGATGC TGGAGAACTC TGTGGAGGGT
GTCATGGAAT ACACATTCAG CGACACAACG TCTTATTTTG GTGAAACTCA AGTTCACGAT
TTGACCGAAA ACGGACGAAA TATCGCAGTG ACAGATGCAA ACAAGTTTGA ATACGTCAAC
CTGATAACCG CGCACCGAAT GACGAATGCG GTGAAGGACC AACTCGCTGC TCTCGTGAAG
GGGTTTGAAG AAGTTGTCCC TAGAGAAACG ATTTCCATCC TGAATGCGTC TGAATTGGAA
CTGCTCATAA GTGGTACCCC GGACATCGAC GTCGAGGATT TACGCGCCAA TACTGAATAC
ACCGGCTTCA CCGTCGGGTC AAAACAAATT CAATGGTTTT GGGACGTCGT GAGGGAAATG
AACAAGGAAG ACTTGGCGCG CTTATTGATG TTTTGTACCG GTACCTCTAA GGTTCCTTTG
GATGGATTCG GTGCTTTGCA AGGCATGCAA GGCCCGCAAC GTTTTCAAAT CCATCGGCAG
CACGCGGATG ATTCAAAGTT GCCATCCGCA CACACGTGCT TCAATCAACT CGATTTGCAC
GAATACAGCT CAAAGCAAAT CTTACGCGAC AGGCTGCTGT ACGCGATTGT TGAAGGTTGT
GAAGGCTTTG GCTTCATTTA GATTAGCGAT TAACATAGAA TGTAACAACA CTAAACATGG
AATTTCAA
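
The listed gene length and GC content can be cross-checked against the coordinates and the sequence above. The following is a minimal Python sketch, not the database's own method. The helper names (`span_length`, `clean_sequence`, `gc_fraction`) are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which is consistent with 22465 - 21198 + 1 = 1268.

```python
def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout; uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record (Start bp / End bp).
    print(span_length(21198, 22465))

    # First two 10-base blocks of the gene sequence, used as a short demo input.
    first_blocks = clean_sequence("ATGCTGAAGC AACGCCCGCT")
    print(len(first_blocks), gc_fraction(first_blocks))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value to compare with the listed GC content of 47%.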
 
Protein sequence
MLKQRPLLLS GPMKILLSNP RLLDFSVKRA EIRTRIKKLR ERLGHNRPEA RTLHIRRDRI 
LEDSFRQLNS RSIEEIRGKI SIVFVGEEGM DGGGLIKEWF TILAREVFNP NIALFELSHD
KGCYQPNQNS VVHPDYLSYF RFVGRLVGKA LFDDILLNAY FTRPIYKHLL GQQLTYEDME
GVDPDYYKSL KWMLENSVEG VMEYTFSDTT SYFGETQVHD LTENGRNIAV TDANKFEYVN
LITAHRMTNA VKDQLAALVK GFEEVVPRET ISILNASELE LLISGTPDID VEDLRANTEY
TGFTVGSKQI QWFWDVVREM NKEDLARLLM FCTGTSKVPL DGFGALQGMQ GPQRFQIHRQ
HADDSKLPSA HTCFNQLDLH EYSSKQILRD RLLYAIVEGC EGFGFI
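
The protein sequence can be compared with a translation of the gene sequence. The sketch below is illustrative only: the record leaves the Translation table field blank, so using the standard genetic code is an assumption, and the names `CODON_TABLE` and `translate` are hypothetical rather than part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed; the record does not state a translation table).
# Amino acids are listed for codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first in-frame stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base block layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence in the record.
    print(translate("ATGCTGAAGC AACGCCCGCT G"))
```

Because translation stops at the first in-frame stop codon, any bases that follow it in the listed gene sequence do not contribute to the protein.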