Gene OSTLU_41898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41898 
Symbol 
ID5005224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp506005 
End bp507225 
Gene Length1221 bp 
Protein Length406 aa 
Translation table 
GC content48% 
IMG OID640420645 
Productpredicted protein 
Protein accessionXP_001421017 
Protein GI145353432 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5021] Ubiquitin-protein ligase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones76 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGC AACGCCCGCT GTTGCTTTCC GGACCCATGA AAATCCTGTT GAGCAATCCA 
CGCTTGCTGG ACTTCTCCGT CAAACGTGCG GAGATTCGGA CACGAATCAA GAAACTTCGC
GAACGCCTAG GACATAATCG TCCAGAGGCG CGAACGTTAC ACATTAGGCG TGATCGAATA
CTCGAAGACT CGTTTAGACA ACTCAACAGC CGGAGTATCG AAGAAATTCG AGGCAAAATC
AGCATCGTTT TCGTGGGCGA AGAAGGCATG GACGGTGGCG GTTTGATAAA GGAGTGGTTC
ACCATCTTGG CACGAGAAGT TTTCAATCCA AACATCGCTC TCTTCGAGTT GTCTCACGAC
AAGGGATGCT ATCAGCCGAA TCAAAACAGT GTGGTCCATC CGGATTATCT CAGCTATTTT
AGATTCGTCG GTAGACTCGT CGGTAAGGCT TTGTTCGACG ACATTCTCCT CAACGCATAC
TTCACGCGTC CGATTTACAA GCACCTTCTC GGTCAGCAGC TCACATACGA AGACATGGAA
GGTGTAGATC CAGATTATTA CAAGAGCTTG AAATGGATGC TGGAGAACTC TGTGGAGGGT
GTCATGGAAT ACACATTCAG CGACACAACG TCTTATTTTG GTGAAACTCA AGTTCACGAT
TTGACCGAAA ACGGACGAAA TATCGCAGTG ACAGATGCAA ACAAGTTTGA ATACGTCAAC
CTGATAACCG CGCACCGAAT GACGAATGCG GTGAAGGACC AACTCGCTGC TCTCGTGAAG
GGGTTTGAAG AAGTTGTCCC TAGAGAAACG ATTTCCATCC TGAATGCGTC TGAATTGGAA
CTGCTCATAA GTGGTACCCC GGACATCGAC GTCGAGGATT TACGCGCCAA TACTGAATAC
ACCGGCTTCA CCGTCGGGTC AAAACAAATT CAATGGTTTT GGGACGTCGT GAGGGAAATG
AACAAGGAAG ACTTGGCGCG CTTATTGATG TTTTGTACCG GTACCTCTAA GGTTCCTTTG
GATGGATTCG GTGCTTTGCA AGGCATGCAA GGCCCGCAAC GTTTTCAAAT CCATCGGCAG
CACGCGGATG ATTCAAAGTT GCCATCCGCA CACACGTGCT TCAATCAACT CGATTTGCAC
GAATACAGCT CAAAGCAAAT CTTACGCGAC AGGCTGCTGT ACGCGATTGT TGAAGGTTGT
GAAGGCTTTG GCTTCATTTA G
 
Protein sequence
MLKQRPLLLS GPMKILLSNP RLLDFSVKRA EIRTRIKKLR ERLGHNRPEA RTLHIRRDRI 
LEDSFRQLNS RSIEEIRGKI SIVFVGEEGM DGGGLIKEWF TILAREVFNP NIALFELSHD
KGCYQPNQNS VVHPDYLSYF RFVGRLVGKA LFDDILLNAY FTRPIYKHLL GQQLTYEDME
GVDPDYYKSL KWMLENSVEG VMEYTFSDTT SYFGETQVHD LTENGRNIAV TDANKFEYVN
LITAHRMTNA VKDQLAALVK GFEEVVPRET ISILNASELE LLISGTPDID VEDLRANTEY
TGFTVGSKQI QWFWDVVREM NKEDLARLLM FCTGTSKVPL DGFGALQGMQ GPQRFQIHRQ
HADDSKLPSA HTCFNQLDLH EYSSKQILRD RLLYAIVEGC EGFGFI