Gene OSTLU_38551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38551 
Symbol 
ID5002090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp741526 
End bp743910 
Gene Length2385 bp 
Protein Length794 aa 
Translation table 
GC content63% 
IMG OID640417511 
Productpredicted protein 
Protein accessionXP_001417857 
Protein GI145346772 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA CGCTCGAGGG GACGGTGAAG ACGTCGGCGC GCGCGTCGCG CGCGCGGGCG 
GCGCGTGCGA GCGGGCGGGC GGTGGGACGG ACGGACGCGG TGCTGGCGCG CGCGCGCGGC
GCGACGCTCG GAGACGTGCG ACGCGAGCAG ATCGACGCGG CGCTGCGGTT ACTGATCGAC
GAGCAGTGCA CGCTGCCGTT CGTGGCGAGG TATCGCAAGG AGAGGACGGA TGGGCTGGAC
GAGGGACAGT TGCGCGCGGT GGCGGCGGCG CACGCGGAGT GGACGCGATT GGAATCGAAG
CGCGAGAGCG CGTTCGAGGC GTTGACGCGG TTGGAGATCG TGGACGAAAG GTTGTTCGCG
AGATTGCGCG AAGCGAGCGC GGTGGAGACG ATCGAAGACT TGATGAGCAA GTATAAGACG
AAGACGAGCA GTCGCGCGGA CAGGGCGCGA GAGCTCGGCT GCGAACCGCT CGCGGAGAAG
ATTTTGAGCG ACGGCGCGGG GGGGGTCGAG GCGTTAGCGC GCGCGTACGT GAATGGGAAT
GATGTTCCAG ACGTGTGCGA GGCGTTGAGG CTGGCGAAAG ATGTGCTCGC GGAGAGGGCG
GCGAACGAGA CGAGCGCTCG GGAGGCGGCG AGACGATCGA CGTGGCAAAA CGGAAGGCTT
CAGTGCGCGT TGACGACGAA TGGGAAGGCG GCGGTGGAGG GAAAGATTGA CGATAAAAAG
ATGAAGAAGA TAGCCGACGC GTTGAGGGAC TACTACGAGT TCAACGCGCC ATTGCGAAGG
GTGAAGCCGC ATCAGACCTT GGCGATTTTA CGCGGCGAAG CCGCGAAGGT GTTGCGAGTG
AAGATTGATT TCGACGTCGC GTGGGCAACG TCGGCGGCGA TGAAAGCCAT GGTAGGCTCG
AGACGCTTAG GTGGGGGTTG TTATAACGTC GTTAGCGAGG CGGTGGAAGA TGGTGTGAAA
CGGTTGCTCG CACCCTCCAT CGAACGCGAG GCGAAGAGTC GACTCAAGAC GGAGGCGATG
GAGCGCGCCA TCGCAGATTT CGGCGCTAAT CTTCGCTCAC TGCTTCTTCA ACCGCCGCTG
ACACCCGCAG CTGTCGTGCT CGGCGTTGAT CCAGCGTATC GCACCGGCTG CAAACTCGCC
GTGGTTGACC CTACGGGAGC TCTGCTCGAT ACCGGCGTCG TACACTTGCC GCAATTCGAG
TCGAAGGTGA AACGAGAAGG CGGCGGTCAA TCCGCAGCCG CGAGCAAGCT GCAAGCGTTG
GTGAAGCAAT GGAACGTCAA GGCGATCGCG ATCGGCGACG GCGTAGCGAG TCGCGAAACC
GAATCGCTCG TCGCCACCGC GCTCGCTGGT TTTGAAGGCG TCGGTTGGCG CGTCGTCTCG
GAAGCCGGCG CCTCGGTGTA CAGCGCCTCC GAAATCGCGG CGAAAGAGTT GCCGGACATA
GACGTGTCAT TGCGCGGAGC GGTATCTATC GCGCGAAGAC TGCAAGATCC GATGGCCGAG
CTCGTTAAGA TCGAGCCCCA GTCCATCGGT ATAGGACTGT ATCAGCATGA CGTCAAGGAG
AAGGAGCTCG CTGACGCGTT GACGGCGACG GTGGAGAGCG CGGTGGCAAA CGTCGGTGCG
AATCTAAACA CATCCTCACA GTCGCTCTTG TCGCGTATTC CAGGCCTCGG CCCGTCGCTC
GCGGCGAAGA TCGTGCGACA TCGCTCTGAG CGTGGAAGGT TCCGCGAACG CGCCGCGCTG
AAAACCATCG CGGGCGTGGG CGCAAAGACG TATGAGCAAA TAATTGGGTT CTTGCGCGTC
CCGGACGCGA AAGACGCGCT GGAGCGCACT TCGGTGCACC CCGAGAGCTA CCACATCGCC
AAAAAGCTGT TGAAGATGCT CGAAAGAGAC GTGGCTGAGG TGATTGATCT CACGGCGAGC
GACGCCGGCG ACACGTCGAT GGACGCGCTG AAACGCCTCA AGCCCAAACT CAAGGCACTT
CGCGACGACG TAAAAGACCT CGAGGCGCGC GCTGCGCGCC TCAAGTGCCA CCCCTTAACG
CTCAAAGACA TCGCGAACGA ACTCAGCGCC CCGGGTGATG ACGCCCGAGG CCAAGCCGCC
GAGCAAAACG TGCTCCGAAC GCAAGCGCTC ACGCTCGAGG ACTTGCAAAC GGGCGCCGAA
ATCAAAAACG CCGTCGTCCG CAACGTCGTC CCTTTCGGCT GTTTCCTCGA TCTCGGCGTC
GGCCGCGACG CCCTCCTACA CGTCACCGCG ATGCGCAAAC GCACCGACGC CAACCCCAAC
GCCCAAATCG ATCCCCACGC GCTCTATCGC GTCGGCCAAA CGTGCGCCGT TCGCGTCAAA
TCCGTCGACA TCGCTCGCCA GCGCATCGCC GTCGAGCTCC CTTAA
 
Protein sequence
MKRTLEGTVK TSARASRARA ARASGRAVGR TDAVLARARG ATLGDVRREQ IDAALRLLID 
EQCTLPFVAR YRKERTDGLD EGQLRAVAAA HAEWTRLESK RESAFEALTR LEIVDERLFA
RLREASAVET IEDLMSKYKT KTSSRADRAR ELGCEPLAEK ILSDGAGGVE ALARAYVNGN
DVPDVCEALR LAKDVLAERA ANETSAREAA RRSTWQNGRL QCALTTNGKA AVEGKIDDKK
MKKIADALRD YYEFNAPLRR VKPHQTLAIL RGEAAKVLRV KIDFDVAWAT SAAMKAMVGS
RRLGGGCYNV VSEAVEDGVK RLLAPSIERE AKSRLKTEAM ERAIADFGAN LRSLLLQPPL
TPAAVVLGVD PAYRTGCKLA VVDPTGALLD TGVVHLPQFE SKVKREGGGQ SAAASKLQAL
VKQWNVKAIA IGDGVASRET ESLVATALAG FEGVGWRVVS EAGASVYSAS EIAAKELPDI
DVSLRGAVSI ARRLQDPMAE LVKIEPQSIG IGLYQHDVKE KELADALTAT VESAVANVGA
NLNTSSQSLL SRIPGLGPSL AAKIVRHRSE RGRFRERAAL KTIAGVGAKT YEQIIGFLRV
PDAKDALERT SVHPESYHIA KKLLKMLERD VAEVIDLTAS DAGDTSMDAL KRLKPKLKAL
RDDVKDLEAR AARLKCHPLT LKDIANELSA PGDDARGQAA EQNVLRTQAL TLEDLQTGAE
IKNAVVRNVV PFGCFLDLGV GRDALLHVTA MRKRTDANPN AQIDPHALYR VGQTCAVRVK
SVDIARQRIA VELP