Gene OSTLU_43047 details


Gene Information

Locus tag: OSTLU_43047
Symbol:
ID: 5005451
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 455738
End bp: 457363
Gene Length: 1626 bp
Protein Length: 399 aa
Translation table:
GC content: 49%
IMG OID: 640420872
Product: predicted protein
Protein accession: XP_001421294
Protein GI: 145354020
COG category:
COG ID:
TIGRFAM ID: [TIGR02167] bacterial surface protein 26-residue repeat
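
The gene length reported above is consistent with the start and end positions when both are counted inclusively. A minimal sketch in Python, assuming 1-based inclusive coordinates on the replicon (the helper name `span_length` is hypothetical and not part of any database API):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(455738, 457363)
print(gene_length)  # 1626
```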


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0199168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.21868
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATATAG CGCAGTGGGA TACGAGCTTG ATCACGGATA TGAATCACTT GTTCTACAAC 
AAAATTTCAT TCGACGCCAA TATCAGCACG TGGAACACGG CGAAAGTGAC AAAAATGGAT
AGCATGTTTT CTCAAGCACG TGCATTTAAC CAGAGGATTA ATGAATGGAA TACATCGAGC
GTGGTGACAA TGGAAGCTAT GTTCCGCTAC GCCGAGTCCT TTGACCAGCC CATTGGTGAG
TGGAATACCG GAAGAGTCAC CAATATGAAA GACATGTTTT CGCAGGCGTC TGAATTCAAT
CAGAATATCG GTAATTGGGA TACATCGAGT GTGACGACGA TGGATACTAT GTTTCGCTAC
ACCGAGTCCT TTAACCAGCC CATTAGTGAG TGGAATACCG GAAGAGTCAC CAATATGAAA
GACATGTTTT CGCATGCGTA CGCATTCAAT CATTCAATCG GTGACTGGAA CACGGGTGCC
GTGGAGAATA TGGAGCTTAT GTTCCAAAAT ACCGGCGTTT TCAACCAGGA CATCAGCAGT
TGGAACACGG CGGCGGTGAC TCAGATGGAC TCAATGTTTT ACGGTGCTTC AGCCTTCAAT
TATGACATCA CTGGTTGGAG TTCAGCAAGC CTTACGTCGT CGACGGACAT GTTTCTCTCC
GCGACGGCTT GGCTGGCGAG GTACAAGAAC ACCGCCTCCC CAGGTAGCAA AAATGGGCCA
TCGTCGAACT GGCTCGGTCC CGGTCCGTTC ACGACCAAGT CGGCGTTGGA GACCGCTACA
TGGAACTGCT TGGCGGGGAG CGCCGATGCC GATGGCAACT GCGACTGTAC GACCGTCGAT
TGTGGCGCGG CGGTGTATGA ATCGATTTCG AACTGGAACA CGAGTTTGAT CACGGACATG
ACCGGTTTGT TTGAAGGTAA GTCTTCATTC AACGCGAATA TCGGCGGATG GAACACGGGG
AGCGTCACAA ACATGCACCA GATGTTTAAC GGCGCGACTG CTTTCAACCA AGACATCGGT
TCTTGGGATG TCTCGGATGT AGCGGACATG TATCAAATGT TCGGTAGCGC GTCCTCGTTC
AATCAAAACA TTGGGTCGTG GAACACAGGC AGTGTGACGG ATATGAGCGG TATGTTCATG
AATGCATCTG CGTTTAACCA GCCCGTTGGT GACTGGAACA CAGGCAGCGT CGTGGATATG
AGCGGCATGT TTGAGAGCGT GTCTGCGTTT AACCAGCCCA TTGGTGAGTG GAATACCGGA
AGAGTCACCA ATATGAAAGA AATGTTTTCG CAGGCGTCTG AATTCAATCA GAATATCGGT
AATTGGGATA CATCGAGTGT GACGACGATG GATACTATGT TTCGCTACAC CGAGTCCTTT
AACCAGCCCA TTAGTGAGTG GAATACCGGA AGAGTCACCA ATATGAAAGA CATGTTTTCG
CATGCGTACG CATTCAATCA TTCAATCGGT GACTGGAACA CGGGTGCCGT GGAGAATATG
GAGCTTATGT TCCAAAATAC CGGCGTTTTC AACCAGGACA TCAGCAGTTG GAACACGGCG
GCGGTGACTC AGATGGACTC AATGTTTTAC GGTGCTTCAG CCTTCAATTA TGACATCACT
GGTTGG
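
The gene sequence above is laid out in space-separated blocks of ten bases, so the layout has to be stripped before the sequence can be measured. A minimal sketch in Python, assuming the text has been pasted into a string; the function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for illustration:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTATATAG CGCAGTGGGA TACGAGCTTG ATCACGGATA TGAATCACTT GTTCTACAAC"
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases on a full line
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Applying the same two functions to the full sequence block would give the total length and overall GC fraction, which can then be compared with the Gene Length and GC content fields.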
 
Protein sequence
MYIAQWDTSL ITDMNHLFYN KISFDANIST WNTAKVTKMD SMFSQARAFN QRINEWNTSS 
VVTMEAMFRY AESFDQPIGE WNTGRVTNMK DMFSQASEFN QNIGNWDTSS SALETATWNC
LAGSADADGN CDCTTVDCGA AVYESISNWN TSLITDMTGL FEGKSSFNAN IGGWNTGSVT
NMHQMFNGAT AFNQDIGSWD VSDVADMYQM FGSASSFNQN IGSWNTGSVT DMSGMFMNAS
AFNQPVGDWN TGSVVDMSGM FESVSAFNQP IGEWNTGRVT NMKEMFSQAS EFNQNIGNWD
TSSVTTMDTM FRYTESFNQP ISEWNTGRVT NMKDMFSHAY AFNHSIGDWN TGAVENMELM
FQNTGVFNQD ISSWNTAAVT QMDSMFYGAS AFNYDITGW