Gene OSTLU_32267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32267 
Symbol 
ID5002211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp554827 
End bp556749 
Gene Length1923 bp 
Protein Length640 aa 
Translation table 
GC content63% 
IMG OID640417632 
Productpredicted protein 
Protein accessionXP_001418274 
Protein GI145347647 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.466679 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGGC CCGCGGACGC GCGCGACGCG GTCTCGAGCG CGGTGCTCGC GGTGCTGCGC 
GCGCCGTGCG ACGCGTACGC GAAATCGTTC GACGATGAAA CCGCGGCGTC GCTGCGCGCG
GGAGCGCGAC GAGCGCTCGA GGCGCAGTGC GCGCGCGCAG GCGGGGACGA GACGCGGAGC
GACGCGTGCG GGACGCTCGC GACGGTGGCG GACGGCGCGC GCGCGATGAC GCGAGGAGAG
GCGATGGGAC ACTTTAAGCA AGCGATGCAC GAGAGCTGTC AATCGGGGGC GGCGGCGATG
TCGGATCCGG AGAAGCGGGA CGCGAGTGAG GCGACGCGGT ACGTGCGCGA GGCGGTGACG
CGCGCGGCGG AGGAGGAAGC GGTGGCGGGA CGCACGGCGA GCGCGGGGAG ATGGGAGGAC
GACGGGGGCG AGGGAACGTC GCCGGCGACG TCGGTGACGG AGGAAGGCGG CGACGGGCGC
ACGGGTGATC GTGGAATGTG GGGATGGGAT CGAGACGCGC CGACTCAAAC GTCGGCGGAG
GACGCGTTCG GGGCGGCGTC GAGCGTGGAT TCGATCAAGC ATCGCGAGCG ACAGGCGTTG
GGTGACATGT ACAGGCGTAC GAACGGTGCG AAATGGCGCA GACGGGACGG TTGGATGTCC
AGCAAGTCTT ACTGCGAGTG GTACGGGGTG ACGTGTCTCG AAAAGGATTT CGGCGTCGCC
TTTGTGGATT TGCGAGACAA CGGCATGGAG GGCGACATGC CGCAAGCGAT CGATGAATTG
AAGCTTTTAC AAGGACTGGA CCTGTCGTAT AATCGGCTCG AAGGACGTTT GAGCGCGATG
CTCGGTGAGC TGAAGACGCT GCGGTACCTC TTGGTACGAT CGAACGCGCT GTATTCCGAC
ATTCCCGCCG GCTTGTTTAG GAAAGGCTCG CCGTTGACGC AGCTGGACCT GAGCGACAAT
TCTTTGTCTG GGGCGATTCC TGGACGCGAG TTTGTGTACT TGACAAGCTT GCGTATGTTC
AATGTGTCAA ACAACGCGCT CACGGGTACG ATTCCGAACA TCGCCTCGCT CCCAGCGTTG
GAGATATTTT CGGCATCCAC CAACGCGCTG CGTGGCGCTG TGCCTCACTT TGACGACGCC
GCCAAGATTC GTTTCTTCGA CGTGAGCAAG AATGCATTAC ATGGTTCGAT TCCATCGCTT
TCATCCGTAC CGTCCTGGGT GTTGTTTGAC GTCTCCCACA ACTCCCTCAC CGGCGAGCTC
CCGCGGACAG CGCTTCCGCG AACGCTTCGC GTGTTCTCGT GCGCCAACAA CAACCTGAAT
GGCACCGTGC CGCAAACGTT CGCTCAGCTG CCCAAGGTGG AGCACTTGGA CTTTTCGGCG
AATCAATTCA CCGGCGCGTT ACCCGCGAGC GTGTTGCAAA AAAAGACGCT GCGATACTTC
AACGTCTCGC GGAACGCGTT CGAAGGCGAA CTTCCGCGCT CGGTGTACCA AGGCGAACTC
GAGCGTCAAT CCATGAGACT GGAGGAATTT GACGTCAGTC ACAACAAACT CACGGGTGCG
CTGCCGCAGT CAATCGTCGA GTTGGACCGC CTTCGTGTCA TCGACGTCGC GCACAACGCG
CTGAGCGGCG ATTTACCTTC GCGTTGGGCC GTCGACCGCC TCGAGCGTCT CGACGTCAAA
GCCAACGCGT TCACGGGCGC CATCCCCACC ATCCTCGCCA GAGCCACGCG CCTGCGCCAC
CTCGATTTGA GTCAAAACGC CCTCAGATCT CGCGCCAACT TAGCCGTGCT CACGATCCCC
ACCCTCGAGC ACTTGGACGT CTCCGGAAAC TCGCTCGATT GGAACGAAGC CGCCGCCGCG
CCGGCGCCGA AAATCGACCG AGCGCGCGCC ATCGAACCCC CTTCGCTTCA CGACGACCTC
TGA
 
Protein sequence
MARPADARDA VSSAVLAVLR APCDAYAKSF DDETAASLRA GARRALEAQC ARAGGDETRS 
DACGTLATVA DGARAMTRGE AMGHFKQAMH ESCQSGAAAM SDPEKRDASE ATRYVREAVT
RAAEEEAVAG RTASAGRWED DGGEGTSPAT SVTEEGGDGR TGDRGMWGWD RDAPTQTSAE
DAFGAASSVD SIKHRERQAL GDMYRRTNGA KWRRRDGWMS SKSYCEWYGV TCLEKDFGVA
FVDLRDNGME GDMPQAIDEL KLLQGLDLSY NRLEGRLSAM LGELKTLRYL LVRSNALYSD
IPAGLFRKGS PLTQLDLSDN SLSGAIPGRE FVYLTSLRMF NVSNNALTGT IPNIASLPAL
EIFSASTNAL RGAVPHFDDA AKIRFFDVSK NALHGSIPSL SSVPSWVLFD VSHNSLTGEL
PRTALPRTLR VFSCANNNLN GTVPQTFAQL PKVEHLDFSA NQFTGALPAS VLQKKTLRYF
NVSRNAFEGE LPRSVYQGEL ERQSMRLEEF DVSHNKLTGA LPQSIVELDR LRVIDVAHNA
LSGDLPSRWA VDRLERLDVK ANAFTGAIPT ILARATRLRH LDLSQNALRS RANLAVLTIP
TLEHLDVSGN SLDWNEAAAA PAPKIDRARA IEPPSLHDDL