Gene OSTLU_87131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_87131 
Symbol 
ID5001572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp843077 
End bp844438 
Gene Length1362 bp 
Protein Length453 aa 
Translation table 
GC content62% 
IMG OID640416993 
Productpredicted protein 
Protein accessionXP_001417368 
Protein GI145345760 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00012796 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGACGG GGACGAAGGT GGCGCTCGAC GACGCGCGCG CGGCGTGCGT CGCGCTCGCG 
CTCGAGGAGG CGGACGCGGT CGGGAAGATC ACGCTCACGG GACCGGGGCG AGGGACGGCG
GCGGAGGCGC ATCGAAGGGT GGAGTTTCGA TGGGTGACGA TTCGGGGGCG GACGGCGCTG
CAGCGGACGC GGTACGACGA ACGACAGGCG TTCACGAGTA ATCACGCGAT CGAGGGCGAA
GGAGGGCTCG TTTCGAAGAG CGCGAGCGGG GATGAGGACG CGATCGGGGC GAGGGAGGCG
CTGGAGGAGG CGCTGAGGGC GGGATATAAA CATTGGCGCG TCGAACACGC GCGCGGCGGT
TACAACGTGA CGGCGAACGC GAAGAAGGCT CGGGCGACGA TTTCTAGGGA CAACGCGAGT
AAGGGGACGC TGATAGACGG TAGTGCGAGA ACGCAGACGA TCGTCGTCGG CCCACAAGGG
CACGACCGCG AGAAGTCGAG GTTACTGACG GGGGAGGACC CGTTTTTGCG ATACGTCGGC
GTCGTCGCCA AGGATGGAAC CATCAAGGCG AGCAAGCGGG ACAAGTACAA GCAAGTGGAG
GAGTTTCTGA AGATTTTGAA CGTTGCTTAC GACACCGCGA CGTCGGCGGG ACACATGAAG
GGCGGCGATG AGACTCGCCC GCTTCGTGTG TGCGATTTAG GATGCGGGAA CGCGTATCTC
ACGTTTGGGG CGTACTCGCT CTTGAGCTCG AAGAGACGCG TGCCTACAAA CGTAGTGGGC
GTCGACGTGA AGCGCCAAGC GCGCGAACAT AACTCGCGGG TGGCCAAAGA GCTAGGTTGG
GATGCGTCGA TGAGATTCAT TGAAGGCACG ATCGCCGACG CCGACGTGAC TTTCGTAGAC
GGTTCAGAGG AGGACGCGTT CACCGACGTA GTGCTCGCTT TGCACGCGTG CGACACGGCG
ACGGATGAGT CCATCGTTCG AACGGTGCGT TGGTGCGCTC CACTGGCGCT CATCGCGCCG
TGCTGTCATC ACGATCTCCA AGTGCGTTTG AAAAGCGCAC CTCATGTTGC TTTCCCGCCG
ATGGCGAGGC ACGGCATCCT CAGCGAACGT CTCGGTGACG TTCTCACGGA CGCATTCAGA
GCTCACATTT TGCGACTGCT GGGATATCGC GTCGACGTCA TGGAATTCGT AGGAGGGGAG
CACACGCCTC GAAATACTCT CATTCGAGCG ATCCGCACGA ACGCGTCGGC GTCGAAGGCG
GCGTGGGAAG AGTATGATCA CATGTGTTCA ACGTGGGGCG TCACGCCTTT TCTCGCGGAC
GCCCTCGCCG AGGAGTTGGC GGTGGCGCGT CGCGCGATCT GA
 
Protein sequence
MPTGTKVALD DARAACVALA LEEADAVGKI TLTGPGRGTA AEAHRRVEFR WVTIRGRTAL 
QRTRYDERQA FTSNHAIEGE GGLVSKSASG DEDAIGAREA LEEALRAGYK HWRVEHARGG
YNVTANAKKA RATISRDNAS KGTLIDGSAR TQTIVVGPQG HDREKSRLLT GEDPFLRYVG
VVAKDGTIKA SKRDKYKQVE EFLKILNVAY DTATSAGHMK GGDETRPLRV CDLGCGNAYL
TFGAYSLLSS KRRVPTNVVG VDVKRQAREH NSRVAKELGW DASMRFIEGT IADADVTFVD
GSEEDAFTDV VLALHACDTA TDESIVRTVR WCAPLALIAP CCHHDLQVRL KSAPHVAFPP
MARHGILSER LGDVLTDAFR AHILRLLGYR VDVMEFVGGE HTPRNTLIRA IRTNASASKA
AWEEYDHMCS TWGVTPFLAD ALAEELAVAR RAI