Gene OSTLU_18495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18495 
Symbol 
ID5006026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp147825 
End bp149582 
Gene Length1758 bp 
Protein Length585 aa 
Translation table 
GC content55% 
IMG OID640421447 
Productpredicted protein 
Protein accessionXP_001421986 
Protein GI145355474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.365263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACG CCGACGTGGA CGACGTCGTC TTCCACCGCT GTGCGTGCGC GTCCACGGTG 
GCTGGGGCGA AGACGCGCGG TGAGGCGTGC GTGCGCACGC GCACGGTGAC GTGGACGCCC
GCGGATACCG GGAATCTCAC CGGTGCGAAG ACCGTGACTT TGCTAAGCGT TACCGCGCAC
CAAAGGAATA AACCGGGTAG CGCCACGGCG AGCCTGAGGT TGGTGCCGGA CAACGACGCC
AAGCGGGCGC TTGTGCTGAC GTTCGATGCG GAGAGCGACA GAGATGGGTT AAGCGATGCG
ATAAAGGCGC AGATAAGAAA ATTAGAGGAA GAGATGAAAG GCGTGCCGAG CGCGGCGGAG
CTGGAGGCGC GAAGCCAGTT GCTGAAGACG AACGTAGAGA TTGCGCAGCT GTACGAGGAG
ACGGTGAAAG CCGGAGTGAT CAGCAATGAA GACTTCTGGG AAGCACGTAG GAATTTATTC
AGCGACGTAG TGGCGAGAAA AGCGACTGGA CAGAAACATG GAATCGAAAA TACGCTAGAT
GGCGATCTCA AAGGCGCACG CGATGGGATG TCAGACACGG TGACGTGTAA CTTGACGAAC
GAAAAGATGC ACAGAATCTT CGCCGAAAGA CCTTCGGTGC GCGAAGCATT TTTGAATAAT
GTGCCAAAAA AGATGACAGA ACGCGAGTTT TGGACGAGAT TCTTACGATC GGAGTATTTC
AAACAGATGC GCGCGGGAGC ACCACCGCAA GGAGAAGAGG AAGCTGCAGA TTTGGCTCTC
TTTGCGCGAA AACCACCAGA TGCAACCACG GTTAAGGCAC AAATCAAAGC GATCGAGCCC
GGTGTGAACT TAAAAGCGGA CCTCGATGAC GGGCTCGGAG AGGGATATGG TTTGCTACGA
GACGGCTCTC GAGATGATCG CCGACCGAAA GAAGCTGGTC CGTTACCCGA GGTATTCTTG
GAGCTCAACC ATCACGCCGC CGTTGTGCTT CGAGGTCAGC CGCAAGTGAA CATCGTCGAT
GCTCGCACCG CGGCGATCGC CGCAAGAGAT CATGAAAAGA GCGCAGAGAG CGCGAGAATA
GGGTTCGAAG AGCCCGATTA CTCGATTGAC GACCTCGCCG CACCGAAGCC CGCGGCGCCG
CCAAAGGAGC TCAAAATTCG TGATTCGCAT CAATACTTTT CATCTATCGC GGAAGAACCC
GTGGTGAGTA AGCCTTCGAA GACGCATGTC GTCATCGAAC CGTTCAAAGA GGCCGTGACG
TCGGCTCTGA GTACGCTCCA GCATCCCGAA AAGAAGCAAA AACGCGCATT GGATCCCGAC
ATCGCACTGC AAGTGTTGAA AGAGGTCACG CAATCACTCG CTAAAGTTGA AGATTTGAGC
GCTCAATTTG CGTCACAGCA AAGTTTTAGT AACGCTGATG ATGTGGCTGC GTTTCCTGAG
AATGTGAACG AGCAACTGAG ACGGAGCGCG GCGACTGGTG GGGAGCTGCT ACGTCAGTTC
TGGATGTCCA CGCCGATGAT AACGTCTGTG CGATGGGAGA AAGCGACGAA GGTCTGCAAA
TCCGTCGAGA TCTTGTACGA TAAACTCGAA GGAGTGAAAA GTTCGCTCCC TTCCGCCCAC
CGACATATCG CTTCTCAGCG CCTTAGGCCG CTGCTCGGTG CGTTTGATTC TGCGTTGACG
TTTTATGATG ACGAGAAAAC TCGTCGTCCT CAGGCGTACG CTGACTTTGA GAAGACAGCG
ACGCAAGAAA ATTCATAG
 
Protein sequence
MIDADVDDVV FHRCACASTV AGAKTRGEAC VRTRTVTWTP ADTGNLTGAK TVTLLSVTAH 
QRNKPGSATA SLRLVPDNDA KRALVLTFDA ESDRDGLSDA IKAQIRKLEE EMKGVPSAAE
LEARSQLLKT NVEIAQLYEE TVKAGVISNE DFWEARRNLF SDVVARKATG QKHGIENTLD
GDLKGARDGM SDTVTCNLTN EKMHRIFAER PSVREAFLNN VPKKMTEREF WTRFLRSEYF
KQMRAGAPPQ GEEEAADLAL FARKPPDATT VKAQIKAIEP GVNLKADLDD GLGEGYGLLR
DGSRDDRRPK EAGPLPEVFL ELNHHAAVVL RGQPQVNIVD ARTAAIAARD HEKSAESARI
GFEEPDYSID DLAAPKPAAP PKELKIRDSH QYFSSIAEEP VVSKPSKTHV VIEPFKEAVT
SALSTLQHPE KKQKRALDPD IALQVLKEVT QSLAKVEDLS AQFASQQSFS NADDVAAFPE
NVNEQLRRSA ATGGELLRQF WMSTPMITSV RWEKATKVCK SVEILYDKLE GVKSSLPSAH
RHIASQRLRP LLGAFDSALT FYDDEKTRRP QAYADFEKTA TQENS