Gene OSTLU_17956 details

Gene Information

Locus tag: OSTLU_17956
Symbol:
ID: 5005448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 2749
End bp: 4392
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table:
GC content: 53%
IMG OID: 640420869
Product: predicted protein
Protein accession: XP_001421176
Protein GI: 145353772
COG category:
COG ID:
TIGRFAM ID:
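
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (neither is stated in the record); the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not counted as a residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2749, 4392)
print(gene_len, protein_len)  # 1644 547
```

Under these assumptions the result agrees with the listed gene length (1644 bp) and protein length (547 aa).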


Plasmid Coverage information

Num covering plasmid clones: 83
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACAAC AAGAGATCCT TGCATTGGAA CGTGCGACCG CGAAGAAACT GCGTAAACAC 
AGGCGGAGGA CGAAAGTCAC AAGCAAAAGC AAAGCACGAA TCTTATCCTC CGAAACGTCA
AGAATGGAGC TTCCTGAATC CACGGTTTTG ACGTTGACGT CGGCCAAAGA GTCAGCGAAG
AGGATCGTAT CAATCAGGGA CGATATCGAG GTGGTGATTC ACGACGTCCA GACCAGCGGC
GGCGTGTACA AATCATCGCA TAAAAAAACG TTCGTGCAAA TAGCGTGGAA GGAAAAACGA
AATGCAACTC GCAAGGTCAA GACACGGAGA ATAGAGGGTA CAGAGAATCC ATCTTGGGAC
GCGCAAGGGA AGTTCATCTT TCAGAATTGT CGGGCGAATG CGGCTGGCAG CAAATTCGTC
ATCAAACTAA AGGAAGTGCG CTGGTTTAAA AGAGAAACAG TCGTTGGTTA CGCAAAGATT
CCGAGTACTT TGGTTCCATT GGATGGGACG AAGTTGAAGC TTCGCATCGC GCTCGTGACG
AAAAGGCACA GAGCCAAGGC TGTGCTGAGC ATGACAATCG GAAAAGCGAC GTTTGAGGAG
CCAGCGCTGG GGGAGGGTGG ACACTTTCCC GTCGTCACCG GCGCCGAACA AGAACTCGTC
TCATTCGAAG GAAACTTTCC CTCCAGGCGC GAAAAATTAG ACTTGATGGT TGTTAACGTG
ATGAGCGCTC GCGGCGTATT CGACGCCGAT GGGTTCGGTA CGAGCGACGT GTTCATCAGA
CTCGGTTTCG ACACTACGCC CATCGAAGAA CGCTACCAAA CTACGATTAA GTACCGCACT
CGCAATCCTG AGTGGAACGA GCGTTTCTTG TTGCGTGTAC CAAGCGTCGA TGCAGCTAGG
GGCGAACCCA AAGCAATCGT GTTCACGGTG TGGGACAAGG ACAGATTCTC GCCGAGCGAC
TTCCTTGGGG CCGCCGCTAT TCCACTCGAC CGTGTTTCGA CTACTGGCAG CGTCGCGGAT
TTGGACATGG ATTTGAAGGC TAGAATTGCG CCGGACCTCG CGGGTGAGGT GTGCTTCATC
CATCCACGCG CACCCGCAGA TTTGGGTAAA CTTCGAGTCA AGGTTTCGGC GCTCATCAGT
GACGCCGCCG AAACAATCGC AAAGACAGTC AATTTAGGAA GGATAGATCC CACTGAAGGC
ACAAGTCTGG CTACGAGCGT ACACGTCGCC GTCATCGCGG CGAGAAAGTT GTTACATGTT
GACACCAAAG GGTCCTGCGA TGCTTTCGCG TACGTGCGAA TGGACAATGC GCCCAAGAAT
GAATTCTGCA GGACAGATAC AATCGCAAAC ACGCTCCATC CCGTGTGGAA CAACGGCATG
GGAAAGACCT GTTCTCTCAT CGCTCGTCCG GGCTCTGGAG ATGTCTTGTT TCAATTGTAC
GATCGCAACC TGCTTAGCAA AACGCTCATG GGAACTGCGT CGGTGTCGTT GGCGTCTCTG
CCTCCAGATG GTTCATGGAC ACAAATCGCA ACTCCAGTTT ATGGCCAGGA CAAGAATCGA
AACACTCTCG TCGGCGGTTC GGACAGTTCC ATGGCATGGA ACGCTCCAGA GAGAGTGAAG
GGTGAAACTC ATCGTTCGCC TTAG
 
Protein sequence
MRQQEILALE RATAKKLRKH RRRTKVTSKS KARILSSETS RMELPESTVL TLTSAKESAK 
RIVSIRDDIE VVIHDVQTSG GVYKSSHKKT FVQIAWKEKR NATRKVKTRR IEGTENPSWD
AQGKFIFQNC RANAAGSKFV IKLKEVRWFK RETVVGYAKI PSTLVPLDGT KLKLRIALVT
KRHRAKAVLS MTIGKATFEE PALGEGGHFP VVTGAEQELV SFEGNFPSRR EKLDLMVVNV
MSARGVFDAD GFGTSDVFIR LGFDTTPIEE RYQTTIKYRT RNPEWNERFL LRVPSVDAAR
GEPKAIVFTV WDKDRFSPSD FLGAAAIPLD RVSTTGSVAD LDMDLKARIA PDLAGEVCFI
HPRAPADLGK LRVKVSALIS DAAETIAKTV NLGRIDPTEG TSLATSVHVA VIAARKLLHV
DTKGSCDAFA YVRMDNAPKN EFCRTDTIAN TLHPVWNNGM GKTCSLIARP GSGDVLFQLY
DRNLLSKTLM GTASVSLASL PPDGSWTQIA TPVYGQDKNR NTLVGGSDSS MAWNAPERVK
GETHRSP
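
The sequences above are printed in space-separated blocks of ten characters, so they need the whitespace stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the gene sequence. The Translation table field of this record is blank, so the use of the standard genetic code (NCBI table 1) is an assumption; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard genetic code (NCBI table 1), written in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCGACAAC AAGAGATCCT TGCATTGGAA"
print(translate(first_blocks))  # MRQQEILALE
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.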