Gene OSTLU_15517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_15517 
Symbol 
ID5001770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp601668 
End bp603053 
Gene Length1386 bp 
Protein Length461 aa 
Translation table 
GC content56% 
IMG OID640417191 
Productpredicted protein 
Protein accessionXP_001417818 
Protein GI145346692 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.569718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA CGGGCGAGAC GGACGACGGC GCGACGGCGC GACGACGAGG GGCGCGGACG 
ATGCGAGTGG GTGATGTGGT GTTGATTGAG ATCAACGAAG GCGAGCGAGC GACGTTCGCG
ACGCTCAAGG AGGGGAAGAG CGTGGACGTA GGGAAGAGAC AAAAAGTACC CGTCGAGGCG
CTGTTGGGGG CGCCGTTCGG GTCGCGGTAC GAGATCGTGC ATAACACCGG CGCGTGCGTG
CGATTGGGCG CGGAGGGGGT GGATGATGAT GTCGCGCATG GTTCGGCGCC GGTGGAGGAC
GAGCGGTCGA ATAAGCACGT GTCGAATAAA CACGAAGGCG CGCAACGGCT CACGGACGCC
GACATCGCGG CGTTGAGGAG AGAGTTTACC GGCGAGGAGA TGGTGGAAAT CATAGCGGCC
AATAGCAAGA CTTTCGACGA AAAGACGGCG TTCGCGCAAG AAAAGTATCG AGCGAGAAAG
ATGAAGAAGC ACATGACGAG AATTTTGGTG AGATTTCCGT CGCCGAGGCA AGTTTGCGAG
CAGTACTTTT ACAGCAATCC GTACAAGACA TCGCACATGC GCTTCGACGC GCTCTCGATG
TTGCTCAACG CGGGGAACGT GGGCGCACAC GCGCAAACTT TAGTGTTAGA CACGTGTGGT
GGCATCGTCC TCGGAGCCGT CACCCACCGC ATGGGAGGTA TGGGGCGCAT TTGCAACGGA
TTTATAGGCC AAAATCCAAC CGCGATGGAT GTGTTGCTGC AGATGAATTT GGAAGACGCA
CACTTCGATT GCATGCGACA CGCGGCGCTT TCGAAGCTTA TCGAGAGGCG AGAGGGAAGA
ATGGATGAAG ATTCGAAGAT GGATGAAGCG AATGGGTCAA AGGATGGCGA AAAACTGTCG
ACGACGGAGA AACAGGAGCG AAAGCACATG AAAGTCAAGT ACGCCGCCGA AGACGACTTT
GCCAACTTTG CCCGCCAGGG CTTCAGCTCA CTCATCGTCT CCTCCCTTTC AATCGAGCCA
AAATCTACGC TTGAACAATT GCTACCGCTG TGCGCGTCCT CAGCGTCGTT CGCTATATGG
TTCAACGCCG CCCAACCTTT AGCCGAGGCG TTGCATCATC TGAGAAACAC GAACACTGCG
ATTAATCTTT CCTTGGTCGA ACCATTCATG CGAGCCCAGC AAGTACTGCC TGGGCGGACG
CATCCCGTGA TGACTACTGA TGCTGGCTCA GGCGGGTGGA TTTTGAGCGG CACCTACGTC
GGTGTTGGCA ATTCAGTACA CGAAAACTCC AAGCCAGAGG CAGAGATGGC GGAGACAGCG
AAGACGGCGG AGACGGCGAC GCCAAAAGAC GAATCGAACA AAAAGGCAAA ATTGGAGGAT
GAGTGA
 
Protein sequence
MATTGETDDG ATARRRGART MRVGDVVLIE INEGERATFA TLKEGKSVDV GKRQKVPVEA 
LLGAPFGSRY EIVHNTGACV RLGAEGVDDD VAHGSAPVED ERSNKHVSNK HEGAQRLTDA
DIAALRREFT GEEMVEIIAA NSKTFDEKTA FAQEKYRARK MKKHMTRILV RFPSPRQVCE
QYFYSNPYKT SHMRFDALSM LLNAGNVGAH AQTLVLDTCG GIVLGAVTHR MGGMGRICNG
FIGQNPTAMD VLLQMNLEDA HFDCMRHAAL SKLIERREGR MDEDSKMDEA NGSKDGEKLS
TTEKQERKHM KVKYAAEDDF ANFARQGFSS LIVSSLSIEP KSTLEQLLPL CASSASFAIW
FNAAQPLAEA LHHLRNTNTA INLSLVEPFM RAQQVLPGRT HPVMTTDAGS GGWILSGTYV
GVGNSVHENS KPEAEMAETA KTAETATPKD ESNKKAKLED E