Gene OSTLU_17704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17704 
Symbol 
ID5004857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp501619 
End bp504681 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table 
GC content63% 
IMG OID640420278 
Productpredicted protein 
Protein accessionXP_001420868 
Protein GI145353102 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.9022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCG ACGCGCGCGC GACCGGCGTC GACGTCGACG CGCGACGCGC GAACGTCGAC 
GACGCGACGC GACGCTCGAA GGATGCGCGC GAGACTTTGC GCGAGGGACG CGCGCGCGAC
GACGACGACG CGCCGCGCGA CGACGACGAT GGCTCGGACG ACGGTCGCGT CGTCGCGCAC
GAGGATTACT CGTGCGCGAC GCCGTTCGAG ACGCTGGCGC GGGACGCGGA GACGACGCTG
CGGGAGTGGT TGGACGAGGA CGCGGCGACG GTTTCGACGC GCGCGAGCGC GGCGGCGGCG
TTTCGACGCG AGTTTCGCGA ACACGGCTTG CACTGGCGGA AAGAGGCGTA CGGGTGCGAG
ATGTACGTTG ATGTCGAACG GCTCAGACGC GGACGCGCGT GGGAGACGCT GCGAGGGAAG
AGTTTGCTCG AGCTCGCGCG CGACGCCGGA CGCGACGGAG CGCGCGCTTC GTCCGAGTAC
GACGCGAGGA AACGAACGGT GGATGGGTTC GATTACGTCA CGTTTTTCGA TTGTGCGTGC
GTGATTTTAG TCCGTCCGAT GAGCGCGAGC GGGGCGGGGC AGTTCGTGGA CGAGGACGAG
GCCCTGACGG TGCGATCCGC GTTCGCGTTG GCGATGTCGG CGCTTCGCGT CGGCGGCGCC
GTGGCGGTGG CGACCCCGCG AGGTTTTTAC GAGCGCGATT TTTCCGCGGA CGTGTGCGCG
GGTGATGATA ACGACGATGC GGTTTGTGAA ACCGACGTCA CAAGGTTCAC GCGTTTTAGT
TCGCGCGAGG AGGTAACGGC GCGAACTTTC GCGGGGTGCG TGAACGCGTT TTACAGAAGC
GTTCGCGAGA CGCGTCGCTC GAACCCATCT ACGGCAGATA TCGACGCGGA AACGAAAGCC
TTGGAAAGTT CGATTGTTTC AGCGCGCATC ACGCGCGAGT TCACGCGCGC AAAGGCTACG
ACATCGCCGA TGACGTCTGA AATTGAAGAC GAGAGCGACG ACGACGATTT CGGTTTGCGC
GTCGACGAGT GGGACGACGA TTGTCCTTGG GCTGCATGGG TACACGCGGA GGATCCGTGG
CGGACGTTAG AGGTGGACGC GCTCTGGCTC GATGTGCGCT TAGATTGCGT CACATTCTCC
GATCTCGACG TCGAAGACGC GCCGATTTGG CGTTTGCGGG GCGACCTCAC GCAAGCGGCG
CGAGATCGAG GCGAGGAGGA CGACGCCGTC ATTTCGAGCG ACACGTCATC GATGAGCGAG
GCGATATTCT CGCTCATCGA GAGCGCGGAG ATCATTCGTG GAGAAGATGT CGAAGATGTG
ACGGGGGCAT TGGACACGAT GACTTTGGCG AGCGCGGAGT TTTGGGTCGA GCGCGGATAC
CAGACGCCTC ACATACCGAA CGACGAAACG GCTCGAGGGG TGACGCGTGA TATCTTTACG
TCCTTCGCGA CGACGGCGTT GGTGACGCAA GAGTCGTTTC GTGAACCGTA CAAAACGGCG
CCTGCCAATT CTATCCTTGC CCGTTTCGCC CTTCACGCGT GCTGCTCGTT GAAAAATCCG
CGCGCGGTGG CGCATTCGTG GAACGCTTTC GCTCGCGAGT TGCGACGCGA ATACTGGGAA
CGGGGACGAC TCGTTCCAGG CGTCGTGCGC GATGAAATTC TAGGCATCGA TCACGGCGCA
TGCATAGTTT ACCAAAAATT GCAGATGCTT AATCAATGCA TCGCGCGAAG GAACGCGAAG
AGCGACGCCG AGTTGCACGC GTCGACGCGA GCGAGCGACG GCGCGCAGCC GAGATCCGCC
GACTCGCCAC GTGGTGACGA GTTCGATCGT TTGCTCAAAA CCACGAGCTC GACGTCGACG
AGAATGCGTC ATCCCGTGGA CGATGAATTA GATTTAAACG CGCTGTTGAG TGGCGACGTT
GAGCTTTCGA GCGCAGCGCT CGAGAGAGCG ATGAGCGGGA GCGAGAGCGC GACGACGGCG
AAGTCGGACG AGTACGCGAG CGCCGAGGAA GATTTGTCGC CCGAAGACGG CGACGACAGC
CGCGCGGAAG GGGTCTCAGA GACACTGAAA ATTCGTTTGT TGAACGCACC GCATGCGTTC
ATTCGCGCGC CGATCACGCA AGAAGCACCG TGCATGACGG AGGACGCGTT GGCGGAGCGC
GAAGCGGCGC TTCGAGCGTT CGGAGACGAC GACGAGGGTC GCGCCGCGCG ACAACGCATT
CAAAGCGATT CGCTCGTGTC CGATATGTCC GCGTTCAAAG CCGCGAATCC ACGCGCGGTG
TTTGAAGATT TCGTGCGTTG GCACTCTCCC AAAGATTGGA TCGTCTCCGG CGCCGGAGAA
GAACAAACAG ACGAATCGAG CGGACGCTTG AGCGACAGAA TGCGCCGCGA CGGAAACACG
TGGCTCGAGC TATGGACGCG CGCGCCGCGC GTGCCGGCGC ATAAACAGCA CCCTCTGTTT
GATCCGATCG TTGAGGGCGA GAAAGCGATG CATTATCTCG AGACGATCCC AGCCACCGCG
CTCTTCGACG TCGCCGCGCG ATGCGCGTGC GCGGCGACGG CGGCTATTTT GTCGTCGTGG
TGGCTCACGT CGTCGGAGGA AGCATCAGCG CGAGCACCGG CGGAGACGCA CGAATCGATC
AAAACAGCAA TCGACACGTG CTCGCACTTT TTCGCGCGCA CCGAGCCGTT GACGCTGGAC
GAGTACGATT TCGTCTTCGC CTCGCTCCAA GTCGCCGAGC GCGCGACGTT TCGCGCGGCG
TCCGCGCGCG CCAGGCTTCC CGACGCCCCC CCGGATCTCA TTTCCCGCCT CCTCACCGCC
GCCGACGCGT GCGAAACCAT CCGCGCTCGC GATATCCCGG CCTCATCCCT CCACGTCGTG
TACACCGACT GCCAATCCCC CGCCGAGCGC GCGTACTTGG CGCTTCGCCT CGAGCGTCGT
CGTCGCGTCG CCGAGTTCGC CGTCCGCGCC TCGCCACCCC TCCCCGACCA CGCCCATCGC
GTCATCGCGT ACGACGACCA CACCCGCATC TGCGTTCGCA CCGCCGCGCG TCCGCGCTCG
TAG
 
Protein sequence
MASDARATGV DVDARRANVD DATRRSKDAR ETLREGRARD DDDAPRDDDD GSDDGRVVAH 
EDYSCATPFE TLARDAETTL REWLDEDAAT VSTRASAAAA FRREFREHGL HWRKEAYGCE
MYVDVERLRR GRAWETLRGK SLLELARDAG RDGARASSEY DARKRTVDGF DYVTFFDCAC
VILVRPMSAS GAGQFVDEDE ALTVRSAFAL AMSALRVGGA VAVATPRGFY ERDFSADVCA
GDDNDDAVCE TDVTRFTRFS SREEVTARTF AGCVNAFYRS VRETRRSNPS TADIDAETKA
LESSIVSARI TREFTRAKAT TSPMTSEIED ESDDDDFGLR VDEWDDDCPW AAWVHAEDPW
RTLEVDALWL DVRLDCVTFS DLDVEDAPIW RLRGDLTQAA RDRGEEDDAV ISSDTSSMSE
AIFSLIESAE IIRGEDVEDV TGALDTMTLA SAEFWVERGY QTPHIPNDET ARGVTRDIFT
SFATTALVTQ ESFREPYKTA PANSILARFA LHACCSLKNP RAVAHSWNAF ARELRREYWE
RGRLVPGVVR DEILGIDHGA CIVYQKLQML NQCIARRNAK SDAELHASTR ASDGAQPRSA
DSPRGDEFDR LLKTTSSTST RMRHPVDDEL DLNALLSGDV ELSSAALERA MSGSESATTA
KSDEYASAEE DLSPEDGDDS RAEGVSETLK IRLLNAPHAF IRAPITQEAP CMTEDALAER
EAALRAFGDD DEGRAARQRI QSDSLVSDMS AFKAANPRAV FEDFVRWHSP KDWIVSGAGE
EQTDESSGRL SDRMRRDGNT WLELWTRAPR VPAHKQHPLF DPIVEGEKAM HYLETIPATA
LFDVAARCAC AATAAILSSW WLTSSEEASA RAPAETHESI KTAIDTCSHF FARTEPLTLD
EYDFVFASLQ VAERATFRAA SARARLPDAP PDLISRLLTA ADACETIRAR DIPASSLHVV
YTDCQSPAER AYLALRLERR RRVAEFAVRA SPPLPDHAHR VIAYDDHTRI CVRTAARPRS