Gene OSTLU_16659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16659 
Symbol 
ID5003637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp77019 
End bp79247 
Gene Length2229 bp 
Protein Length742 aa 
Translation table 
GC content65% 
IMG OID640419058 
Productpredicted protein 
Protein accessionXP_001419484 
Protein GI145350159 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGAG GGGCGCACGC GCGGACGCGC TCGATGCGCG CGCTCGCGGT GACGCTGACG 
CTGACGCTGT CGCGATGGCG CGTCGGCGCG GCGATGACGA CCGACGGTTT GACGTCGTCG
GATCACCGAT CGCGACGCGC GCGCGGCGAC GAGGACGCGC GACGCGCGCG CGGTCGTCCG
GCGACGTGGA CGGCGTGCGC GGGTCCGTCG CGCGCGGCGC TCGAGCTCGC GTGGCGCGCG
GAGACGCACG CGTCGGTGTA CGCGACGCCG CTCGCGGAGG ACGTCGACCG GGACGGACGA
GGCGCGGCGG TGACGCAGAC GACGAGCGCG CGCGTTCGAG CGCACGACGG CGCGACGGGA
ATCGATTTAG ACGGCGAGAG CTGGGGCGCG CGACTCGGCG CGGCGTCTCG GGGAGGCGTG
GTGAGGATCG GGGAGGCGTA CGCGAGCGCG TCGCTGGCGG GCGAGGCGAG GACGTTCGAC
GGCGAGCGAA CGCGCGCGGT CGCGCGATTA GCGCCGCTGG CGATGGCGGT GGATTGGTTC
GGTGACGATG GATCGGGGAA GAGCGACGCG GGCGATCCGC GCGAGGTTGA GCGGAGACAC
GAGGCGAGAC GACGCGCGCC GGGATCGAGG CGGTTGTTGG ACGTCGAAGA CGAAGACGAA
GAGGAAGAGC CGACGACGAT GAGCGCGGAG GAAGAGGCGA GGAGCGATCG CGGTTGGGAC
GCGATCGATA GTGGCGGTGA TTCGACGCGC GATGTCGCGA GTGGGTCCGT CGAGGCGCGG
CGAGCGGTGG ACGGACGAGT CTTCGTGGAC GCGCACGCGT TGTGCACGCC CGCCGTCGGC
GACGTGGACG GTGACGGCGA ACCCGAGCTC GTGCTCGCGG TGTCGTATTA TTTCGATTCT
TCCGTGCGCT TTGAAGACGA CGTCGATCCG AAGCAGTACG CCGCGACGGC GTTGGTCGTG
TTGAACGGCG AGGACTTGTC CACCAAGCGC TCGATCGCGC TCGATCAGAG CGCGGCGACG
GCGCTGTTCA AAGCCAGAGC GTACGCCCCG CCGACGCTCG TCGATGTCGA CGGCGATGGA
CGATTGGACA TCGTGATCGG CACGTACGCT GGCGTTTTGC ACGTCGTGGA CGGCGTTTCG
GGGAATCCCT TGCCGGGATG GCCTCGACGG CTCGGACAGA TGGAAGCGCA AGTCACCGCC
GCGGACGTCG ACTCGGATGG TGACATCGAA CTCATCGCGT GCGACGTTCG CGGGACGGTG
GCGGTGTTTA AATCGAACGG CGTCGAGCTT TGGAACAAGC ACGTCGAGTC TCGGATTGCC
GTCGCGGCGA GCGTAGGAGA CGTCGACGGC GACGACGAAA TCGAAATCGT CGTCGGCGAC
ACCTCTGGCG CCGTGCACGC GTTTCGCGCC AAAGACGGCA CCGCACGCGA GCACTGGCCG
GTGTACGTGG GCGATAAAAT TCTCGCCCCG ATCGTGCTCA CGAAACTTCG TCAAACAAAA
CGCGGGCTCG ATGTCATCGT CGCCACGCAC GACGGCGTGG TGCACGTGCT CGAGGGCCGA
AGTCGATGCC GCGACGTCTT TGATATCGCC GAAAAGATTT ACGCCGCGCC GTTTGTCACC
TCTTTTGCCG GCTTCGGCGA ACTCGACGTG ATAATTTCCA CGATGGAGGG TCACGTGCGC
TCGTTCAAGG CGAAAGGGTC AAAGTTTAAT GCGTTGGCGT TGACGTCGAG CGATCACGTG
TCTCGCTACG ATTATTTCGG TGTGGCGCTG TACGATCGCA CTTATCGCGT CATTCGTGGA
ACGTACGTTG ATGTGGTTTA TGAAATCATC GATAGACGCG TGCTCGATCA CGCCAAGGCG
AGGAAATCGT CGCTCGCGCC GTACAAGGTT GCGATTACGG TGACGAGTTT GGATGGTTTC
AAGAAGATCG TCTCCGCTCG ACACGATCGC GCCGGTCGGT ACACGCTTCG GGTGGGCGTT
CCATCGACGA AAACGCGCGG GGAGATCCGA GTGCGCGTTC AAGATGTCAA CTTGATCGCC
GACGAAGACG CGTATTCGGT TTCGTTTCAC GAAAATTACG AGATCGCCTT GAAATGGCTC
GTCGCCCTAC CGTTTCTTCT CGCGAGCGCG GCGGCGATTC GGCGCGCGAG CGAAGACGCG
CTCGAGATGG ATGTCTTTGG CGCGTCAGCC TCGATTCCAA CCACCACGAA GGGGATGCAC
GAGGAATAG
 
Protein sequence
MARGAHARTR SMRALAVTLT LTLSRWRVGA AMTTDGLTSS DHRSRRARGD EDARRARGRP 
ATWTACAGPS RAALELAWRA ETHASVYATP LAEDVDRDGR GAAVTQTTSA RVRAHDGATG
IDLDGESWGA RLGAASRGGV VRIGEAYASA SLAGEARTFD GERTRAVARL APLAMAVDWF
GDDGSGKSDA GDPREVERRH EARRRAPGSR RLLDVEDEDE EEEPTTMSAE EEARSDRGWD
AIDSGGDSTR DVASGSVEAR RAVDGRVFVD AHALCTPAVG DVDGDGEPEL VLAVSYYFDS
SVRFEDDVDP KQYAATALVV LNGEDLSTKR SIALDQSAAT ALFKARAYAP PTLVDVDGDG
RLDIVIGTYA GVLHVVDGVS GNPLPGWPRR LGQMEAQVTA ADVDSDGDIE LIACDVRGTV
AVFKSNGVEL WNKHVESRIA VAASVGDVDG DDEIEIVVGD TSGAVHAFRA KDGTAREHWP
VYVGDKILAP IVLTKLRQTK RGLDVIVATH DGVVHVLEGR SRCRDVFDIA EKIYAAPFVT
SFAGFGELDV IISTMEGHVR SFKAKGSKFN ALALTSSDHV SRYDYFGVAL YDRTYRVIRG
TYVDVVYEII DRRVLDHAKA RKSSLAPYKV AITVTSLDGF KKIVSARHDR AGRYTLRVGV
PSTKTRGEIR VRVQDVNLIA DEDAYSVSFH ENYEIALKWL VALPFLLASA AAIRRASEDA
LEMDVFGASA SIPTTTKGMH EE