Gene OSTLU_16081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16081 
Symbol 
ID5002820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp256834 
End bp258477 
Gene Length1644 bp 
Protein Length547 aa 
Translation table 
GC content60% 
IMG OID640418241 
Productpredicted protein 
Protein accessionXP_001418886 
Protein GI145348911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.355466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGAC GGGCGGAGGA GGAGACGTTT AGGCCGCGAA CGCACGGCGA GGGGAGACGA 
GACGCGGGAA AGCGGGCGTC GTTGGAGAAG TTGGCGGCGC CGAGGACGGC GTTGTGGGAA
CGATCGGCGA GCGTGAAGAA GGAGAAAGAT GAGGCTGTGT TTGCGGAGAA TTGCACGTTC
GCACCGAAGG TTGGTCGAGG GCCGAAGACG CCGTCGACGA AACCGGCGGC GGAGCGATTG
TACGAGTACG CGGAGAAGCG ATTAGAGACT AGGGAGCGCG TACAGGCGCG CGTGGTGGAG
GAAGAGATGG AACTTTTGAC GTTCAAGCCG ACAGTTAACG TCAGGACGAG CGCTGGGATT
CGAGACAAAG TCGCCAAGAC GCCGCCGTTG CACCGGCGCG TCGCCGACGT GTTGCGCGCC
AAAGAAAACG TGAGAACAGA GGCTCGTTTA AAGGTGGAAG ACGAACTCGC CAAGGCGCAC
ACGTTCAAGC CCACGATCAA TCCGACGAGT GTGATTCTCG CCATGCAGCG TGCCGAGATT
CAGAAGGCTA TGGAAGATGA TGACGACGCC GGCGACGAAG CGCCAACGAT TCACCGCAGA
CGATCGTCGG CGTTGGACGG CGAGGATGAA AATCTCACTT TTGCGCCTAA AATCACGCGC
GAAAGCGAGC GCGTGGTCGA CGAACTTGAG CGTCAAGGCA AACTAGGCGC CGGCTTCCTT
GAACGACAGC GTGACTTCAG CGAAAAAGTC GCGCGGCGCG CCCAGGAAAA GCGCGCTATG
GTCGACGACG AATGCACATT TATGCCAGAT ATCGGCAACG CCGCCAGCGT GTTGCGCCGA
GGGCGGCACG TGTACAAGTT GCTCGAAACG CCCGAAGAGC GCTCAGATCG ATTGGCGGTG
AAAGACGCCG AGCGTAAGCG TGCCGCGCAA CGCGTCCGCG AGCGAGAGCA CTACGCGCAG
TTTACGTACC AACCCGAGTT GAATGAAAAG TCGTACGAGC TCGCGCCTCA CGGTAGTACG
ATCGACGATT TGGCGCGCGA CGAGCGACGG GACTTGGCGC GACGACGCGC GCAAGCCGAG
CTCGAACGTG AGTTCCGAGA ACAGCACACG TTCGAACCAA ACCTCGATCG GTCGAATGAG
GCCCGAAAGG CTCGCGATAC GAGTCAGTTC GCGATGGATT ACGGCGTCGG CGGCGATGCG
GTGAGCGCTC GCATCGAAGC ATACCGGCAC GAAAAAGAAA CTGCGCTCGA AAATCTCCGT
AGGCGCGCCG AGTATCGCGA GTTGGAGCAG TGTACGTTTC GTCCCGAGTC CATCGCGCGA
GAGCCTCGCG CCATGGGCTC CCCGTCCGCG TCGAAAGTCA AAGGTATGGA TTCGTTTTTG
CGCAAGCAAG CCAAGGCGCG CGAGCTCGAA GAAGAAAAGC GCGAGCGATA CGCCAAGGCT
TTCCTCGAGA ACTTGGACGA TTTCGACCGA TGGGGCCGAC GGACGATTCC CGAGCCGTTC
ACCGGCGCCT TCGCCGAAAA CGTCGTCGAA AAGGCTGAGG CTCGACGCAA AGCGTTGGCG
GAGGAGCACT TGCGGCGCGA GTTGGAGGAG TGCACGTGGG AACCGGCGAC GAATCATTCT
CGCAAGTCTA CTAGTATTAA ATAG
 
Protein sequence
MARRAEEETF RPRTHGEGRR DAGKRASLEK LAAPRTALWE RSASVKKEKD EAVFAENCTF 
APKVGRGPKT PSTKPAAERL YEYAEKRLET RERVQARVVE EEMELLTFKP TVNVRTSAGI
RDKVAKTPPL HRRVADVLRA KENVRTEARL KVEDELAKAH TFKPTINPTS VILAMQRAEI
QKAMEDDDDA GDEAPTIHRR RSSALDGEDE NLTFAPKITR ESERVVDELE RQGKLGAGFL
ERQRDFSEKV ARRAQEKRAM VDDECTFMPD IGNAASVLRR GRHVYKLLET PEERSDRLAV
KDAERKRAAQ RVREREHYAQ FTYQPELNEK SYELAPHGST IDDLARDERR DLARRRAQAE
LEREFREQHT FEPNLDRSNE ARKARDTSQF AMDYGVGGDA VSARIEAYRH EKETALENLR
RRAEYRELEQ CTFRPESIAR EPRAMGSPSA SKVKGMDSFL RKQAKARELE EEKRERYAKA
FLENLDDFDR WGRRTIPEPF TGAFAENVVE KAEARRKALA EEHLRRELEE CTWEPATNHS
RKSTSIK