Gene OSTLU_31789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31789 
Symbol 
ID5001775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp534635 
End bp535945 
Gene Length1311 bp 
Protein Length436 aa 
Translation table 
GC content61% 
IMG OID640417196 
Productpredicted protein 
Protein accessionXP_001417802 
Protein GI145346658 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00179288 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0182063 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCG TGGAACCTAC GGAGATACAG ACGAAGGCGA TCGATGTCAT CGGTCGAGGG 
GCGGGGAACG CGTTCGTCGC GTCGCACACG GGGAGCGGGA AGACGTTGGC GTACTTGTTG
CCGGTGATTC AACGCATGAA GGCGGCGGAG ATCGCGGCGG GGGATCGGTT GGCGAAACCG
AAGAGACCGA AAGTGGTCGT GGCGTGCCCG ACGCGAGAGC TGGCGGAACA AGTCGCGGAG
GTGGCGAAGG CGTTGAGTCA CGTGGCGAAA TTTAGTTCGT ATTTAGTCGT CGGAGGTAGA
CGTTTAGGGA CGCAAAAGGA GCGGTTAGAC TCCGCGATCG ATGTCGTGAT CGGGACTCCG
GGTCGATTGA TCAAGCACGT CGATCAGGGG AACTTATTCT TGGGGAGCGT GGACGCGATG
GTGTTGGACG AGGCGGACAC GCTCTTTGAA GCCGGATTTG GCGACGAGGT AAAGCGGTTG
TTACGACCAC TCAAGGCGCG TCCAGAGGGA AAGACGTGCG TCCTCGTCTC GGCCACCATG
CCGGATCGAC TAAAGAAGCT CGTGGACGAG GAGCTTCCGG CTTTGCAGTA CATTAAGACG
GATTCATTGC ATCGCTCCGC GCCAGGGCTC AAGCACCGCT TCGTCGACTG TCCGGGCGAC
GTGGACAAGA TGACGGTGCT CGAGCAAATC GTCGCGCCCG AGCACAAACA GGGGAAAAAG
CTGATGATCT TTTGCAACAC GCTTCCCTCG TGCATCGCGG TCGAGCGCAC CATGTTCGAG
GCAGATATTC GCACCGTGCA GTACCACGGC GACATGACGA GCGACGCTCG CGCCGACGCC
ATGCGCGAAT TCATCGACGC CGACGCCGAC GAAAACCTCA CCATGGTGTG CACCGACCTC
GCCGCTAGAG GTTTGGATTT TGGTCGCGTC AAGGTCGATC ACGTGGTGAA CTTTGACTTC
CCCATGAACT CGCTCGACTA CATTCACCGC TCCGGTCGCA CCGCTCGCGC GGGCGCCGGC
GGTAAAGTCA CCAACCTCGT CGCCAAAAAG GACCGCGTTC TCGCGAGCGA GATCGACAAC
GCCGTCAAGC TCGGTCTGCC GATCGACAAC GCCACGAGCT CACGCGCCGT GAGCGAAGCT
CGCAAGAAAA AATCCATCGC CGACGCTCGC GACAGGCGCA CCGGAGGCCG TTCTCGCGCC
AAGCCGAGCA CCGTGCGCGA TTCCAAACCT TCCAACCGCG GTCGTCGCGG CGCCGCGCGG
TTCACCACGG ACGACACTAA GACTAAGCCT TCCAACCGAG GTCGTCGCTG A
 
Protein sequence
MNIVEPTEIQ TKAIDVIGRG AGNAFVASHT GSGKTLAYLL PVIQRMKAAE IAAGDRLAKP 
KRPKVVVACP TRELAEQVAE VAKALSHVAK FSSYLVVGGR RLGTQKERLD SAIDVVIGTP
GRLIKHVDQG NLFLGSVDAM VLDEADTLFE AGFGDEVKRL LRPLKARPEG KTCVLVSATM
PDRLKKLVDE ELPALQYIKT DSLHRSAPGL KHRFVDCPGD VDKMTVLEQI VAPEHKQGKK
LMIFCNTLPS CIAVERTMFE ADIRTVQYHG DMTSDARADA MREFIDADAD ENLTMVCTDL
AARGLDFGRV KVDHVVNFDF PMNSLDYIHR SGRTARAGAG GKVTNLVAKK DRVLASEIDN
AVKLGLPIDN ATSSRAVSEA RKKKSIADAR DRRTGGRSRA KPSTVRDSKP SNRGRRGAAR
FTTDDTKTKP SNRGRR