Gene OSTLU_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_1794 
Symbol 
ID5006094 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp381266 
End bp382558 
Gene Length1293 bp 
Protein Length421 aa 
Translation table 
GC content62% 
IMG OID640421515 
Productpredicted protein 
Protein accessionXP_001421924 
Protein GI145355346 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00327182 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
CCGTTTACGA CGTTCGACGA GGCGGCGTTT CCGAAACCGC TGCGAGCGGC GCTGAAAGCG 
CAGGGATACG ACGCGCCGAC GCCGATTCAG GCGGAAGCGT GGCCGATATT GCTGAAGGGG
AAGGATGTGG TGGCGATCGC GAAGACGGGG TCGGGGAAGA CGTGTGGGTT TTTGTTGCCG
GCGCTGGCGA GCATCGTGGC GAGAGGATCG CAAAAGGCGC CGGAGATGCA GTTGCTCGAT
GGACGATGGC GTCCGGGGGC GGTGACGCCG ACGGTCATCG TGTTAGCGCC AACGCGGGAG
TTGGCGATTC AAATCCACGA CGAGTGCGCG AAGTTTTGCC CCGCCGCGGG GTGCCGCTCG
GCGGTGCTCT ACGGCGGCGC CGCCAAGGGC GATCAGTTGC GCGCGTTGCG TTCGGGCGCC
GACGTCGTCG TCGCCACGCC CGGGCGATTG AACGATTTTC TTGAACCACC CCCGGGATTC
ACCGCGCCCG TGAGCGCGGT GAAGGCGTCG TACGTCGTCC TCGACGAGGC GGATCGAATG
TTGGACATGG GATTTGAGCC GCAGATTAAA AAGATTTTCA AGCTCTGCCC GTCGGCGCGT
CAGACGGTGA TGTTCACCGC GACGTGGCCG AAAGCGGTGC AAAAGATTGC AGACTCTTTC
ACGACGAAGC CGATTCACAT TCAAATCGGT AGCGGCGGCG ATAAACTCAC GGCGAATAAG
TCGATTACGC AAACCGTCGA AGTACTCGAG GAGGAGGAAA AGTTTGACCG TTGCGTCGCC
ATCCTGAAGA AGGAGCTCGG TAAGGACGAC ACGTGCATTA TGTTTGCCGG CACAAAGCGT
CGATGCGATT TTTTGGACCG CAGATTGAAG CAGTCTGGGT TTTCCTCCGC CGGCGCTATT
CACGGCGACA AGGACCAATA CGAGCGCGAG ATGGTCCTCG ACAACTTTCG TCGCGGTCGT
GGCAATATTC TCGTCGCCAC TGACGTCGCT GCGCGTGGTT TAGACATTCC TGGCGTCGCA
GCGGTTCTCG TGTACGATTT TCCGCTCCAA GTGGAGGATT ACGTGCACAG AATCGGTCGC
ACCGGACGCG CCGGGAAGGA GGGCAAGGCG TTCACCTTCT TCACTAAAGA TAACCGTGGC
GCCGCAAACG AGCTCATCGA TATCCTCCAA GGAGCCGGAC AAACCGTACC TTTGGCGCTC
CAAGCGATGC AGCGCAAGGG CGGCGGCGGC GGAGGCGGCC GCGGTTGGTC GGGCGGCCGA
GGCCGAGGCG GCGGCCGAGG CCGAGGCGGC GGT
 
Protein sequence
PFTTFDEAAF PKPLRAALKA QGYDAPTPIQ AEAWPILLKG KDVVAIAKTG SGKTCGFLLP 
ALASIMQLLD GRWRPGAVTP TVIVLAPTRE LAIQIHDECA KFCPAAGCRS AVLYGGAAKG
DQLRALRSGA DVVVATPGRL NDFLEPPPGF TAPVSAVKAS YVVLDEADRM LDMGFEPQIK
KIFKLCPSAR QTVMFTATWP KAVQKIADSF TTKPIHIQIG SGGDKLTANK SITQTVEVLE
EEEKFDRCVA ILKKELGKDD TCIMFAGTKR RCDFLDRRLK QSGFSSAGAI HGDKDQYERE
MVLDNFRRGR GNILVATDVA ARGLDIPGVA AVLVYDFPLQ VEDYVHRIGR TGRAGKEGKA
FTFFTKDNRG AANELIDILQ GAGQTVPLAL QAMQRKGGGG GGGRGWSGGR GRGGGRGRGG
G