Gene OSTLU_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_2239 
Symbol 
ID5003390 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp51627 
End bp53219 
Gene Length1593 bp 
Protein Length514 aa 
Translation table 
GC content59% 
IMG OID640418811 
Productpredicted protein 
Protein accessionXP_001419267 
Protein GI145349702 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0895904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCGAGT ACCGAAAACT TCCAATCAAA CGCTACGCCG CGCGAGCGAA GCGAGAGACG 
GGCGAGGGAA GGTACTGGCG AGAATATAAA TCCACCGCGC TGAGCGAACA GGTGAACGCG
GTGACGAGCG TGTCGTACGG AGGCGCGGGG TCGTCGGGGG GCGAACGCGG CGCGTTGGCG
GCGACGAGCG GGGCGAGGGT GACGCTGTAC GCGCCGAGCG GAGCGAGAAA ATTGAGGACG
TTCGCGCGAT TTAAGGACGT GGCGTACAGT GGTGTGCTGA GAGACGATGG GAAGGCGCTG
GCGGTCGGAG GACAGGCTGG GGTGGTGCAG TTGTTTGATT GCGGGTCGCG AGCGGTTTTG
AGAAAGTTTA CGACGCACTC CGCGGCGGTT CGCGCGGTGC GATGGAGCGC GGATAAGCTG
CACTTAGGGT CGGCGAGCGA CGACGCGACG GTGCGAATAT GGGATATTTC CACTGGGAAT
TGCGTGCGAA GGCACGATGG GCACACGGAT TACGTTCGAG CGCTCGAGCG GAGTACGGTT
TCTCAAGAGA TGTGGGCGAG CGGGTCGTAC GACCACACGG TGAAAATTTG GGACGCTAGA
CAAGGACGCG AGGCGGTGAT GACGCTCGAT CATGGTTCGC CCGTGGAAGA TGTCGCGTGG
TATCCCAACG GAAACTTGCT CGTCTCCGTC GGTGGCGAGG ACGTGTGCGT GTGGGACGCC
ATCGGCGGCG GCCGGTTGCT TCGTCGGTTG CGCAGTCACC AGAAGACCAT CACCACCGTG
CACGTGCACC CGGACGCGGG CCCGCCATCG TTCGCATCTG GATATGAGAT CGGTAGCGAA
AGCGCGCTCG AATCAAACGC GCCTCGCATG ATCACCGGCT CCTTGGACGG CTTCGTCAAG
ATTCACGAAC TCGACACTTT CACCGTGACG CACTCGATCA AGTACCCTGG ACCCGTGCTG
ACGTGCTCGC TCTCGCCAGA CGCGAACTGC CTTGCCACTG GACTAGCCAA TAAAGTATTG
AGCGTTCGCA GACGAACGAA ACCTCGTAAC AGTGACGACC CTTCGGGGTA TCAAGGCGTT
CGAAGTAAAA AGAAGGGCTT CACGGTGAAG AAACCTCGAC GACTGGATGC GAGTCATTGG
CGGTACTTTA TTCGAGGTCA AAACTCTAAA GCGGCGGCAG ACGCCACGCG AGTGCTTCGT
CGACGACGTG TGCACTTGGC CGCGCACGAT CGAATGTTGA AACAATTTAG ATACGGAGAC
GCCCTGGACG CGGCGTTGCA CGTGGGTAGG GCGGAAGTTG TCGCCGCAGT CATAGAAGAA
GTCGGTCGAA GAGGGGGGCT TCAGAAGGCA CTCGCCAACC GCGACGACCA GTCGCTGCTT
CCGATTTTGG AGTATATAGA GAAAAATATC TCCAAGCCGC GCCACACGGC GCAGATGGTG
AACATTGCGA ACCGAATCGT CGACTTGTAC GGTGGCGACG TCGGCGCGAG TTCGGCTGTG
GACAACGCAT TGCGTAGAAT CCAGTTGAAA ATCAAAGCGC AACTGCGACT ACACGAAGCA
TTGACGCAGT TACAAGGGAT GGCTTTGACG ATA
 
Protein sequence
LGEYRKLPIK RYAARAKRET GEGRYWREYK STALSEQVNA VTSVSYGGAG SSGGERGALA 
ATSGARVTLY APSGARKLRT FARFKDVAYS GVLRDDGKAL AVGGQAGVVQ LFDCGSRAVL
RKFTTHSAAV RAVRWSADKL HLGSASDDAT VRIWDISTGN CVRRHDGHTD YVRALERSTV
SQEMWASGSY DHTVKIWDAR QGREAVMTLD HGSPVEDVAW YPNGNLLVSV GGEDVCVWDA
IGGGRLLRRL RSHQKTITTI GSESALESNA PRMITGSLDG FVKIHELDTF TVTHSIKYPG
PVLTCSLSPD ANCLATGLAN KVLSVRRRTK PRNSDDPSGY QGVRSKKKGF TVKKPRRLDA
SHWRYFIRGQ NSKAAADATR VLRRRRVHLA AHDRMLKQFR YGDALDAALH VGRAEVVAAV
IEEVGRRGGL QKALANRDDQ SLLPILEYIE KNISKPRHTA QMVNIANRIV DLYGGDVGAS
SAVDNALRRI QLKIKAQLRL HEALTQLQGM ALTI