Gene OSTLU_33188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33188 
Symbol 
ID5003191 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp527893 
End bp529526 
Gene Length1634 bp 
Protein Length510 aa 
Translation table 
GC content57% 
IMG OID640418612 
Productpredicted protein 
Protein accessionXP_001419191 
Protein GI145349544 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.860325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.52884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCC CCGCGCGATC GTCCAACAGT CGAGTGAAGC GCGCGAAAAC GGCTCAAAGT 
TCCGCCGACG TGGTCGCTTC CGCCCCGGAA GTCGGCGCTT TGTACACGGA AGACTATCGC
GTGAAAGGTT TGCACGTCCG CGATCACTTC ATCGCGGTGC GTCCCGACGC GTGTCGATGC
GTTTTCGACT CGACGCGGCG CGCGCGGCGC GCGAGGGCGA AAAACTGCTG ACTGACGGCG
ACGGCGACGC GACGCAGGTG CCCGTGTGTC ACGCTCGAGG CGATTCGAAC GCGATGCGCG
TGTTTTTTCG AGAAGTGGTG ACGGCGGCTC GAGGGAAACT CACGAGCGAG GAGCGGAAGT
CGCTGCCGGC GGTACTGTTT TTGCAAGGCG GACCCGGATT CGAGTGCGCG GGACCGCTCG
AGGCGAGCGG CTGGTTGGGG GAAATGGTCA AGGAACATCG AGTGTTTTTG ATGGATCAAC
GAGGCACGGG ACGTAGCGAC AGCGAGATTG TCCATCCAAC GCTCAACCGG GATGCGTCTG
GACATCCCTT GTCGTACCCC AGACATTGGA CCGACAAAAA CACGTCGCCG GCGAAGGCGT
GGGCCGTTCA CTTGAAGAAT TTCCGAGCAG ACAGCATCGT GAAAGACGCC GAGTTGTTCC
GTAAGACGGT GCTCGGTGAA GATGTGAAAT GGACGCTGCT TGGGCAATCC TTCGGCGGCT
TTTGCATCAC GACGTATTTG TCTTTCGCTC CCGAAGGCGT GAAAGAAGCG CTGCTTACTG
GCGGTTTGCC TCCGCTCATC GACGAACCAG CGAGTGCGTT AAACGCGTAT CGAAAATTGT
TCGAGCGCGT TCAGACGCAA AACAGAAAAT ATTTCGAAAG ATTTCCGTAC GATGTCGACC
GCCTCTATGC GCTGTATGTC CAACTTCAGA ACGAAGGTCC ACGGATCTTG CCCGGCGGTG
GGTTGCTCAC CGTTCCGCTG GTACGAGCGC TCGGTTTTTC TAATTTAGGC ACAGCGCAGG
GGATGGAGAG GTTGCATTAC ATCATGCAGT ACGTAGAGAT TCATTATGCC GACGAAGAAA
TTGTTGGAGC TCACTTGCCG CACAAGTTTT TGATTGAAGT GGAAAACTCG TTTAGGCACT
TCGAGACGAA CCCGTTGTAC GCGGTCCTGC ACGAGGCGAT TTATTGCAAC GGTGCGTGCG
CCATCGGTGC GGCCGACCAA GTGTGGCTCG AGCGAGTTGG CGAAGATCTT TACAGCGCGT
TTGGTGAGCC TGAGTCTGAT GACTCATTGC GCCGTGCGTT TACGGGGGAG TGTGTTTACT
CTTCATTCTT CGAAGACATC CCATCGCTTC GCCCGTTCAA AGCGATCGTT CAAGAGCTCA
AAAAGGACAA AGATTGGCCA AAGGTGTTGT ACGATACGGA GCAGCTGGCC AAGAATACCG
TCCCAGTGGC GTGCGCGAGT TACGTCGAGG ACATGTTCGT CGATTTCGAT CTCGCCTCTG
AAACGGCTGC GAAAATTCGG GGCGCCCGCG TCTGGAGCAC CAGCGAATAC ATGCACTCCG
GCATTCGCGA AGACGGCGCC CGCATCGTCC AAAAGCTGTT GTCCTTCGTT CGCGACGAGG
ATCCAATTCG TTAG
 
Protein sequence
MTSPARSSNS RVKRAKTAQS SADVVASAPE VGALYTEDYR VKGLHVRDHF IAVPVCHARG 
DSNAMRVFFR EVVTAARGKL TSEERKSLPA VLFLQGGPGF ECAGPLEASG WLGEMVKEHR
VFLMDQRGTG RSDSEIVHPT LNRDASGHPL SYPRHWTDKN TSPAKAWAVH LKNFRADSIV
KDAELFRKTV LGEDVKWTLL GQSFGGFCIT TYLSFAPEGV KEALLTGGLP PLIDEPASAL
NAYRKLFERV QTQNRKYFER FPYDVDRLYA LYVQLQNEGP RILPGGGLLT VPLVRALGFS
NLGTAQGMER LHYIMQYVEI HYADEEIVGA HLPHKFLIEV ENSFRHFETN PLYAVLHEAI
YCNGACAIGA ADQVWLERVG EDLYSAFGEP ESDDSLRRAF TGECVYSSFF EDIPSLRPFK
AIVQELKKDK DWPKVLYDTE QLAKNTVPVA CASYVEDMFV DFDLASETAA KIRGARVWST
SEYMHSGIRE DGARIVQKLL SFVRDEDPIR