Gene OSTLU_119396 details


Gene Information

Locus tag: OSTLU_119396
Symbol: Unk4
ID: 5000210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 200959
End bp: 202743
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table:
GC content: 54%
IMG OID: 640415631
Product: hypothetical protein
Protein accession: XP_001416101
Protein GI: 145342033
COG category:
COG ID:
TIGRFAM ID:
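
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS that ends with a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(200959, 202743)
    print(length)                  # 1785
    print(protein_length(length))  # 594
```

Under these assumptions, the Start bp and End bp values reproduce the listed Gene Length of 1785 bp and Protein Length of 594 aa.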


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCTTCA AAGAGCTCCT CAAAGACTCC GTCACGGAGG TCTTCACCTA CCGCACCACG 
AAGCTCGTGA AGACGAACGA TAAGTTCTTA GTGACGCTGC ACTTCATATT CATGAGCCTG
ATCGCGACGT TCGCGCTGGT GTCGATCTTA TTGAGTCATA ACTACATGCT TTTCGAGCTT
CCGGCGCTGT ACGTGACGAC GACGTATCAG AAATTCGGAC CGGATCCGGT GACGAGCAAC
GTGGTGCTGG CGCGTGAAAT CGCGCAGAAT GCGACTGTGG ATTATTGTAA CACAGGGTAC
GTGAATTGGC GTCGAGGGGG GAAGGTGTTT GACGACGCTT ACATGACGGC GCTCGAATGT
ACGGCGCAGC ACGCGCCGGC TGAGTACGTG TGGCCGTTCG ATGCGGGGAA TGGGATGACC
ATCGGCACGT TTGCAGAAGT ACACTCCGTG ACGAGATCGT GTACATCACC GGGCGCGTCG
ACGTACGGGG CGGCAGGGTG TACGGAGACT CAGACGTCGC CGCAAGCGTA CATGGTGGTC
GGCGTAGACT TGCTGAGCAT GCAGTTAGAT ATTCGGTACC AAACAGCCAC TGGAACTCTT
AACGAGGCGG AAGTCGTTGA CGTAAATGGA GTAGAGTATA TCACCTCGAT AACCAAGCCG
ACGGTCACGT TTCCGCAGTT GTTAGCCATG GCGGGGATCA ACTCGCTCGA CGAAACAAAT
CCCTCAATCG TCGGCGACCG AGGCTCCGTG ACAGGGCTGC CCTATCGCAT GTCGGGCCTG
CGCCTGAACG TGAAGGTGTC GTTTACAAAC ACTTATTTTA GTAAACCGTT GAAGACGACG
GTCACGGCGA CTTTGTCTGC GGAACAAGTG AAGACGAATA ACTTCAACGC GAAGACGACG
GTGACGTATT TGCCTCACCC AACAGATGCC GGGATACATT CCATATCGCG GTACTTCCAC
ACGTGGGCCA CGTCAACCGT CACGTTTGTG TCGTCCGGGC ATATCGGCAA GTTCGACTTG
TTCGCCTTGG CTCTCGCCAT CACGAACGCT TTCGTGCTAA TTGGCATGGC GACGACCATC
GTAGATTTTG TTGGCGTCAT GTCTTCTGAA ACGTTTTTAG ACGACAAGTA CGAAGACGAC
GGCGAACGTT TCGGTTTAGA AATGATGCTA GCGAACATCG AGAACGATGA TCACCCGGGA
GTGCCGTTCG ATCCCAACGA CTTGCGCTTG AAGGACGCTG CCGGCGACCC TGGCCTGAGC
TACGAAAAGA CACTCGAGCA GTTACTCGAC GAGGTGCGGG AAATCCAAGA ACAGCTCAGT
CTGCTGCCGG AAGATGAAAA TGAGCTTCGC GCTATCACTA CTGGCCATGC TGAGGAAGAA
GAGTATAGAA AGCTGCGCTT GATTTACGTC CCTGACCCGC TCTCAACTGA GGCAAACGAC
AAGTCCTACG TTCCTCCAGA GATTTTATTG CACGACGGTC AGCAAACGAT CGGTCGTGGC
ATGGGGGGTA TTGAAAACAA GGGCGTGAGT AGACAGCAGT TCTCAATCGC GGTCATCAAG
GAAAGGATCC GCATGAAGTC TCTGCACGAA GGACCTGGGG TGTGGCGCCA GAGCTCAGGT
CGTTGGGAGA TGCTCCCAGT CGGCAAGGCC GCTGTCTTGA GTGTTGGCGA TCGTTTGTGC
TTTAGAATGC GAGAGGGTAA ATTGGGGGGG CACGAAGGCG TGTTCACGCT CGATTTCCAA
GACACGCGCA TGGAGTGCAC GGTGTTTGGA ATCCCGTTGC GTTGA
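
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and how the database rounds its value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base block formatting."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small demonstration.
    print(gc_percent("ATGGGCTTCA AAGAGCTCCT"))  # 50.0
```

Passing the full gene sequence to `gc_percent` should give a value that can be compared against the GC content field.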
 
Protein sequence
MGFKELLKDS VTEVFTYRTT KLVKTNDKFL VTLHFIFMSL IATFALVSIL LSHNYMLFEL 
PALYVTTTYQ KFGPDPVTSN VVLAREIAQN ATVDYCNTGY VNWRRGGKVF DDAYMTALEC
TAQHAPAEYV WPFDAGNGMT IGTFAEVHSV TRSCTSPGAS TYGAAGCTET QTSPQAYMVV
GVDLLSMQLD IRYQTATGTL NEAEVVDVNG VEYITSITKP TVTFPQLLAM AGINSLDETN
PSIVGDRGSV TGLPYRMSGL RLNVKVSFTN TYFSKPLKTT VTATLSAEQV KTNNFNAKTT
VTYLPHPTDA GIHSISRYFH TWATSTVTFV SSGHIGKFDL FALALAITNA FVLIGMATTI
VDFVGVMSSE TFLDDKYEDD GERFGLEMML ANIENDDHPG VPFDPNDLRL KDAAGDPGLS
YEKTLEQLLD EVREIQEQLS LLPEDENELR AITTGHAEEE EYRKLRLIYV PDPLSTEAND
KSYVPPEILL HDGQQTIGRG MGGIENKGVS RQQFSIAVIK ERIRMKSLHE GPGVWRQSSG
RWEMLPVGKA AVLSVGDRLC FRMREGKLGG HEGVFTLDFQ DTRMECTVFG IPLR
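
The protein sequence can be derived from the gene sequence by translating it codon by codon. The Translation table field is empty in this record, so the sketch below assumes the standard genetic code. The `translate` function is illustrative and is not the database's own method.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGGCTTCA AAGAGCTCCT CAAAGACTCC"))  # MGFKELLKDS
```

The first 30 bases of the gene sequence translate to MGFKELLKDS, which matches the first block of the protein sequence.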