Gene OSTLU_28580 details

Gene Information

Locus tag: OSTLU_28580
Symbol:
ID: 5006496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009373
Strand:
Start bp: 107245
End bp: 108495
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table:
GC content: 65%
IMG OID: 640421917
Product: predicted protein
Protein accession: XP_001422393
Protein GI: 145356345
COG category:
COG ID:
TIGRFAM ID:
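
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(107245, 108495)
    print(length, "bp")                  # 1251 bp
    print(protein_length(length), "aa")  # 416 aa
```

With the values listed in this record, 108495 - 107245 + 1 gives 1251 bp, and 1251 / 3 - 1 gives 416 aa, which matches the Gene Length and Protein Length fields.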


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value: 0.270066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000000135874
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGAGA GGGAAAAACG GTTGGCGGGT CGCGTCGCGT GGCCGAGGGC GAACGGGACG 
CGCGCGAAGG CGAAGGCGAG GACGCGCGCG AGCGCGACGA AGGGCGAGGG TTCGAGCGGC
GCGGCGCGAG CGAACGGCGA ACGCGCGGCG AAGACGGACG CGGTGAAGGA TGCGGAGGAA
GACGTTTCGA ATAATCGACC CGGTGAACGC GGGTTTACGG ATGAAGAGAT CGCCGCGTCG
CGCGCGGCGC TCGAGGAGGA ATTGCGGCGC GGCGGCGCGG ACGAAGACTC GTTGTTGGGG
TTGCGCACCA TCGGCGCCGC GGGGGCGCGC GAGGGGTGGG ACGACGAGGA GGTGGCCGCG
TTCGCGGAGC ACGCGACGCG ATACGGCGAC GATCTCTTTC GCTTGCGCGC CAAGCTGCCG
AAGAAATCGA TGCGCGACGT CGTGAACTAT TACTACAACG TGTGGCAGGT CGGGTTCGCG
AATTACGGTC GCGTCGACGT CGAGGGCGCG GAGGAACCGG CGCCGCGCGA GCGAGCGCGA
CGCGGACCGG CGCCCAAGTA CACCGTCGAA CAGGTGCGAC GGGAGAAGGA TGAAAAGTCG
CTGCGCGGAT TCGTGGATTG GATTCGAGGC GTCGCGATCA ACACCAAGCG CGCGATGCTA
AACGTTCACC GAGCGCCGAC GACGGCGCGC GTAAAGGGCC ACATGATGAC GCGCTGGCGC
ACCGTGACGC GGAGCGAGGA CGCGGACGAC GGCGTCGCGA GAGAGGCGTA CTTGAAGGAT
TTGAAGCGAC GCATGACGGC GGCGAGGTTC ACGAAGGAGG AACAAGAGGC GGCGGCGAGA
ATGAAGACGA AAGCGAAATC GTCCAAAGCG TCGTCCGCAA AGGTGACGAA GACCGCGTCG
AGTGGTGACG CGGCGACGAA GACGAAGAAG ACGGCGTCCG CGCCCGCGGA CGGCGCGCCG
ACGCCGAAGA AACGAAAACG CCGAATCGAC GATGGTCAGC CGAAGATTTG TCGAAATTGC
CGAGCGATGG AAACGAAACA GTGGCGCTTA CCCGTCGAGG GCGCGGGCGT GCTTTGCAAC
GCGTGCGGGT TACGCGATAG AAAACAAGCG AAGAAGAACG AAGCCAGCGC CGCGGGCGAG
ACGGAGCCTA CGCCAAAGGA AAATAAGACG CCCGATCGGG GGAAGGATGG TTTGAAGAAG
AAACGCTCTC CGGGATTGAA ACCGACGCCC GACCGCAACT TTCAGCTTTA G
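
The gene sequence above is printed in blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC percentage such as the one in the GC content field could be recomputed from the block-formatted text. The helper names are hypothetical, and whether the database computes its figure in exactly this way is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as printed in the record.
    sample = "ATGAGCGAGA GGGAAAAACG"
    print(clean_sequence(sample))
    print(round(gc_percent(sample), 1))
```

Passing the full gene sequence text to `gc_percent` in place of `sample` would give the value for the whole gene.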
 
Protein sequence
MSEREKRLAG RVAWPRANGT RAKAKARTRA SATKGEGSSG AARANGERAA KTDAVKDAEE 
DVSNNRPGER GFTDEEIAAS RAALEEELRR GGADEDSLLG LRTIGAAGAR EGWDDEEVAA
FAEHATRYGD DLFRLRAKLP KKSMRDVVNY YYNVWQVGFA NYGRVDVEGA EEPAPRERAR
RGPAPKYTVE QVRREKDEKS LRGFVDWIRG VAINTKRAML NVHRAPTTAR VKGHMMTRWR
TVTRSEDADD GVAREAYLKD LKRRMTAARF TKEEQEAAAR MKTKAKSSKA SSAKVTKTAS
SGDAATKTKK TASAPADGAP TPKKRKRRID DGQPKICRNC RAMETKQWRL PVEGAGVLCN
ACGLRDRKQA KKNEASAAGE TEPTPKENKT PDRGKDGLKK KRSPGLKPTP DRNFQL
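
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. The Translation table field in this record is empty, so the use of the standard genetic code here is an assumption; the function name is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(raw: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: translation ends, no residue emitted
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases and last 12 bases of the gene sequence in this record.
    print(translate("ATGAGCGAGA GGGAAAAACG G"))  # MSEREKR
    print(translate("TTCAGCTTTA G".replace(" ", "").rjust(12, "T")))  # FQL
```

The first call reproduces the opening residues of the protein sequence (MSEREKR), and the final codons TTT CAG CTT TAG give FQL followed by a stop, which matches the end of the protein sequence.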