Gene OSTLU_92854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_92854 
SymbolSDG3513 
ID5002266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp136942 
End bp138861 
Gene Length1920 bp 
Protein Length639 aa 
Translation table 
GC content61% 
IMG OID640417687 
Productpredicted protein 
Protein accessionXP_001418163 
Protein GI145347416 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.798313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0275457 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGCC CGTTCGAGCG CTTCGACGTC CCGGGCGCGC GCGACGGCGC CGTTTCGAAC 
GAGCGCGGCG TGCGCGCGAC GCGAACGATC CCCGCCGGCG CGCGCATAAT GACCACGGAG
CCTTACGCCG CGGCGCTGCG CGCGGAGAAG CGCGAGAGCC ACTGCGCGTG GTCGTTTCAA
CCGCTGCGCC TCGGCGCGCG CGCGCGCGTC GCGCGCGCGG CGTGCGGCGC GGCGTTCCGA
GACGAAGAGA GCCTGAAACT CGCGAACACC GTCGAGGCGT TCGAACGCGC GAGCGCGTGG
GTGCGAAAGT CGAGTCGAGG AACGCCGGAG ACGTCGGCGC GGTGCGCGCT GCAGTGCCTG
GCGCGACGCG CGGGCGAGCG GGACGGGACG TGCGAACGCG CGCGGTACGA ACTGTTGGGA
GAAGACGCGC GGGGATTCGA TGGGGTGTGG GCGCTGAGGG AATCGAGGGG ACGCGATGGG
AAGAGCGCGC GGGAATTGGA GAAGGCGATG GAAATCGCGA GCGCGGCGAC GGTGGTGGCG
GCGCTCGCGG GCGAGGCGAT GGTGAAGGGG GCGACGCCGG AGGAGATAGA GACGAAAATG
CTCAACCTGA AGGTCGAGTC GGGGGTCGAC ACGCAGTTTG TGATCTCGTT GTTGTCGCGA
TTTGAAATTA ATGGGTTCAC CATCGCGGAC GACGACATGC AACGCGTCGG TTTCGGGATT
TATCCCGAGG CGTCTCTGTT TAATCACTCG AGCACGCCCA ACGCGCAGGT GATGTTCAAG
GGTAAGACGC TCGTGGTGAA GACGTTGAGG GAAATCGCGG TCGGCGAGGA AATCACGATC
TCGTACGGCG AGCAGTACAT GCCGCGAGAA TGGACGAGAC GCCGGATGCT CTCGTCGTAC
GGTTTCGACG CATATGCGGC GTACCCCAAG TATGAAGTCG CGGAAGCGGC GCGGCGACGA
GTCTTGGACG CCGCGACGCG AACGCGGCTT CCCATGCGAT TGGGCGAACT AGTCGATTTA
GGCGAGGACG TGTGCTGGTA TGCGGGCGAA CTTCTTCCCG ACGAAGATCT CGCGCGCGAC
CGGTTTTGGC ACCAACTCGA CGTCGACGCG TACGGGGACG AGTTTGCAAA CTCTGGAATC
ATGCTCATCA AAGACGAATC ACGCGCACGC AAAACGAGCG ATGATGACGA CGACGATCAC
GACGACGGTT GGAACGAAAA CAATGAGATC ATGATTTGGG GCAAATTCCC AGAACATTGC
GACCGCGAGC TCACCGCCAT TAATTTTGCC AACGCCGCGC GCTCGCTCGA ACTTCTCGGC
GCGGATGGCG AAGACGACGA CGAGGACGAC GATCGCGATC CCGTGATCGC ATTGCAGGGC
TATGAGAAAG TGGCGCGCGC GTTGTTATCA GGAGACGATA AAGCAGCCGC AGTGGGAAGA
AATCACGAGA TATTAAAGCA CGTCAATCTA AAACGGACGT TGAAGTTGAC GGAACTCACG
GCGCGCGTGA TGCGAGATTT TGATGAACGC AAATCTGCGT GTTTCGTTGC CGTGGTGGAA
AGTTCCGTGG GGGCATTTCG CGCGTGTCAA GCCACTGAAA CGGTGTACAA AATGAGCGCC
GGATTCAGCC CGTTCGATTC CGTGTACGTC CATCTTAAAT TTCAAATGCT CAAGCTCGGC
GTCTTGGCGT TGGGGTACCT CGCGCACCTG TGCGATCAGA GCGGTGCGCG CGCTGACTAT
CGCAAGCTCG CTCGTGAGTT GTGCCGACAC GCGATTCTTA CCCATAACGA ACTCAAAACT
GTCATGAGTA AAGCGAGCTG CGACGGCATG GTGATGCACA ACGAGTGGAG TCGAGATGCT
CAATCACTTT TCGCCGACTT GAGCTTCATT CGTCAACGCA TCCAACAGTG GGGGAAATAG
 
Protein sequence
MSRPFERFDV PGARDGAVSN ERGVRATRTI PAGARIMTTE PYAAALRAEK RESHCAWSFQ 
PLRLGARARV ARAACGAAFR DEESLKLANT VEAFERASAW VRKSSRGTPE TSARCALQCL
ARRAGERDGT CERARYELLG EDARGFDGVW ALRESRGRDG KSARELEKAM EIASAATVVA
ALAGEAMVKG ATPEEIETKM LNLKVESGVD TQFVISLLSR FEINGFTIAD DDMQRVGFGI
YPEASLFNHS STPNAQVMFK GKTLVVKTLR EIAVGEEITI SYGEQYMPRE WTRRRMLSSY
GFDAYAAYPK YEVAEAARRR VLDAATRTRL PMRLGELVDL GEDVCWYAGE LLPDEDLARD
RFWHQLDVDA YGDEFANSGI MLIKDESRAR KTSDDDDDDH DDGWNENNEI MIWGKFPEHC
DRELTAINFA NAARSLELLG ADGEDDDEDD DRDPVIALQG YEKVARALLS GDDKAAAVGR
NHEILKHVNL KRTLKLTELT ARVMRDFDER KSACFVAVVE SSVGAFRACQ ATETVYKMSA
GFSPFDSVYV HLKFQMLKLG VLALGYLAHL CDQSGARADY RKLARELCRH AILTHNELKT
VMSKASCDGM VMHNEWSRDA QSLFADLSFI RQRIQQWGK