Gene OSTLU_31845 details


Gene Information

Locus tag: OSTLU_31845
Symbol: SDG3511
ID: 5001720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 632728
End bp: 634692
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table:
GC content: 62%
IMG OID: 640417141
Product: predicted protein
Protein accession: XP_001418071
Protein GI: 145347218
COG category:
COG ID:
TIGRFAM ID:
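
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 632728
end_bp = 634692
gene_length_bp = 1965
protein_length_aa = 654


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks hold for this record: 634692 - 632728 + 1 = 1965 bp, and 1965 / 3 - 1 = 654 aa.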


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.120441
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGC GCGACGACGC GCGGGACGAC GACGACGCGC GGCGCCTCGA GGCGGGCGCG 
CGCGCGCTCG AGGCGGCGCT GACGGTGGAC GACGACCCGC TGCGGTCGGG GGCGTCGGAC
GTCGAGCGAC GACGACCGGC GTCGGCGCGG CGCGGACGAG GCGCGAGAGC GCTCGACGCG
AGCGAGTGGG TGGCAAAGTT CGGCGTCGCG CGCAGGGGAG GGTGCGATGG ACGCGTCGGG
ACGCGGGCGA CGGGGGCGAC CGAGGCGACG AGGGCGGCGA CGGCTGGAGG CGGCGGCGGC
GAGGGAGGGA CGACGACGAC GAGCGGGAGG CTGGGATTGG TGGCGAAAGA ACGCGTGCGG
GCGTGTGAGG TGCTGATCAG AGAGCGAGTG ACGTCGATTC AGGGGACGTT TCGGAGTCAA
GGGGCGTGCC TGGCGAGACT CATGGAACGC TTGCGAGAAC GCGGAGAGCA AGACGCGAGT
CGAGCGTGCG CGCCGTCGAG GGAGATGGCG AAGAAACTGC TCTCGAACGC TGAAGAAAGA
GAGGAGGTTT TAGCGCTCGG GGCGAAATTA GGCGTGGAGG ACGAAGAGGA GTGGTTAAAG
TTTGCGGCGT TTGTCAAATA TAACGCCGTG ACGAGAACGT GTCGACCGGC GACGACGAGC
GCGGGCTCGA TGATAGGCGC CGATGCCTAC GTGCAGTGCT CAATGGTGAT GTTTGACGCA
ATTAGCGCGT GCAACCACTC GTGTGACCCG AACGCGGAGG TGAGTCACGT GTCCGACGAG
GGTGAGGTGT CGTTGTATTC GCTTCGCCCG ATTGAGCGCG GAGAGGGGAT AACAATTGCG
TACGGAAAGC CTTCCCTCCG TTGGCTTCCA GCGCGCTGTC GAAAGAAAGC TTTACGCAGG
GATTGGTATT TTGATTGCGC GTGCGCGCAG TGCAAGGCGG AAGTCGCCTC GGGATTAGCT
GTAGATAAGC CTTTGGCGCG ACCGTGGGAC ATACAAGATC CGCGATGGTT CTTTTGCACA
CACGACTACG TCACGGGATT CGAGTCGCAC TTTGACGGCG AAGGCAACTT ACTCTCGCTG
AAATCGGCGA CTGCGTCTCG ATCGAATGTT TCACCGAGCG CGAGCGGGGC AAACACGTCT
GAATCTGCCG CAGACGGCGT CGCCGCGGAA TTCGGCGCCA AGTTGCACGT TGGCGGCTTG
GGACTCGCAG CCGATGCGAG CGGTAGTAGT AGTGGCTTAT GCGATTCATC TGATTCGTCT
TCGTGCCAGT CGGTGGATTT AGACGAATAC GACCGCGAAT CGAGCGGAAG CGAGGACGAA
GACGTCCTGC GATGGCACGA GCGCTGGCGC TCAAAACGCA TCAAGGCATA CAGCGTCAAG
AACGTTGGAT TGTATACCCC TTTGCAGCTT TATCACGCCA TGCAGCGCTG TCAAATCAGA
CAGGATCATT GGCAACTTCT TGTCGTGCGT GAGGCTTTGA TCAGTCAAAT CATGAACGAC
GCGGCGATGA AAGGAAACGC GTCGTCTTCG CGCCCCAACG CTCCGAGTCT TGACGGCGGA
TGGGGCGAGC GCGGGAAGTT CCAAGCCTTC AAGCTCATCT TAAATCAGTG TAGAAGTCTG
GCGCGTATGG CGCCGAACAC AGGAAGTTTT GCAGAATTAT TTGCGACGCT CGAAAACATC
GTGTACTGGT GGAGCACCGA CGGCTGGCAC TACGTCTCGA CGAAGCGCGA GCGCCGCGAG
CAGGAGCGCG CTTTCTCGCG CGCTCGCGCG AGACGCGCGA GAAACGGCGC GGGAGAAACG
GCGCGGGAAT CCGATCGCTC CGATTCCGTC GATGACGATG AATTTTTCGA AATTCCCGTC
CGTTCGCGAT GGGCGTTTCG CTTGGAACGC CTGCGAGACG CCGCGCACGC CGACGTCTTG
GCGTGGAACA TGCAGTTTGG GAAGCTGCCG AGCGCGCCGT TTTAG
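
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The record does not state how that value was derived, so the G+C over total length definition used here is an assumption, and the function names are hypothetical.

```python
def clean_sequence(text):
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed row of the gene sequence, used here as sample input only.
first_row = "ATGACGGCGC GCGACGACGC GCGGGACGAC GACGACGCGC GGCGCCTCGA GGCGGGCGCG"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence instead of the first row gives a value that can be compared with the GC content listed in the record.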
 
Protein sequence
MTARDDARDD DDARRLEAGA RALEAALTVD DDPLRSGASD VERRRPASAR RGRGARALDA 
SEWVAKFGVA RRGGCDGRVG TRATGATEAT RAATAGGGGG EGGTTTTSGR LGLVAKERVR
ACEVLIRERV TSIQGTFRSQ GACLARLMER LRERGEQDAS RACAPSREMA KKLLSNAEER
EEVLALGAKL GVEDEEEWLK FAAFVKYNAV TRTCRPATTS AGSMIGADAY VQCSMVMFDA
ISACNHSCDP NAEVSHVSDE GEVSLYSLRP IERGEGITIA YGKPSLRWLP ARCRKKALRR
DWYFDCACAQ CKAEVASGLA VDKPLARPWD IQDPRWFFCT HDYVTGFESH FDGEGNLLSL
KSATASRSNV SPSASGANTS ESAADGVAAE FGAKLHVGGL GLAADASGSS SGLCDSSDSS
SCQSVDLDEY DRESSGSEDE DVLRWHERWR SKRIKAYSVK NVGLYTPLQL YHAMQRCQIR
QDHWQLLVVR EALISQIMND AAMKGNASSS RPNAPSLDGG WGERGKFQAF KLILNQCRSL
ARMAPNTGSF AELFATLENI VYWWSTDGWH YVSTKRERRE QERAFSRARA RRARNGAGET
ARESDRSDSV DDDEFFEIPV RSRWAFRLER LRDAAHADVL AWNMQFGKLP SAPF
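
The protein sequence can be compared with a translation of the gene sequence. The "Translation table" field of this record is empty, so the sketch below assumes the standard genetic code (NCBI translation table 1). The function and variable names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene and first 10 residues of the protein.
gene_start = "ATGACGGCGC GCGACGACGC GCGGGACGAC"
protein_start = "MTARDDARDD"
print(translate(gene_start) == protein_start)
```

The same comparison can be run on the full gene and protein sequences after removing the block spacing from both.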