Gene OSTLU_33487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33487 
Symbol 
ID5003667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp296266 
End bp297450 
Gene Length1185 bp 
Protein Length394 aa 
Translation table 
GC content65% 
IMG OID640419088 
Productpredicted protein 
Protein accessionXP_001419770 
Protein GI145350768 
COG category[C] Energy production and conversion 
COG ID[COG1227] Inorganic pyrophosphatase/exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.265227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0616217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGC GCGCGAACGT CGACGACGGC GCGCGCGAGG CGAACGCGCT GCGAACGTTC 
CTGCGCGACG CCAAGGAGGC GTTCGCGCGC GATCCGGGGG CGTGCGACGT GAGCGTGGGG
AACGAGGCGT GCGATTTGGA CTCGGTGGCG TGCGCGGTGG CGACGGCGCG AGCGGCGAGC
GCGAAGCGCG GACGCGACGA TGGCGAGCGC GAAACGCGCG CGGTGCCGAT CGTGTCGTGC
GCGAGGGAAG AACTGAAATT ACGACCCGAC GTGGTCTTGG CGCTGGCGAA CGCGGGGGTG
AAGTTGGGCG ATTTGACGTG CGCGGAGGAC GTCGCGGCGG CGGCGACGAA GGCGACGCCG
CGAAGCGTGA CGTTGGTGGA TCATAACGCG CTGAGCGCGC GGTTGTTTCC GGACGCGTGG
CAAGCGCGCG TGGTTCGGGT GATTGATCAT CACGAGGATT CGGGGATGTA CGCGGAACGG
GCGGATAGGG TCATCGAGTT GATCGGATCG TGCTCGAGTT TGGTGTACAG GGACGTCGTG
GCGAAAGCCG CGGACGAGGG CGTCGCGCGA GACGTCGCGC GTTTGCTTCT GGGAGCGATC
GTGTTGGACA CGAGAATGCT GGACGCGACG ACGACGCGGG CGGCACCCGT GGACTTTGCC
GCTGCGGAAT CGCTGCGAGA TATTTTGGGA TGGGACGAGG ACGCGACGCG AGCGGAGTAC
GAATCGCTCT CTCGCGCGCG TCACGATCAG AGCTCGTTTT CGTGCGCGCA ACTCTTGGCG
AAAGATTACA AGCAGTGGAC GATGGGGTCG CTCGAGGTCG GCATCGCGTC GTTCGGCGTG
CGGTTTCAGG ATTTGCTGGC GCGACAGGAC GCTTCATCCG TCAACGATGA AATCGTCGCC
TTCGTCGACG CGCGGCGCAT CGACGTGTTA TTTATGATGT CCTCGTTCGA AGACGCCGAC
GCCGACGGCG CGTTCGCGCG TCAGATCGAT GTCACGAAAT CGAGCGCGTG CTCGATCGAG
CTCGAAGCCG TCATGCGCGA CTTGGGCGAG CGAACGCCGC TCGCGCCGCT GCGTCTTCCC
GAAAACGACT TCGGCGTGTT CAAATCCGCG CGCGCGCAGC TCGACGTCAA GGCGAGTCGG
AAGAAAGTCC AACCGATTCT CCTCGAGATT TTAGCGAGAT TTTAG
 
Protein sequence
MATRANVDDG AREANALRTF LRDAKEAFAR DPGACDVSVG NEACDLDSVA CAVATARAAS 
AKRGRDDGER ETRAVPIVSC AREELKLRPD VVLALANAGV KLGDLTCAED VAAAATKATP
RSVTLVDHNA LSARLFPDAW QARVVRVIDH HEDSGMYAER ADRVIELIGS CSSLVYRDVV
AKAADEGVAR DVARLLLGAI VLDTRMLDAT TTRAAPVDFA AAESLRDILG WDEDATRAEY
ESLSRARHDQ SSFSCAQLLA KDYKQWTMGS LEVGIASFGV RFQDLLARQD ASSVNDEIVA
FVDARRIDVL FMMSSFEDAD ADGAFARQID VTKSSACSIE LEAVMRDLGE RTPLAPLRLP
ENDFGVFKSA RAQLDVKASR KKVQPILLEI LARF