Gene OSTLU_18031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18031 
Symbol 
ID5005345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp179720 
End bp180730 
Gene Length1011 bp 
Protein Length336 aa 
Translation table 
GC content74% 
IMG OID640420766 
ProductZIP family transporter: zinc ion 
Protein accessionXP_001421227 
Protein GI145353880 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0428] Predicted divalent heavy-metal cations transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.265806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0301114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGGC GCGCGGCGAC GGCGGTGACT CTCGCGGTGA CGCTCGCGTC GCTGGTCCGC 
GCGCGCGCGC CGCGCACGGC GCGCCGCGGC GGCGACGAGG GCGACGCGCG GGCGGCGGGG
GCGTGGACGG TCGGGGACGC CGCGCGCGCG ACGCTGGCGT CCGCGTGCGT GTCGCTCGCG
TCCCTGGTCG GGTGCGCGCT GATGGCGATC GGCGCGCGCG CGGACGCGCT CGCGTGGCTG
AGCGACGCGG CGATCGGGGC GATGCTCGGG GACGCGCTCG GGTGTCAACT GCCGTCGGCG
CTCGAGGCGG CGACGCGAGC GCGAGGACGC GACGGCGCGG GCGTCGCGGC GTGCGCGACG
ACGTGCGGCG TGCTGGCGTT TCATCAGTTG GAGGTGATCG TGCGCGCGGT GAAGGCGCGA
AACGATGGGA AAGTGGGGAC GACGCGGCGG CGTCGAACGC CGAGCGAAAG CCGAAGCCGA
AGCCGAAGCC GAGGCGCGAG TGGTCGAGCG CGCGAGCGAC GCGCGGCGGC GCGAGAGATC
GCGGCGAGCG GATGGCTCAA TCTGTTCGCC GATGCCGCGC ACAACTTCAC CGACGGCGTC
GTGATCGCGA TCGCGTTCGC CCGGCGCGGC GCGACGCGCG GCTACGCCGC GGCGTGGACG
ACGCTCGCGC ACGAGCTTCC GCAAGAGCTC GGCGACTACG GCATCTTACG ACGCTCGGGA
TTCACCGACG TCGAGGCGTT ATGGTTCAAC TTTCTCTCCG CCCTCGTCGC CGTCGGCGCG
ACCGCGCTCA CGTTCCTCGT CCTGGCCGCG CTCGACGCCG CGAGCGCCTC CGCGTCGTCC
TTCGCCCGAC GTCTCGCCCT CGACGTTCCC TACCTCGTCG AGGCCTTCTG CGCCGGCGGG
TTTCTCACCG TCGCCTTCAC CGCCCTTCGC GAGGACGATT CGGGATCCGC GTTCGCGCGC
GTTCGCGTGT TCGTCGCCGC CGTCCTCGTC GCGCGTCGCG GCGCCCACTG A
 
Protein sequence
MSRRAATAVT LAVTLASLVR ARAPRTARRG GDEGDARAAG AWTVGDAARA TLASACVSLA 
SLVGCALMAI GARADALAWL SDAAIGAMLG DALGCQLPSA LEAATRARGR DGAGVAACAT
TCGVLAFHQL EVIVRAVKAR NDGKVGTTRR RRTPSESRSR SRSRGASGRA RERRAAAREI
AASGWLNLFA DAAHNFTDGV VIAIAFARRG ATRGYAAAWT TLAHELPQEL GDYGILRRSG
FTDVEALWFN FLSALVAVGA TALTFLVLAA LDAASASASS FARRLALDVP YLVEAFCAGG
FLTVAFTALR EDDSGSAFAR VRVFVAAVLV ARRGAH