Gene OSTLU_2171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_2171 
Symbol 
ID5002273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp479282 
End bp480685 
Gene Length1404 bp 
Protein Length468 aa 
Translation table 
GC content63% 
IMG OID640417694 
Productpredicted protein 
Protein accessionXP_001418496 
Protein GI145348104 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes
[COG0801] 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase 
TIGRFAM ID[TIGR01496] dihydropteroate synthase
[TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.344916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.14171 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGCGC TCGGATCGAA TCAAGGCGAT CGCGTCGGGT TGTTTCGCGA CGCGTTCGCT 
AAACTCAGGC GTGACCTCGG TTTCGAGCTT CACGCGCATT CCTCGCTCTA CGAGACCGCT
CCGGCGTACG TCGAGGACCA GGGGAAGTTT TTGAACGCCG CGTGCGTCGG ATCTTTTCCC
GACGACGTCG CGCGAGATCC GCTGGCGCTG CTGGATGGGC TGAAAGCCAT CGAGGCGGCG
CTGGGGCGAG ACTTCGGGAC GCGGCGGTAC GGACCGAGGC CGATGGATTT GGACGTCATA
TTCCACGGGC AAGGCTCGCA TTCGTGCGAC AGATTGACGG TTCCGCACGC GCGCTACGCC
GAGCGGCCGT TCGTGCTGGC GCCGCTCGCA GATTTGACGG GCGCAGCGAC GGCGGCGACG
ACGAGCGACG CGACGACGGA AGGGCTGCTC GAGGCTCGGA GGATTTGGGA CGGCACCGAT
GGAGAGGTTA CGGCGATGGA AAGTGGTGAT ATAGCGCGCG TGATCCCGAT GAGAGACAGA
TTGTGGAGCT GGGGTCGAGA GACGATGGTG ATGGGTATTT TGAACGTGAC ACCGGATTCG
TTTAGCGACG GCGGCGCGTA CGACGGCGGC GTGGACGTGG CTGTGCGACA CGCCAGGGAA
ATGGTCGCCG CGGGGGCGAC GATAATAGAC GTTGGTGGGC AGTCGACGCG ACCAGGGGCG
ACGAGGGTGA GTGGAGAAGT AGAGAGTTCG CGGGTGATCC CCGTCATACG CGCGCTCGCT
CAAGCGTTTA GCGAAAGAGA AGACGTTTAC ATCTCTGTAG ATACGTTTTA TGGCGCCGTC
GCGAGCGCGG CTGCGGATGC TGGGGCTGAC ATCATCAACG ATGTCAGCGG CGGAGCGTGG
GACCCCGCGA TGCTACCGAC GGTGGCGCGT TTGGAGAAGC CTCTGCCGTA CGTCGTCATG
CATGTTCGAG GCGATCCGAA CAGTATGCAG AGCGCGAAGA ACACGACGTA CGATGGGCAC
ATTTGTGACG AGGTTGGTGA TGGTCTCTTA GCGACCGCAC GTCGATGTGT GGAGTACGGT
ATAGAGCCAT GGCGTCTGTG GATTGATCCG GGCATCGGTT TCGCGAAGAC GGGTCGAGCC
AACATCGAGC TGTTGCGAGA TTTGCCACGC GTCCGAAGCC GCTTAGCCCC CTTAGGCGGA
GCGCTCATGA ACGCCCCGAT GCTCGTGGGT GCGTCTCGCA AACGTTTTCT CGGTGAGATA
TCGGGAAGGT CCGAAGCGAG CGAGCGAGAC GCCGCGTCCG TGGCAGCGCT CGTCGCCGCC
GTTAGAGGTG GTGCGGACGT CGTCCGAGTT CATAACGTCG CGCTGTCCGC GGACGCCGCG
CGAGTAGCCG ACGCGCTGTG GCGA
 
Protein sequence
VLALGSNQGD RVGLFRDAFA KLRRDLGFEL HAHSSLYETA PAYVEDQGKF LNAACVGSFP 
DDVARDPLAL LDGLKAIEAA LGRDFGTRRY GPRPMDLDVI FHGQGSHSCD RLTVPHARYA
ERPFVLAPLA DLTGAATAAT TSDATTEGLL EARRIWDGTD GEVTAMESGD IARVIPMRDR
LWSWGRETMV MGILNVTPDS FSDGGAYDGG VDVAVRHARE MVAAGATIID VGGQSTRPGA
TRVSGEVESS RVIPVIRALA QAFSEREDVY ISVDTFYGAV ASAAADAGAD IINDVSGGAW
DPAMLPTVAR LEKPLPYVVM HVRGDPNSMQ SAKNTTYDGH ICDEVGDGLL ATARRCVEYG
IEPWRLWIDP GIGFAKTGRA NIELLRDLPR VRSRLAPLGG ALMNAPMLVG ASRKRFLGEI
SGRSEASERD AASVAALVAA VRGGADVVRV HNVALSADAA RVADALWR