Gene OSTLU_31578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31578 
Symbol 
ID5002053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp166650 
End bp168039 
Gene Length1390 bp 
Protein Length362 aa 
Translation table 
GC content63% 
IMG OID640417474 
Productpredicted protein 
Protein accessionXP_001417939 
Protein GI145346940 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.667857 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0330818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCGCGGAC CCGCGCGAGG CCGACGACCC GGCGCGCGCG CGAACGCACG CGGCTCGACG 
CCCTCGACGC CCGGCGCGGG ACCGGCGCGC GCGCGACTCG CGAGTGGACG CCGAAGGACG
CGCGATGGCG ACGCGCGCGA TGCGATCGAC GCCGTCGACG CCCGCGCGGG CGGCGACGCG
ACGACGAACG CGACGCGCCG AGGCGACGCG GACGCGCGCG AACGCGAACG CGACGCCGGC
GAACGATCTG ATGATTCGCG CGGCGCGCGG AGAGCGGGTC GAACGCACGC CGGTGTGGCT
GTTCAGACAG GCGGGACGCC ACCTGCCCGA GTACAACGAG TATAAGAAAT CGACGGGGAA
AAACTTTCTC GAGCTGCTGG CGGACCCGAA AGACGTGGCG GAGGTGACGA TGCAACCGAT
TCGACGTTAT AACCTGGACG CGGCGATTTT GTTCAGCGAT ATCTTAGTCA TCGCGGAGGC
GCTGAACATC GAGGTGGAGA TGCCGGGCGG GAAGGGGATA TTGGTGCCGA ATCCGCTTCG
GGGACCGGAG GATATGCATC GAGTGCCGGA ATCGATCGAT GTGAACGAAA AGTTGGCGCA
CGTGCTGGCG AGCGTCAAGG AGATCAACGT GCAGATCGAA AAGGAAGGCT TGGGAGTGCC
GCTAATCGGG TTCAGCGCGG CGCCGTGGAC GCTCATGTAT TACATGGTCG GCGGGTCGTC
GAGAAAGAAC ACGGATGCGG GGATGCGATG GTTGAAAGAA CACCCCAAGG AAGCGCAAAA
GTTGTTGGAC ATTCTGACGG ACGTTGTGAT CGATTACCTC GACGCGCAAG TCAAGGCTGG
GGTGCACATG GTGCAAGTGT TTGAGGCGAT GTGCGAGCAC ATCACCGAGG AAAGCTTTTA
CGAGTACGCC ATGCCGTGCA TGGAGAAGAT TGCGCGCGAA TTGAAAAAGC GCCACCCGAC
CGTGCCTTTG CTGGGATTCG CGCGCGATGC CCCGTACGGT CTCGCCGCCC TTCAGCGCGC
CGGTTACGAC ATCATCACCA TCGACACCGC GATGGACCGC GTCGTCGCCC GCGACGTCTT
GAGTCACGAC GCCGAGACGC GCGGCGTCCA ACCCTCCGGC GTTCAGGGTA ACTTCGACCC
AAGCCTGTTG AACAAAGAAG GCGGCTCGTC TTTCGAAGGC ATCGAAAGAG AGGTGACGCA
AATGCTTGAG GATTTCGGTC CGCAAAAGCT CATCGCGAAC CTCGGCGCCG GTTTGGGCGG
CAAGGAAGAC GTCGAAAAAG TCGCCTACCT CGTCGACAGC ATCCACCGCG TGTCCGAGGA
CATGATCGCG GGCAAGTGAG CGCGCGCGCG CTCGCGTCCG CGTCGCGACC TCCGCGCGCT
TTAATATTTC
 
Protein sequence
MIRAARGERV ERTPVWLFRQ AGRHLPEYNE YKKSTGKNFL ELLADPKDVA EVTMQPIRRY 
NLDAAILFSD ILVIAEALNI EVEMPGGKGI LVPNPLRGPE DMHRVPESID VNEKLAHVLA
SVKEINVQIE KEGLGVPLIG FSAAPWTLMY YMVGGSSRKN TDAGMRWLKE HPKEAQKLLD
ILTDVVIDYL DAQVKAGVHM VQVFEAMCEH ITEESFYEYA MPCMEKIARE LKKRHPTVPL
LGFARDAPYG LAALQRAGYD IITIDTAMDR VVARDVLSHD AETRGVQPSG VQGNFDPSLL
NKEGGSSFEG IEREVTQMLE DFGPQKLIAN LGAGLGGKED VEKVAYLVDS IHRVSEDMIA
GK