Gene OSTLU_30642 details

Gene Information

Locus tag: OSTLU_30642
Symbol:
ID: 5000783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 513738
End bp: 515128
Gene Length: 1391 bp
Protein Length: 368 aa
Translation table:
GC content: 64%
IMG OID: 640416204
Product: predicted protein
Protein accession: XP_001416682
Protein GI: 145344318
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0408] Coproporphyrinogen III oxidase
TIGRFAM ID:
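
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the reported Gene Length). The function names are hypothetical and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein of the given length plus one stop codon."""
    return 3 * protein_aa + 3


gene_len = span_length(513738, 515128)  # Start bp and End bp from the record
cds_len = coding_length(368)            # Protein Length from the record

print(gene_len)  # 1391
print(cds_len)   # 1107
```

Under these assumptions the 1391 bp span is longer than the 1107 bases needed for a 368 aa protein and its stop codon, which suggests that the listed gene sequence includes untranslated flanking bases.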


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00137682
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0422052
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGACACGC GGATCGGATC GCGCGCGCGC CCGACGGTCG TCCGACGACG CGCGAACGAG 
GACGACGCGC GACGCGCGAG ACCGCGATCG AACGAGGACG CGCGAAGGAT GCGCGCGGCG
ACGACGACGA CGCCGATGTC GCGGGCGATG CGGGCAACCG GTGTGGATAA ACCTCGCGCG
GTGCGGGCGT CCGCGGTGCG GGCGCGAGCG ACGCGGGGAG GCGCGACGCG CGCGCGGGCG
GCGGCGCAAG GGATCGAACA GGAGGTGAAG ATCGACGCGG CGCCGACGAC GCTGCTGAGA
GAGGGATCGG GCGAGGAGGG GGACGCGACG CAGATGCGGG CGAGGTTTGA GAAGATGATT
CGCGCGGCGC AAAATGAGAT TTGCGACGCG ATCACGGCGT TGGATGGGAA GCCGTTTCAC
GAGGACGCGT GGACGCGACC GGGTGGTGGT GGTGGGATCT CGCGCGTGTT GCAAGATGGG
AACGTGTTCG AAAAGGCTGG CGTGAACGTG TCGGTGGTGT ACGGACAGAT GCCGCCCGAG
GCGTATCGGG CGGCGACGGG GGAGGAAGGC GCGTCTAAGG AGATGATTCC GTTCTTCGCG
GCGGGTATTT CGAGTGTTAT GCACCCGCAT AATCCGATGG CGCCGACCGT TCACTTCAAC
TACCGTTATT TCGAGACGGA TGCCCCCAAG GGCTCCGCGG GCGCGCCGCG CGCGTGGTGG
TTTGGCGGCG GCACGGATTT GACGCCGTCG TACATTTTCG ACGAAGACGT CACGCACTTC
CACCAAACTT TGAAGGATAT CTGCGATAAG CACGATGGCG AGTTTTACCC GAAGTTCAAG
CAATGGGCGG ATGATTATTT CATGATCAAG CACCGCGGCG AACGTCGCGG CGTCGGCGGC
GTCTTCTTTG ACGACATGAA CGACCGCAGC AAGGATGAAC TCCTCGCGTT CGCGACGGAC
ATGGCGGGCG GTGTCGTCCC GGCGTACGTC CCGCTCGTCG CCAAGCACAA GGACGATGAG
TTCACGCCCG AACAACGCGC CTGGCAACAA ATGCGCCGCG GTCGCTACGT CGAGTTCAAC
CTCGTGTACG ACCGCGGGAC GACGTTCGGT TTGAAAACCG GCGGTCGCAT CGAATCCATC
CTCATGTCTC TCCCGCGCTA CTGCGAGTGG CAATACGACC ACGCGCCCGA AGCCGGTTCT
CGCGAGGCCG ACGCGCTCGA CGCTTTCAAG AACCCGAGAA CGTGGTGCGC GTAAGCGCGC
CTCGTCCCTC GAGCGCGTGC TCTCTTAGTT TCGCCTCGTT CCGTCGTCGC GTCGACGCGC
GTCGACGCGC CGCCTGCCGA GGCCCCTTTA CGCACCATTC AATCATTCAA ACGACGAAGA
TTCGAGTTTT A
 
Protein sequence
MRATGVDKPR AVRASAVRAR ATRGGATRAR AAAQGIEQEV KIDAAPTTLL REGSGEEGDA 
TQMRARFEKM IRAAQNEICD AITALDGKPF HEDAWTRPGG GGGISRVLQD GNVFEKAGVN
VSVVYGQMPP EAYRAATGEE GASKEMIPFF AAGISSVMHP HNPMAPTVHF NYRYFETDAP
KGSAGAPRAW WFGGGTDLTP SYIFDEDVTH FHQTLKDICD KHDGEFYPKF KQWADDYFMI
KHRGERRGVG GVFFDDMNDR SKDELLAFAT DMAGGVVPAY VPLVAKHKDD EFTPEQRAWQ
QMRRGRYVEF NLVYDRGTTF GLKTGGRIES ILMSLPRYCE WQYDHAPEAG SREADALDAF
KNPRTWCA
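
The relationship between the gene sequence and the protein sequence can be checked by translating the coding region. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field is blank in this record. The helper names `clean` and `translate` are hypothetical. The two fragments used in the example are copied from the gene sequence listing: the region starting at the ATG on its third line, and the bases just before the TAA on its twenty-first line.

```python
# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = clean(cds)
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the coding region and its final codons, taken from the listing.
print(translate("ATGCGGGCAACCGGTGTGGATAAACCTCGC"))  # MRATGVDKPR
print(translate("ACGTGGTGCGCGTAA"))                 # TWCA
```

Both results match the first ten and last four residues of the protein sequence, which is consistent with the coding region beginning at that ATG and ending at the TAA, given the assumed translation table.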