Gene OSTLU_41064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41064 
Symbol 
ID5002513 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp683838 
End bp685029 
Gene Length1192 bp 
Protein Length332 aa 
Translation table 
GC content61% 
IMG OID640417934 
Productpredicted protein 
Protein accessionXP_001418547 
Protein GI145348207 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.854437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACGC CGGCGCCGAG CGACGGCGCG ACCGCCGTCG CGCCGCGGGC GGCGCCCGCG 
AGCGCGAGCG AGGTCGTCGT GCACCCGCTC GTGCTGCTCA GCGTCGTCGA TCACTTTCGA
CGATGCGACG AGGTGCGACG ACGACGACGA CGCGCGACGA CGACGACGAC GACGACGACG
ACGACGCGGG ACGCGGGATC GCGCGCGCGC GACGCGGGAC GACGCGGGAC GACGCGGGAC
GCGCGAGACG CGCGCGATCG CGGATCGCTT CCTTCGCGCG AGTTTTGACT GACTCCGACG
GTTCGACGCG CAGGGCGACG AGGAGAACAA GCGCGTCGTC GGCGTGCTGC TCGGGGAACA
GCGCAAGGGA CGGTTGGACG TCACGAGCTC GTTCGCGGTG CCGTTCGAGG AGGACGACGG
GGATAACGGG ATTTGGTTTC TCGATCACAG TTACTTGGAA AACATGTATC GAATGTCGAA
GAAGATTAGC GCGAAGGAGA AGATTGTGGG GTGGTACAGC ACCGGACCGA AACTGCGGGA
GAGTGATATC GACATACACG AGTTGTTTTA CGCGTACACG CCCGAGCCGG TGCTCGTGAT
CGTGGACGTG CGGGCGGAGA ACGCGAACAT TCCGACGAGC GCGTTCGCGG CGCAAATCGA
AGTCAAGGAG GATGGAACGG AAAAGCAACA GAAGACGTTC GTGCACGTGC CGAACTCGAT
TGAGGCGTTC GAGGCGGAAG AGATCGGGGT CGAGCACTTG CTGCGCGATG TGAAGGATAA
CACGGTGTCG ACGCTGAGCA CCAAGGTGAG CGAAAAAGTG CAGTCTTTGC GCGGTTTGAA
GGCGCGATTA GAAGAAATCA AGAGTTACAT GGATAAGGTT GTCGACGGCT CGTTGCCGAT
GAATCACGAG ATCATGGGTC ATCTGCAAGA CGCGTTTAAC CTGTTGCCGA ACCTGAACTT
GGAGGATTAC GTCAAGGGAT TCAACGTCTC CACGAACGAC GCCATGCTCG TCGTGTACCT
CAGCTCGTTG ATTCGTTCAG TCATCGCTCT GCACGACTTG ATCAACAACA AGGCGACGAA
CAAGGAACGC GAGCGCGCCC TGGATGCCCC GGGAGCGAGT GACGCGGAGA AGGATACGGA
CAAGGAGAAC GAAAAACCGA AGGATTCGGG AAAGGCCGAC GCCGCAAAGT GA
 
Protein sequence
MSTPAPSDGA TAVAPRAAPA SASEVVVHPL VLLSVVDHFR RCDENKRVVG VLLGEQRKGR 
LDVTSSFAVP FEEDDGDNGI WFLDHSYLEN MYRMSKKISA KEKIVGWYST GPKLRESDID
IHELFYAYTP EPVLVIVDVR AENANIPTSA FAAQIEVKED GTEKQQKTFV HVPNSIEAFE
AEEIGVEHLL RDVKDNTVST LSTKVSEKVQ SLRGLKARLE EIKSYMDKVV DGSLPMNHEI
MGHLQDAFNL LPNLNLEDYV KGFNVSTNDA MLVVYLSSLI RSVIALHDLI NNKATNKERE
RALDAPGASD AEKDTDKENE KPKDSGKADA AK