Gene OSTLU_24121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_24121 
Symbol 
ID5000520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp700227 
End bp701597 
Gene Length1371 bp 
Protein Length417 aa 
Translation table 
GC content46% 
IMG OID640415941 
Productpredicted protein 
Protein accessionXP_001416220 
Protein GI145342501 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCTGCGTCGA TGTCTTCGCA TAGAACCTCC CGTCGTCTCG CAGATTGTGT GACGTAAGCT 
GATATCCTGT ACACATCTTG TTCAGTCATA TGTGTCCAGC TGTGTTGTGA AAGCGCCATG
TCGGTTTGCG GAGAAAGGGA GCAACTTGGC TCCAACGCTG TAGAAAATGG AACAATCTAT
AGACATCCAT ATTGCGCACG CTTTATCTCT TCCAAGCAAA GGTTTTGTGG ATTGATGGCA
AAAGCTGGGC ATTCGTTCTG CGTTCACCAC CTACCCACAG AACAGTCGGA GTATATTTTT
GCTTTCTGTC CAAAGTGCAA AACAAAAATT GCTAGTGATA GGGTAGACGC TCATTTAAGA
AAGTGCCCCG TTCGCTTGAA AGAACAGCAG ATTACAAATG AGATTTACTT TTGGAAAGAT
ATCAACAGTG CCAAAGAGAC GGTGGTAGAC AAAACAGTAA GCTTGCGCGA ACTCTCAGGC
ACGCAACTCG CTCTTCTGCT CGATAAAATT AGTCAAGCCG AACCAAAATG CTCGTCTTAC
TTGAGTAGAC GTTCGAGCAG CTCACACGAT GACGGCAAGA CCATGCATAG TGTACACGTC
CCACAGCGGA CAAGTGAATC GGAAGCGCTA CAGCGTGTCG CCATAGCAGA GGCCGTGCAA
AAAGTTCGAG ACGTCTTTGA TCTTAGTACT AATAGTGCAC TTGTTGAGCT TTGCTCCGGA
CGTGCGTATC TTTCAGCAGA GATCATGAAA ACATGGCCGT TCTCAAAGCT ATTACTTGTC
GACCGTCGTT CACATCGCTT CAAGGCGGAT AGATTCCTTC GTCATCAAAG TGAGCTGAGG
CGTTTGCTCA TTGACATCAA GGATCTGGAT TTGTATAAAG TCTCTCTATT GAATAAATCG
GAAGTGACCA TCGTGGGTAA ACACTTGTGC GGTGAAGCTA CTGATTTCGC TTTAAAGTGT
GCATTATCTT ATCTGAAAGA TAGTACAGTC ACGTTCAGAG GATTGGCAAT TGCTCCATGT
TGTCACCACG CTTGTAAATG GTCTTCTTTC GTCAATACAC CGTTTCTTGA AACACTCGGC
TTCAGCGAGA CGGATTTTGG CTATCTCATA CGTATGACGA GCTGGGCAAC CACACTGTCT
ACTGGTTGCA CGTCCCACGG ATTCAGTTCT GATTTTGCGA TTCACACTGA GGATATGGAT
GCGTGCCGAA CCCTTAGTAG CAATGATAAA CTACGCGTAG GGCGATACGT AAAGCTGATT
CTCAACACAG CTCGTGTTCT ATGGTGCAAA GAGCGCGGTT TCGACGTAAG CTTGGAGGAG
TATACTAATG TTTGGGTAAC GCCTGAGAAT CAGCTAATAC TTGTAAGATA G
 
Protein sequence
MSVCGEREQL GSNAVENGTI YRHPYCARFI SSKQRFCGLM AKAGHSFCVH HLPTEQSEYI 
FAFCPKCKTK IASDRVDAHL RKCPVRLKEQ QITNEIYFWK DINSAKETVV DKTVSLRELS
GTQLALLLDK ISQAEPKCSS YLSRRSSSSH DDGKTMHSVH VPQRTSESEA LQRVAIAEAV
QKVRDVFDLS TNSALVELCS GRAYLSAEIM KTWPFSKLLL VDRRSHRFKA DRFLRHQSEL
RRLLIDIKDL DLYKVSLLNK SEVTIVGKHL CGEATDFALK CALSYLKDST VTFRGLAIAP
CCHHACKWSS FVNTPFLETL GFSETDFGYL IRMTSWATTL STGCTSHGFS SDFAIHTEDM
DACRTLSSND KLRVGRYVKL ILNTARVLWC KERGFDVSLE EYTNVWVTPE NQLILVR