Gene OSTLU_119569 details

Gene Information

Locus tag: OSTLU_119569
Symbol: Cup201
ID: 5000335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 584613
End bp: 585944
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table:
GC content: 46%
IMG OID: 640415756
Product: Conserved protein of unknown function
Protein accession: XP_001416429
Protein GI: 145343653
COG category:
COG ID:
TIGRFAM ID:
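
The length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The variable names are illustrative only.

```python
# Coordinates and lengths as listed in the Gene Information fields.
start_bp = 584613
end_bp = 585944

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# The gene is listed as unspliced, so the whole span is coding sequence.
# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1332
print(protein_length)  # 443
```

Both values agree with the listed Gene Length (1332 bp) and Protein Length (443 aa).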


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.434648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGACC AATGTACTAC AGAAAATAAG AGCGTTCAGT TAGACCAGTT GCAGCGGCGG 
TACCGTTCAC AAATACGACG GATAAAACAG TTACAAAGCA GCGGAGCGGA GAAAGTTCAG
TTGCGCAACG CAGCAACAGT TCTTCGTTCG ATTAAAGATG ATATTCGCTT AGAGCACAAA
AACAGGACGC GCCGGAAGGT CGTTGACAAC TGTGCAAATC TCTATCCACG GGCGTGCAGT
ATACCTACAG ATGACGAGGG CTTTGTAACG TCCTTTCCTG CTGGACAAGT ACTCAGTGGT
CCAGAGACGG ACAAACACGA ACTCTTGGAT TTTTTTAAAA CGTACGGGTT CGTAATCTTT
CGCGATATCA TCGGTCCTGA AGAGTGCGAA GAAACTGCAA AAGAAATTTG GGACCACTTG
GAGGCCAGGA ATCCCAGTCT TCAACGTGGC GTCCCACACA CTTATTCAGT GTTATCTTCC
AAGACCTATG GCCTTGCACC CGAACCAGCA CTATTCACAG CACAGATGAT AAGAAACAGA
TGTAACGAGT ATGTGGTCAA GGCACTGCGA CTTTTATTGG GGCACGCGGA TATATTATTG
TCACATGACC GCTGGTGCTT CTACAGACCA ACAAAAAGTA TAAGTATAAA AAATAGCCAA
CACTTCATGG ACATGCCCAC CTGGAAAACA CCGAGTAATC TTCACTTGGA TTTGAATCCT
TGGATGTATA TAAATGGGAA TGTACCTTCA CAGACGCTGG ACTACAAAAA CTTGCGCGAT
TTTAGCAAAG AGATGAACAG TGTTACACAG GTAACTGGGC CACACCTACA AGGGATTCTA
TCGATCACGG AGAACAAAAA CGAAGATGGC GGCACGGTGC TCGTTCCTGG CTTCCATAGC
GTGTTTTCAG ATTGGGTCGA ACATTTGGGG GCCATGAACA AATATACGAA CCACAACGAT
TCCAGTACAA ATAGGCTCGT GTGGCGAGGT CACGGTGCAG GGAGCTTCAA GTTTGCGGCT
GTGGACCCTA TTCACAATTT GAAACGCAGA ATTTCACTTC GGGCTGGTAG CTTTTTAGTC
TGGGATCAGC GCATTGTTCA CGGTTCCGTA CCAAATAATA GCTCCAACCC TCGAATGGCA
CAATTTATCA AGGCCTTTAA AAGTCACGGG ATATCCAAAC AGCAGTTCTA CGCGAGATCT
AAAGCTATCC ACAAGCACAT GAAAGTAGCA AGAACGCTGA AATTAGATAC GCTGACGAGC
GACTCACGTC GAGTGTTAGG TCTCGACCCT CATCTTAACA AACTAAACGG TACCAGCGGA
GTCAGTATGT AA
 
Protein sequence
MVDQCTTENK SVQLDQLQRR YRSQIRRIKQ LQSSGAEKVQ LRNAATVLRS IKDDIRLEHK 
NRTRRKVVDN CANLYPRACS IPTDDEGFVT SFPAGQVLSG PETDKHELLD FFKTYGFVIF
RDIIGPEECE ETAKEIWDHL EARNPSLQRG VPHTYSVLSS KTYGLAPEPA LFTAQMIRNR
CNEYVVKALR LLLGHADILL SHDRWCFYRP TKSISIKNSQ HFMDMPTWKT PSNLHLDLNP
WMYINGNVPS QTLDYKNLRD FSKEMNSVTQ VTGPHLQGIL SITENKNEDG GTVLVPGFHS
VFSDWVEHLG AMNKYTNHND SSTNRLVWRG HGAGSFKFAA VDPIHNLKRR ISLRAGSFLV
WDQRIVHGSV PNNSSNPRMA QFIKAFKSHG ISKQQFYARS KAIHKHMKVA RTLKLDTLTS
DSRRVLGLDP HLNKLNGTSG VSM
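
The relationship between the gene sequence and the protein sequence can be spot-checked with a short script. This is a minimal sketch, not the portal's own method: it assumes the standard genetic code (NCBI translation table 1), since the Translation table field is blank in this record, and the helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first 30 and last 12 bases of the gene sequence are used here to keep the example short; the full sequence can be pasted in the same way.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Codons are enumerated in TCAG order to match the amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence listed above.
first_30 = "ATGGTGGACC AATGTACTAC AGAAAATAAG"
last_12 = "GTCAGTATGT AA"

print(translate(first_30))  # MVDQCTTENK
print(translate(last_12))   # VSM
```

The two fragments translate to the first ten and last three residues of the listed protein sequence. Calling `gc_content` on the full gene sequence gives a value that can be compared with the 46% listed in the record.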