Gene OSTLU_31074 details

Gene Information

Locus tag: OSTLU_31074
Symbol:
ID: 5001452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 258059
End bp: 259825
Gene Length: 1767 bp
Protein Length: 557 aa
Translation table:
GC content: 56%
IMG OID: 640416873
Product: predicted protein
Protein accession: XP_001417194
Protein GI: 145345386
COG category: [S] Function unknown
COG ID: [COG3389] Uncharacterized protein conserved in archaea
TIGRFAM ID:
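
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the coding region includes one stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 258059
end_bp = 259825
gene_length_bp = 1767
protein_length_aa = 557


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end]."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode protein_aa residues plus one stop codon."""
    return (protein_aa + 1) * 3


print(span_length(start_bp, end_bp))                      # 1767
print(coding_length(protein_length_aa))                   # 1674
print(gene_length_bp - coding_length(protein_length_aa))  # 93
```

Under these assumptions the span matches the listed gene length, and 93 bp of the gene span fall outside the coding region.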


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCGCGTCGC GTCGGCGCGC GAAGTCGGCG CGAAGTCAGC GCGCGTCTCG CGACGAAACC 
GATGGCGAGG GAAAGGACGC CGACGCGACG GACGCGACGG ACGCGACGGA CGACGACGAC
GACGACGACG ACGACGACGA CGACGACGCG AAGGCGGACG CGATGGGCGG CGTGCGGGGC
GATCGGGATT TTGACGTTTT TGACGACGCT CTTGGGGGCG AACGGAGAGC CGGAGTGCCA
AGAACGAAGC GAAACGCGCG CGGTGGACGT AGAACTCGAC ACCGGGGCGT GGCAAACGAT
CGTCACGGGA GGGCTTGCGC TGTTCGGGGG GGACGTGGAG GCGAACGCGA CGCTCGGGCC
GTGGACGATT GGGGGGGCGA GCGCGGACGC CTACGGATGC GCGATCGGTA GCGTGGGAGA
CGATTTCACG GGTAAAGATG TCTTGGTCGT GAAGCGAGGG GAGTGTGAGT TCTACGAGAA
GGCTAGGGTG GCGCAAGACG TCGGAGCAAA GGCGGTCTTT GTGGTGAGCG ATGGGGAAGA
CTTCACCGCG ATGACGTGCA ACGAGGATCA GAAACTGGAT GTCGTGACCG TTTTAGTGAC
GGGAACGACT GGACAGGCGA TCCTCGATGC CACCACGGAA GTGGGCGCGA CGATTACGAT
CGCACGCTCC GACGCACTGC CGAGACAGTT TGATTTCTTG GCCTCTGCGG CTCTCGTTGC
GTTGGCCCTC GCTACCATCG CCCTCGGTGG AAGGTGGTCG TTGAAGGACA AACGAGCCGT
CGTGAGCTCA AAACGTGATG ATGATGACAT CGACGATAGT AGTGACGGAG GAGAAGCCCA
TGAGGGCATC GAGATAAACG AGTACAGCGC GTTTTGGTTC GTCATCATGG CGTCAGCGGT
CTTACTCATC TTGTTTTATT CCATGCAACA TTGGGTATTC GTGGTGATGA GACTGGTGTT
TTCTTTCGCT TCCTTCCAAG GATTGTACGT GATATGTTTC GAGGCGTTGA TGTCGAGACG
AAAGTCGACC TCGAGGGATT CAAGAGTGCT GTTGCCCATC GTCGGCTCAG TTCACCTTTT
GGCCATTCCG GCGGCTGTAT TCGCTGGCTT AATCGTTGCC ACGTGGCTTA TATTCCGGCA
AGCCACGTGG GCTTGGATGT TGCAGGATAT CATGGGTTTG TCATTCTTGG TAAACGTGTT
GCGTTTGGTG CATCTGCCCA ACTTCAAGGT GGCCACCATA CTTTTATGTT GCGCGATGTT
GTACGACATC TTCTGGGTCT ACGTTCAGCC ACATTTGTTC GGTAAGAAGA GCGTGATGGT
CGCCGTTGCG CGCGGCGGAG ATGAAGGTGA AAGTTTACCG ATGCTATTTT TATTCCCGAG
AGCTTCAAGT CCAGGGGATT TCTCCATGCT GGGGTACGGC GACGTCATCC TTCCCGGTTT
ACTCATCGTG CACAACTTGT TGTTTGACAA CAGAAAGCGC AATTTTTCGG ATACCAGGTA
CTATTACTTC TTCTGGAGTA TGGTTGCATA CGTTGTCGGG ATGTGCTTGA CGTTCACGGC
GCTTTATTTT GAGGTTGGAG GCCAAGGTGG ACAACCAGCG TTGACGTATT TAGTTCCCAC
CGTCGTCGGG ACGACGGGAA TTTTAGCGTG GAAGCACGAC GATTTATCAG ACATGTGGTA
CGGTGTCGAC GATGATTACT CGGCATTACC ATCGGAGTCC CAATCTATAT TGTAAAAAGT
AAGATACACG TAGTTGCGTA GAAACAA
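
A GC percentage like the one listed in the record can be computed directly from a sequence in this layout. The following is a minimal sketch: `gc_percent` is a hypothetical helper, and it is an assumption that the record's GC content was calculated over this same gene sequence. The space-separated blocks of ten bases are joined before counting.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        return 0.0
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used only as sample input.
sample = "CGCGCGTCGC GTCGGCGCGC GAAGTCGGCG CGAAGTCAGC GCGCGTCTCG CGACGAAACC"
print(round(gc_percent(sample), 1))
```

Passing the full gene sequence instead of the single sample line gives a figure that can be compared with the value in the record.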
 
Protein sequence
MARERTPTRR TRRTRRTTTT TTTTTTTTTR RRTRWAACGA IGILTFLTTL LGANGEPECQ 
ERSETRAVDV ELDTGAWQTI VTGGLALFGG DVEANATLGP WTIGGASADA YGCAIGSVGD
DFTGKDVLVV KRGECEFYEK ARVAQDVGAK AVFVVSDGED FTAMTCNEDQ KLDVVTVLVT
GTTGQAILDA TTEVGATITI ARSDALPRQF DFLASAALVA LALATIALGG RWSLKDKRAV
VSSKRDDDDI DDSSDGGEAH EGIEINEYSA FWFVIMASAV LLILFYSMQH WVFVVMRLVF
SFASFQGLYV ICFEALMSRR KSTSRDSRVL LPIVGSVHLL AIPAAVFAGL IVATWLIFRQ
ATWAWMLQDI MGLSFLVNVL RLVHLPNFKV ATILLCCAML YDIFWVYVQP HLFGKKSVMV
AVARGGDEGE SLPMLFLFPR ASSPGDFSML GYGDVILPGL LIVHNLLFDN RKRNFSDTRY
YYFFWSMVAY VVGMCLTFTA LYFEVGGQGG QPALTYLVPT VVGTTGILAW KHDDLSDMWY
GVDDDYSALP SESQSIL
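
The protein sequence can be related back to the gene sequence by translating from the first ATG. The sketch below assumes the standard genetic code, because the record leaves the translation table field blank, and assumes the first ATG is the start codon. `translate_first_orf` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_first_orf(dna: str) -> str:
    """Translate from the first ATG up to the first in-frame stop codon."""
    dna = "".join(dna.split()).upper()
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines of the gene sequence above.
head = (
    "CGCGCGTCGC GTCGGCGCGC GAAGTCGGCG CGAAGTCAGC GCGCGTCTCG CGACGAAACC "
    "GATGGCGAGG GAAAGGACGC CGACGCGACG GACGCGACGG ACGCGACGGA CGACGACGAC"
)
print(translate_first_orf(head)[:10])  # MARERTPTRR
```

The first ten residues agree with the start of the protein sequence listed above. Running the function on the whole gene sequence allows the same comparison for the full 557 aa product.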