Gene OSTLU_40166 details

Gene Information

Locus tag: OSTLU_40166
Symbol:
ID: 5000047
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 529963
End bp: 531216
Gene length: 1254 bp
Protein length: 401 aa
Translation table:
GC content: 52%
IMG OID: 640415468
Product: predicted protein
Protein accession: XP_001415524
Protein GI: 145340837
COG category:
COG ID:
TIGRFAM ID:
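
The listed gene length can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(529963, 531216)
print(length)           # 1254, matching the listed gene length

# Upper bound on protein length if the whole span were coding:
# whole codons minus one stop codon.
print(length // 3 - 1)  # 417
```

The listed protein length (401 aa) is below this 417-residue bound, which fits with the gene being flagged as spliced, provided the gene length covers the full genomic span.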


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.37612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTTCG TGTGCGCCGT CATCCTTCGC CTCGTCCTCA TCGCTTGGAG CGCGTATCAA 
GACGCAAACT TTGACGTCAA GTATACCGAC ATTGACTACT TCGTCTACAC CGACGCCGCG
CGTCATGTCG TCCGCGGCGG ATCGCCGTAT GAACGAGCAA CGTATCGATA TCCACCGCTA
TTGGCCGTCT TACTCGCGCC GAACGTGTTG GTGCACGAAA TGTGGGGGAA AGTGTTCTTC
AGCACGTTGG ACATCGCGGT GGGCGGTTTG ATTTTGAAAA TCGGTCGGCG ACGCGGTATG
AACGCGCGAG AGCTCAAATA TGCTTTGTGG TGTTGGTTAT TTAATCCGTT CACGTGCGCG
ATAAGCACGA GGGGAAGCTG CGAGGCATTG ACGGGAGTGT TGATGCTGTT GACGGTTGAG
GCTCTCACCG CGGGCGCAAC GACAAGGGCC GCAATCGCGT ACGGATTCGT CGTTCACATG
AGGCTGTATC CAATCATACA CGCATTGATG TTCGTTGCGT TTCTTAATAA GGATTACATG
GGCAATCGCG CTTTGTTCGG TAAGCGAGGA TCCAAAGCGC TTTCGTGGGT GACGGTAGAA
AACGTCAAGT TTGCCGTGGT TTCGTCGGCG ACATTTTTCG CGCTAACGGC TGGTTCGTAC
GCCGTGTATG GCATGGATTA CATCGATGAG GCAATTCTGT ATCACGCGCA AAGAAAAGAC
CATCGTCACA ACTTCTCACC GGCGTTTTAC GGGATATATC TGAGCATTCA TCCGACGACG
GACGCTCCAG ACTTGAACGG TTCAGCAATC GTTCAAACCG CCGATCGGTT GGCTATGAGT
CCGTTGCCCA TGCTTACAGT CGTTCTATCA CTTGGGTTTG CGTTTGCTAG CGACATGCCT
TTCGCACTTT TTGTGCAGAC ACTCGCGTTT GTAGCTTTCA ACAAGGTGTG CACGGCGCAA
TACTTTGTTT GGTGGTTCAT GCTCTTGCCA CTCGTTTTAC CATCGCTGAT GCGAAGTGCG
AATCGAAAAC GTGTGGTGTT CGCCACGCTG ATTTGGCTCA TCGCCCAGTT ACACTGGCTG
GCTTGGGCGT ACGCCCTCGA ATTCAAAGGG GCGCAAGTAT TTGAGAGCGT GTGGTTGGCG
TCCATCGCGT TCTTCGGCGC AAACATTTGG CTCTTGTTGA ACATCATCGC AGCGTATGCG
CACGCACCGA TATTTTCCAG AGGTCGCTTG CAGAAGTTTT CAAAAGTAGA ATAG
 
Protein sequence
MAFVCAVILR LVLIAWSAYQ DANFDVKYTD IDYFVYTDAA RHVVRGGSPY ERATYRYPPL 
LAVLLAPNVL VHEMWGKVFF STLDIAVGGL ILKIGRRRGM NARELKYALW CWLFNPFTCA
ISTRGSCEAL TGVAAIAYGF VVHMRLYPII HALMFVAFLN KDYMGNRALF GKRGSKALSW
VTVENVKFAV VSSATFFALT AGSYAVYGMD YIDEAILYHA QRKDHRHNFS PAFYGIYLSI
HPTTDAPDLN GSAIVQTADR LAMSPLPMLT VVLSLGFAFA SDMPFALFVQ TLAFVAFNKV
CTAQYFVWWF MLLPLVLPSL MRSANRKRVV FATLIWLIAQ LHWLAWAYAL EFKGAQVFES
VWLASIAFFG ANIWLLLNII AAYAHAPIFS RGRLQKFSKV E
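
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up; the helper names `join_blocks` and `gc_percent` are hypothetical, and only the first protein row is embedded here to keep the example short.

```python
def join_blocks(formatted: str) -> str:
    """Remove the spaces and line breaks used for display formatting."""
    return "".join(formatted.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for empty input)."""
    seq = join_blocks(dna).upper()
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the protein sequence (six blocks of ten residues).
first_row = "MAFVCAVILR LVLIAWSAYQ DANFDVKYTD IDYFVYTDAA RHVVRGGSPY ERATYRYPPL "
residues = join_blocks(first_row)
print(len(residues))       # 60

# Toy input to show the GC calculation; not data from the record.
print(gc_percent("AT GC"))  # 50.0
```

Applying `join_blocks` to the full gene and protein sequences and taking `len` of each result should give values comparable with the listed gene and protein lengths.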