Gene OSTLU_42930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42930 
Symbol 
ID5005491 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp167080 
End bp169209 
Gene Length2130 bp 
Protein Length661 aa 
Translation table 
GC content60% 
IMG OID640420912 
Productpredicted protein 
Protein accessionXP_001421224 
Protein GI145353874 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.816964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.204876 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACGG CGGCGCTGGT GAAGCGACAG CGGCGACCGG TGACGTATTC GCTTCCGCAG 
CCCGAGTACG CGAACAAGCA CGTCGGTGGG GTGAACGCGG TGGACTTGGG AGGAGAGGGA
TCCGATAGCA TATTCACCGG TGGGCGCGAT GGGACGGTTC GGATGTGGGA TCTGTCGGCG
GGGATGCCAG CGTGCGCGCG GCGGTTTGAA GGTCACGGCG GGTGGGTGAA CGACGTCGCG
AGAGTGTCGA GCACACATTT GGCGAGCGCT TCTAGCGATC ACACCGTGCG GCTGTGGGAT
ATAAGTGAAG GATCGATGTC GTCGTCTTGC GCGGTGGCTT TGCAGGGACA CACAGATTAC
GTCATGGCGT TGGCGTGCGC GTCGGAACGC TTGGAAGGAA AGTTTGCTTC GGGCGGGCTC
AATCGCGAGA TTTTCTTGTG GGACATCGAG CGATGTCTCG CGATCAATGC CGCGCCGTCG
TATCTAAGCG TGAAGAATGA TGGATGCGTC ACCGCGCTGG GGGGCTCGAA GGAGTCCATC
TACGCTTTGG GTATGGATGG GACTGGAAAC TTGCTCGTCT CCGGCGGTAC CGAGCTCGCG
CTTCGCGTGT GGGACACGCG GAGCGCTCAA AAGGAAGGCA AACTCAAGGG ACACACAGAC
AATGTTCGAG CAATCGTCGT CGACACGGAT GGTAAAAAGT GCGTCACAGC GTCGAGCGAT
AGAACGATTC GTGTTTGGGA CATTGGCGAG CAGCGATGCG TGCAGACGTT TGCGGGCATG
CACACCGGGT CAATTTGGGC GCTCGCGTGC AACCGCGATT TCACTCGTGT ATACTCGGGA
GGTATCGATG CCCGGATATG CGTCACGTCG TTGCGTGATA GGAAAAGTAG TTTAGTCGCG
AACGAATCGG CGGCTATTTT AAAGCTGCGC CTAGACGAGT CAACGCGCGC GAGCGGTTTC
GCCGCGCACT CGTCGGACGG CGACTTGTGG ACCGCCACTG CGTCGCGGTC GATCAAGCGC
TGGCCGACGA TCATCCCGGG CGAAGATGGC TCATCAGCGA ACGTAGAAGA CTCTCATGGA
ACGCCGACGA GCACCTCGTT CTCGCGTCGC GCCGCTGGAT CGAGTCTTTT GCGGCAGCCG
GGAACGTGGT TCGACGTCGG TTCGCCCGGG GCATTTTCGT TCGGCGGCAC GACGCCGCGA
AAGTTCGATA ACTGGGGCGA TCCGCACGAT CGTGAGCCCC AGGACTTAGT GCCGTCGATG
GAGATTATCG GTGTGAGCCC GATCGTTCGT CACAGCGTGC TGAAGAACAA GATGGAGGTG
TTGACGCAAG ACTCCATGGG TGCCATCGCG CTGTGGGACG TGTCGCGATC GACCCCAATA
CGCACGTTCG ACGGCGTCAC CGATGGATCA GACTTCGAAA TTTTACTCGA AAATGAAAGT
TTGAATCCAG ATGTCGTCGT TCCATCGTGG TTTACCTGCA ACACTCGTAG TGGTTCGCTC
GCCATCACGC TCTCGCCAAG TTCGGCGTTC AACGCCGAGG CGTACGCGAG CGATCTGGGT
ATCGCTGACG CCGCGCCGGA CGAGCGAAGA AATTTAGGTG TCGAAATCAT TCGACTATTG
ATGGATGAGT GGGTAGAAAA ATTTACGTCG TCGAAACCTC CGCGATCAAC GAAACGGGCG
TTCGCCGAGC CGCTCCCGAG CGAGTGTGCG GTTTCGTTCG CGCACCCCAT CGACGACAGC
GGTCGCGTGT GCGTCAAGAT TCGCGAAGAC TTCTCGGGCA CACCCACCGA GCGTGAGTTG
CTTCCCAAAT GGTTCGCCGA TCACGCCCTG AATCAAGCGC CCGAGCCGGA ATCTCCAAAG
ATCAGTTTCC GCCTCGCCCC TCGCGCAGGC TCGGGTCTCG CTGAAATTTC CCCCGCGAGC
GTCTCGGCGC CCAAGATTCT CGGCGCTCAG AAGATTCGCG AATACATCGC ACAGAAGTTA
GACGAAACAG ATGGCGGAAC GAGCACCGAT GCCAGCGCGC TCGCGCTCTG GTGCGCCGGC
CAACCCGTGC CGTCGTCGGC CACGCTCGCC GCCGCGCTCG CGCGCGTCTG GAAAAAATCG
CCCCCGATCG AGCTCGAGTA CTCCAACTAG
 
Protein sequence
MNTAALVKRQ RRPVTYSLPQ PEYANKHVGG VNAVDLGGEG SDSIFTGGRD GTVRMWDLSA 
GMPACARRFE GHGGWVNDVA RVSSTHLASA SSDHTVRLWD ISEGSMSSSC AVALQGHTDY
VMALACASER LEGKFASGGL NREIFLWDIE RCLAINAAPS YLSVKNDGCV TALGGSKESI
YALGMDGTGN LLVSGGTELA LRVWDTRSAQ KEGKLKGHTD NVRAIVVDTD GKKCVTASSD
RTIRVWDIGE QRCVQTFAGM HTGSIWALAC NRDFTRVYSG GIDARICVTS LRDRKSSLVA
NESAAILKLR LDESTRASGF AAHSSDGDLW TATASRSIKR WPTIIPGEDG SSANVEDSHG
TPTSTSFSLP SMEIIGVSPI VRHSVLKNKM EVLTQDSMGA IALWDVSRST PIRTFDGVTD
GSDFEILLEN ESLNPDVVVP SWFTCNTRSG SLAITLSPSS AFNAEAYASD LGIADAAPDE
RRNLGVEIIR LLMDEWVEKF TSSKPPRSTK RAFAEPLPSE CAVSFAHPID DSGRVCVKIR
EDFSGTPTER ELLPKWFADH ALNQAPEPES PKISFRLAPR AGSGLAEISP ASVSAPKILG
AQKIREYIAQ KLDETDGGTS TDASALALWC AGQPVPSSAT LAAALARVWK KSPPIELEYS
N