Gene OSTLU_40372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40372 
SymbolCHR3502 
ID4999958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp972053 
End bp974041 
Gene Length1989 bp 
Protein Length663 aa 
Translation table 
GC content56% 
IMG OID640415379 
Productpredicted protein 
Protein accessionXP_001415990 
Protein GI145341798 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGCGT GGGAAGAAGG GCAGTGGGCG CGAGAATCGG CGGAGAGCAA GCGGTTGGGA 
GAAGTGGAGA GCATCGTCGG GTGGCGACGA TGCGAAAAGG AGGAGACGGA GACGTTGATA
AAGTGGAAAG GTACGTCGTA CGCGCATTGC ACGTGGGTGA AAGTCACGGC GTTGGAAAAC
GATCCGACAT GTGGTGTGCA GGGTAAGATG CGTGTGGCGA GGTATTTCGA TAAGTATCCA
AAGGAGCTCG GGCCGTGCGT GGACGTCAAA CCGGATTACT TGGTCGTCGA CCGCGTCTTT
TCGATGTTTG AAGAGGTGGA CAGAACACTC GTGTGCGTGA AGTGGTCTCG AATGAGTTAT
GACGAGACGA ATTGGGAAGA TATAACTGCC GTGCGCGAGA TGGAAGGTGG TGCGAGCGCT
TTGGAAGAGT TTGAACGCGT CCGAAGCCGC GCTTCAGCGG CGCGCGAGCG CCAAGCCATC
GTTGATGCGG AGACGGATGA GGACGTCGCC AATGCGTGGA GTTCGTACGA TGCCGACACC
GTGCGCGACT CGTACGGAGA GTCCGACGAG TTGCGATCGT ATCAGAAGGA AGGCGTGAAG
TGGATGGCGT TCAACTTCCG AGCCGGACGC GGGTGCATCT TGGCGGACGA GATGGGTCTC
GGGAAAACTG CGCAGGCGCT AGCTCTCATA CATCACTGCT TGCAAGTGCG GCCAGGTCTC
CCTGCTCTTG TCGTCGTTCC CCTTTCAACG ATTGTGAACT GGGAGCGCGA GGCGCAGCGC
TGGGTCCCGG ACGCGTACGT GGTGACGCAC GTCGGCAAGC AAGCCGGTCG CGAATTCGCG
CGAGAACACG ACTGGTATCA CCCAGTTGAC GAAACCCAGA GCATATCGCG AGCGTTTAAG
GCTAATATCG TCCTCACTAC TTATGAAACG ATTACTGCCG ATCGCCAATC TTTCGCGAAG
GCAAAATGGA GTACGATGGT CGTCGACGAA GCGCATCGCT TGAAACGAGT TGGAGGTAAG
CTTGGGAACG ATTTGAACAG CCTCGCGGTG GAGCGCATTT GCTTACTCAC GGGCACTCCG
CTTCAAAACA ACACCACCGA GCTCTGGTCG TTGCTGAACT TTGTCGATTC TAAGCACTTC
TCCAACGCGG AGGAGTTTGA AGAAGCGTTT GGAGGCATGG CAAAGGCTGC GCAAGTCGAG
CGTTTACAAA AGGTTCTTGG TCCGTACTTG CTGCGTCGAC TGAAGCGCGA CGTCGAGCAA
AAGTTACCAC CGCGAAGTGA GACACTTGTC GAGTGCGAGC TCGCGCCTTT GCAGAAAAAG
TGCTATCGTG CATTATTTGA GCGTAACTTT TCCTTTCTTC GGCAAGGTTG CGACTCGAGA
GAGAGTTTTG CAAACTTTGC GAACATCATG ATGGAAGTCC GTAAGTGTTG CCAGCACCCG
TTTTTGCTCG ACGGCGTCGA AGCTGCCATC GCGCCGGAAG GCGCGAGCAC CACTGCCTTG
GTATCGAGCG CGGGAAAGTT GCAGCTCTTG GACAAGCTCC TTCCGCATCT TCGCGAAGGT
GGGCATCGAG CTCTCATCTT CAGTCAAATG ACGCGCGTTT TGGACGTCCT GGAGGATTAT
TGCCGCGCAC GAGGTCACTC TTACGTGCGA CTTGACGGTA GCATCACCGG CAAAGCACGT
CAAGAAGCGA TCGACAAGTA TTGCGCTGAG GATTCTGACA CTTTTCTGTT TCTCCTCTCC
ACGCGCGCCG GAGGCCAAGG CATCAACCTC GTCCAGGCTG ACACTGTCGT TATGTTCGAC
AGCGACTGGA ATCCGCAAAA CGATGCACAG GCGCTCGCGA GAGCGCATCG CATCGGGCAA
ACGCGCCAAG TCCAGGTATA TCGACTCGTC ATGCGGGCCA CGTACGAAAA GGAAATGTTT
ACGCGGGCGT CGATGAAACT CGGTCTCGAA CAAGCCATCT TTGGGAGCGC AGAAAAGGAA
GAGAAATCA
 
Protein sequence
MHAWEEGQWA RESAESKRLG EVESIVGWRR CEKEETETLI KWKGTSYAHC TWVKVTALEN 
DPTCGVQGKM RVARYFDKYP KELGPCVDVK PDYLVVDRVF SMFEEVDRTL VCVKWSRMSY
DETNWEDITA VREMEGGASA LEEFERVRSR ASAARERQAI VDAETDEDVA NAWSSYDADT
VRDSYGESDE LRSYQKEGVK WMAFNFRAGR GCILADEMGL GKTAQALALI HHCLQVRPGL
PALVVVPLST IVNWEREAQR WVPDAYVVTH VGKQAGREFA REHDWYHPVD ETQSISRAFK
ANIVLTTYET ITADRQSFAK AKWSTMVVDE AHRLKRVGGK LGNDLNSLAV ERICLLTGTP
LQNNTTELWS LLNFVDSKHF SNAEEFEEAF GGMAKAAQVE RLQKVLGPYL LRRLKRDVEQ
KLPPRSETLV ECELAPLQKK CYRALFERNF SFLRQGCDSR ESFANFANIM MEVRKCCQHP
FLLDGVEAAI APEGASTTAL VSSAGKLQLL DKLLPHLREG GHRALIFSQM TRVLDVLEDY
CRARGHSYVR LDGSITGKAR QEAIDKYCAE DSDTFLFLLS TRAGGQGINL VQADTVVMFD
SDWNPQNDAQ ALARAHRIGQ TRQVQVYRLV MRATYEKEMF TRASMKLGLE QAIFGSAEKE
EKS