Gene OSTLU_24443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_24443 
SymbolHAC3501 
ID5001511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp604058 
End bp607393 
Gene Length3336 bp 
Protein Length1100 aa 
Translation table 
GC content56% 
IMG OID640416932 
Productpredicted protein 
Protein accessionXP_001417549 
Protein GI145346136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00582257 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCTC GTCAAGGCGC GCAACAGGGC GGAAGAGGAG GCGGGAGCGC TGCGGCGGCT 
TCGCGTCAGC AGATGGGAGC GTTGTTCCCA GGGGGATTAG GGGGGAATAT GGCGCAACCC
GCGGCGGGAC AGATGATGGG GGGTAACGGT ATGTTACCGA ATGGGGCGCC GATTTTGCGC
GGCGCGGCGG CGGCGCAAGG ATCGCAATAC GCGGGAGGAT CTTCGCAAAT GATGGCCGGC
GTGCCACCTG GGACGATGAT TCCGACGCAG GGTGTGGGTT CGATGATGCC TGAGGCAGCG
ACGGCGGCGG CGCCCAAGGG ACGAGCTAAG GCGCCGCCGA AGGGTAAAAA GCTCACCAAG
GCGCAGCAGG CGGCGCAGGC GAAGCAGCTC GCGCAACAGC AACAGATGGC CCAACAGCAA
CAGGCGGCGA TGCGCGGGCG CCAGCAACCG GGTAGTGGTG GGGGTGGTGT CGTTGCTAAT
TGGCGAGACA TCAATGACGA GCAGTTACGA AAGGCGTACA TCGTGAAACA GCAACGATGG
CTTCTCTTCC TTCGACACGC GAGCAAGTGT CAGGCGCGAC ACGGGCACTG CCCCTACACG
CCGCACTGTC ACGTCGCGAA GCAGTTGTGG GAGCACGTGT TGAAGTGTAC GCTGACGCAG
TGCAACTATC CTCGATGTTT GGCGTCGCGT GAGTTGTTGA AGCATCATCA GACGTGCAAG
GACGCTGGGT GTCCGGTGTG CGGTCCCGTG CGCAACGCCA TGCTCAAGCA GCGTCAACAG
GCGCAAATGC AAATGGCGCA CGGCAACAAG AGAATGAAGC TCGATCACGA TCTCGATCGT
GGTCAACTCA TGGTGAAGGG TACGAGCGGT TTGAAAACGG ATCGCAAGCC AGGTGGCGAA
GGGACGTCGT TAATGGAGTG CTTCACTCCG GAAGAAATCC GTACGCATCT CGCCGCCTTG
CGTCTCGCCG ACAAGGAAAA GGTGCAAGGT CAAGGTCAAC CGAGCGCTCG TCAGCTTCAG
AAGGAAGCTG AGCGGGCGGT GATCAACGCG ACAGAAAGTT CGTGCCGTGC ATGCGGCGTT
GAACGATTGA CGTTCGAACC ACCGCCGCTT TACTGCTACA GTTGCGTCGG TCGCATCAAG
CGCGGACAAG TATTCCACCA GATGCCTAGC CTCGGGGGCG AGACTCGAAG AGATGCGTGG
TGTAACCCGT GCTTCAACGC CATCCAAGGG TATGTCGATG TCGAGGGTCA ACGGTTTCCG
AAAGCGACGC TCATCAAGAA GAAGAACGAC GACGATCTCG AGGAGCCATG GGTGCAATGC
GACTACTGCG AGGATTGGTA TCACCAACTC TGCGTTCTTT TCAACGGTAG ACGCAATGAA
GGCGGTGAAG CGCCATTCAC GTGTCCAAAC TGCATCTTAT CGCAGTTGGA TAAGAACGAA
CGTCAAGTTA CTGCTGAACG TCCGTCTTCA CAACAGCCGG CGAGCTCGTT ACCGAAGACA
AAGATGTCGA CGTTTTTGGA AGAACGCCTC GCGTCGAAGC TCTCGGCGGA ACGCGTGGAG
CGCGCGAAGC AGTTGGGCGT ACCCATAGAA AACGTCCCGA CCGCTGAGAA CTTGACCATT
CGTGTTGTGT CTCAAACACT CAAACAAATG GACACCAAGC CGCACTACTA CGCCCACTTC
AAAGAACAGG GCATTCCGGC GCACTTTACT TACCGTTCGC GCGTCATCCT CCTGTTCCAA
AAGCTTGAGG CCACCGATGT GTGTTTGATG GCTATTTATG TGCAAGAGTA TGATGACGAG
TGCCCTGAAC CGAACCGGCG AAGAATTTAC CTCTCGTATT TGGACTCTGT CAAGTACTTC
CGACCGGACA ACGTCACCGT GGCCACGGGC GAAAACTGCG CGTTGCGCAC GTACGTCTAT
CACAACATTC TGATTGCGTA CCTGGACTAT GTCAAACAGC GCGGCTTCAC GTCGTGCTTT
ATTTGGGCGT GTCCGCCGTT CCAAGGAGAC GATTACATTC TGTACTGTCA CCCCAAGGTG
CAAAAGACGC CAAAGGCTGA CAAGCTTCGT GAGTGGTACC TCAAGATGTT GCGCTCGGCG
CAGAAGGATG GCATCGTCAT TTCAACTTCG AACGTGTACG ATGAATTCCG ACTCGGCAAT
CAAAATCACG ATATTCGATG CGCCACTGAG TATCCGTATT TCGATGGTGA TTACTTCTCC
GGCATTGCGG AAGATTGGAT ACCGACCATC ATGAAGGAAC TCGAGGAGGC AAAGAACATC
GAGGCGAAGA CAAAGTCGTC CACGGTTAAG ATCAGCGCGC GCAAGGCGGC CAAGGCAAAG
AGTGGCACGA TCGCCGCTGA CGCTGAACTG AATAAAGAAC TAATGAAGAA GCTCGGGACG
ACGATCAGCA ACATGAGAAA CGACTTCATG CTCGCCCATC TTGCGCACCA ATGTTTGTGC
TGCCGCAGAA CCATCGCCGG TGCCAACCGC TATTACGCGA CGGAAGGCAC ACCCTTGGTG
CTTTGCGAAG AATGCAAGGA GACTGAAGAT GCGATGCCAG AGAACGAAAA ACGTTACGCT
GGCCGCAAGC TCGAGTGCGA AAAGTGTGAG GAAATTCCCA CGCTGACGAA GGAACAGAAG
GACGAGGAGG AAAAGCTCGA GAGCGAATTC TTTGACACAC GTCAAGCGTT CCTGTCCTTG
TGCCAAGGCA ATCACTTCCA GTTCGACTCG CTTCGTCGCG CCAAACATAC GACCATGATG
GTGCTGTATC ACTTGAACAA TCCATCTGAG CCTGCATTCG TCGCATCATG CAACGTTTGT
TCACGCGAGC TCGAGCCCGG AAAGGGTTGG CGCTGCGAGA CGTGCCCCGA TTTCGATATT
TGCGACAACT GCCGCATCAG AACTGGTCAC CAGCACCCCC TCATGCGACA AGGTCGTACC
GCCGGAGATC GCACGGCGTT GTCTCAAGCC GAGCGCGAGA ACCGCGCGGC GCAGATCGAG
CGAACCATGG AACTCTTGCT CCACGCGTGC AAATGCCGCA AAGAACGATG CGAAAACAGC
AACTGCCCGA AAATCAAACA CTTGCTCAAG CACGCCTTGA GCTGCACGGT CAAATCAGCA
GGCGGGTGCC AGCTCTGCCG TAAAACGTGG ACGCTGTTGC AAATTCATTC TAAGGGATGC
ATGGAGGACG ACTGCCCCGT GCCCAGGTGC CGCGATCTCA AAGAGTACCG TCGTCGCGGT
CAAGAACAAA TTGAAGAGCG CCGACGCGAG CAATACAGAC TTTACCTGAA CGCCGCGCGA
TGAGCGCGCG ACGATCAGAA CAACACGATT AACGAC
 
Protein sequence
MISRQGAQQG GRGGGSAAAA SRQQMGALFP GGLGGNMAQP AAGQMMGGNG MLPNGAPILR 
GAAAAQGSQY AGGSSQMMAG VPPGTMIPTQ GVGSMMPEAA TAAAPKGRAK APPKGKKLTK
AQQAAQAKQL AQQQQMAQQQ QAAMRGRQQP GSGGGGVVAN WRDINDEQLR KAYIVKQQRW
LLFLRHASKC QARHGHCPYT PHCHVAKQLW EHVLKCTLTQ CNYPRCLASR ELLKHHQTCK
DAGCPVCGPV RNAMLKQRQQ AQMQMAHGNK RMKLDHDLDR GQLMVKGTSG LKTDRKPGGE
GTSLMECFTP EEIRTHLAAL RLADKEKVQG QGQPSARQLQ KEAERAVINA TESSCRACGV
ERLTFEPPPL YCYSCVGRIK RGQVFHQMPS LGGETRRDAW CNPCFNAIQG YVDVEGQRFP
KATLIKKKND DDLEEPWVQC DYCEDWYHQL CVLFNGRRNE GGEAPFTCPN CILSQLDKNE
RQVTAERPSS QQPASSLPKT KMSTFLEERL ASKLSAERVE RAKQLGVPIE NVPTAENLTI
RVVSQTLKQM DTKPHYYAHF KEQGIPAHFT YRSRVILLFQ KLEATDVCLM AIYVQEYDDE
CPEPNRRRIY LSYLDSVKYF RPDNVTVATG ENCALRTYVY HNILIAYLDY VKQRGFTSCF
IWACPPFQGD DYILYCHPKV QKTPKADKLR EWYLKMLRSA QKDGIVISTS NVYDEFRLGN
QNHDIRCATE YPYFDGDYFS GIAEDWIPTI MKELEEAKNI EAKTKSSTVK ISARKAAKAK
SGTIAADAEL NKELMKKLGT TISNMRNDFM LAHLAHQCLC CRRTIAGANR YYATEGTPLV
LCEECKETED AMPENEKRYA GRKLECEKCE EIPTLTKEQK DEEEKLESEF FDTRQAFLSL
CQGNHFQFDS LRRAKHTTMM VLYHLNNPSE PAFVASCNVC SRELEPGKGW RCETCPDFDI
CDNCRIRTGH QHPLMRQGRT AGDRTALSQA ERENRAAQIE RTMELLLHAC KCRKERCENS
NCPKIKHLLK HALSCTVKSA GGCQLCRKTW TLLQIHSKGC MEDDCPVPRC RDLKEYRRRG
QEQIEERRRE QYRLYLNAAR