Gene OSTLU_16043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16043 
Symbol 
ID5002652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp177856 
End bp179784 
Gene Length1929 bp 
Protein Length642 aa 
Translation table 
GC content61% 
IMG OID640418073 
Productpredicted protein 
Protein accessionXP_001418631 
Protein GI145348388 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.168876 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCC GCGGCCGCGC GCGATCGATA TGCGTCCTTC TCGCCGCGCT CGCACTGACG 
ACGAGCGCGT ACGCCGCGCC GACCTCAGCG CCGCGCGCGC GCAGCCGCGC GACGCCCGCG
CCCGCGACGT CTCGTCCCCG CGATTCCCCG GACATCGAAA CATCGAACAT CGACATGAAC
GACGTCGAGG CGTTTCTCAC GCAAGCGCGC GATGAATTCA TCGCGACCCG CGCGTCGCGC
GGGTCTTCAC ACGGCCGCTC GACGACGACG TCTCCGGACG AAGCCGGTTT GGACTGGATC
GCGTCGTTCT TCTCGCCTCG GGAACGTCGC CGCGGCTCTG CGGGTCGCGC GGCGCGTAAT
CGACGCGACT CGAACTTGCC TCCGGTTGAA GTGGATCAGA CCGGCGTTGA CTTCTACCTC
ACCGCGCGCG CGTACGCGCA AGAACACGCG GTGAGCGATC CTCGGGGCGG TCGGATGCGC
ATCGCTCGCG CAGAGGAACC GAATCAGTTT CAGTGGTGGT ACGATTTCAT TTGGCAAACG
CAACAACAAA TCATCGCTTG CGAAGGACGA GAATCGTGCG AAGGTCTTTG GCATCAGTTC
GGCGCGCAAG GCAGCATTTG TTGTGACGCC GGTGCCGAAT TTTACGGTCA CGCTCACTTT
TCGTGCGTCG AAAGTGTCGA ACAATGCGAG GCGCTCACCA CGTGCGCCTC GTCCGCCGAT
TGCGGCGTGA ATCAAGTGTG TTGCGCGACG AAGCCGCATT CTGACGATTT GATGTGCGTG
ACGAGCTTTC AAGACTGCGC GGCGTACTGT CACTCCGATG CGCAGTGTAA AGCGGAGAAA
GGCGAACAGT GTTGCTACGA CGAAGTACTC GGATACACGA TTTGTATCCC GGAAGGGCTA
TCGTGTCCAC CGCCCCCGCC AGAGTGCCCG ACCACGGGAC AGCCGACGTG CCGCTCGGAC
TCCGAGCGCA CGTGTTGCGG TGGCGTCTGT TGTCCGCCGG ATGAAAACGG TGTTGAATGG
TTGTGTTGCA GAATCTGTGA CGAAAATGTT TGCTACCGAG CTACCCCCTC TCTGGGGGAC
GGAAGCTACG AGTGTCCGGA CCCATTTTGC CCGCGGCCTC CAGAGTGTTC GTCGCAAGCA
GAGTTGAGTC AGTGCACGGA TCCGGTGAAC CCGATCGTCG CCGCGCAATC TGCGCGCAAT
GGCGATCCTC CGACCGGTAG CATTTGCTGC GGTGGTGTGT GCTGTGAGAT CGGGGACCCC
GATCTGGTGC CCATCGGCAC ATTCGGCCCG AACTTTTGCT GCTACGATTT CCCGGACGGT
CCTTCTGGTT GGAGTTGCCA ACCAGGCATC CCTGGTGACC CGCTTCCGAC TCCCCCGCCC
GGTTGCGCGC GACAACCTCC CAGCGCTCAG TGTCCGGCCG GTTCTGAGTA TTTGGATACG
TGCGTTGCTG ACGATCAGTC TATTGGCGTG TGCTGTGGAC CGGAAGTTGA CGGCGGCGGG
CTGACATGCT GCCCTGATGA ATCCGTGTGT TGCGCCAACG TCGTGGACGG AGCGACCGTC
GGTTACGAAT GCAAAGCCCA GAATGAGTGC CAAGAGGGTG AGTTATGCAG GATACTCGGT
GACTGTCCGA ACTCTCAGCA ATACTCGGTC TGTGGTTCGT GCGGACAGGG CAATAATGAC
TGCGCGCTGT CCTGCCAAGC GGGCGCAGGT GAGCCTCCGA ACGACCCTTC GTGGATTTGT
TCCACCGTCG ATGGTTGCGA CCCAGTCGCC GACTACAACA ACGGCCCGAG CGATGCCACG
TGCGTGTGTA ATAACGGAGC GTGCGGCGGC GCGACTCAAT GTAAGACAGG AGGGAATTGC
TGCGCGTGCC AAGCTCGTGG CCCGCGCGGT GGCGTGCAAT GCGACAACCC ACCAACCGTA
TTGGTTTGA
 
Protein sequence
MTIRGRARSI CVLLAALALT TSAYAAPTSA PRARSRATPA PATSRPRDSP DIETSNIDMN 
DVEAFLTQAR DEFIATRASR GSSHGRSTTT SPDEAGLDWI ASFFSPRERR RGSAGRAARN
RRDSNLPPVE VDQTGVDFYL TARAYAQEHA VSDPRGGRMR IARAEEPNQF QWWYDFIWQT
QQQIIACEGR ESCEGLWHQF GAQGSICCDA GAEFYGHAHF SCVESVEQCE ALTTCASSAD
CGVNQVCCAT KPHSDDLMCV TSFQDCAAYC HSDAQCKAEK GEQCCYDEVL GYTICIPEGL
SCPPPPPECP TTGQPTCRSD SERTCCGGVC CPPDENGVEW LCCRICDENV CYRATPSLGD
GSYECPDPFC PRPPECSSQA ELSQCTDPVN PIVAAQSARN GDPPTGSICC GGVCCEIGDP
DLVPIGTFGP NFCCYDFPDG PSGWSCQPGI PGDPLPTPPP GCARQPPSAQ CPAGSEYLDT
CVADDQSIGV CCGPEVDGGG LTCCPDESVC CANVVDGATV GYECKAQNEC QEGELCRILG
DCPNSQQYSV CGSCGQGNND CALSCQAGAG EPPNDPSWIC STVDGCDPVA DYNNGPSDAT
CVCNNGACGG ATQCKTGGNC CACQARGPRG GVQCDNPPTV LV