Gene OSTLU_52141 details


Gene Information

Locus tag: OSTLU_52141
Symbol:
ID: 5006883
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 155701
End bp: 160702
Gene Length: 5002 bp
Protein Length: 608 aa
Translation table:
GC content: 62%
IMG OID: 640422304
Product: predicted protein
Protein accession: XP_001422825
Protein GI: 145357233
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
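
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are both inclusive, which matches the listed Gene Length, and that one stop codon follows the last amino acid. The function names are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(155701, 160702)   # Start bp, End bp from the record
cds_len = coding_length(608)             # Protein Length from the record

print(gene_len)             # 5002
print(cds_len)              # 1827
print(gene_len - cds_len)   # 3175 bases of the span are not coding sequence
```

The 3175 bp difference between the gene span and the coding length is consistent with the record marking the gene as spliced.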


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.70287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00225388
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
CGCCGCGCGT CGCGCGCCGA CGCCGAATCG CGCCGACGTA GCACATCGCC GTCATCGCCT 
CCGCGTCGAT CGCGCGTCGG CTTTCGCGCA GGAGCTCGAA CGCGCGCGTC GCGCGCGGCG
TCGCGCGCGC GAACGCGTCG AGCGCGTCGC GGTCGTCGGC GCTCGAGCGC GTCGCGGTCG
CGTCGGCGCG CGCGGCGCGG CGGCGGCGAA TCGGCGCCGG TCGACGGCGC GCGACGGCGC
GCGCGGGCGC GCTCGAGCGC GCGGGCGAGC GCGCGCGGGC GGCGGCGCGC ATGCGTCGAC
GCGCGGGCGC GCCCGACGGC GCGCGCGCGG CAGAGCGCGA CGCGGTGTGC GGCGAAAGGA
TTCGTCATGA CGCGCGGTGA CGGCGCGCGC GACGGCGCGT CGACGCCGCG CGCCATCGGG
CGCCTGCCGA GCGACGTCGT CAATCGCGTC GCCGCGGGAG AGGTGCGTCG AGGGCCGGCG
CGCGACGGCG CGCGCGATGA GAGATTGAAT GTTTGAACGT CAGCTCGACG ACGGACGACG
CGAACGCGCG ACTGACGACG CGAGACGACG CGACGACCGA ACGCGCGCGC AGGTGATCCA
TCGACCGTCG AACGCGCTGA AAGAGCTGGT GGAGAACTCG TTGGACGCGG GCGCGAAGTC
GATCGCGGTG ACGACGAGGG AGGGCGGGAA TAAACTGTTG CGAGTGCAAG ACGACGGACA
CGGAGTGCGA ATAGAGGACT TGCCGCTGCT GTGCGAGCGA CACGCGACGA GTAAGATTGA
AAAGTTTGAG GATTTAGCGC GATGCGAGAG CTTTGGGTTT CGAGGAGAGG CGCTGGCGAG
CATGAGCTAC GTGGCGCACG TGTCGGCGAC GACGATGGCG GCGGGGGCGA CGCACGCGAC
TCGAGCGACG TATACGGATG GGAAGATGGA TGCGGAGGGG GCGAAACCGA TCGCGGGGGT
GTTAGGAACT ACGATTAGCG TGGAGAACTT GTTTTATAAC GTCGTGACTC GAAGGAAGGC
GTTGAAGAGC GCGTCAGAGG AGTACTCGAA AGTGCTCGAG GTGTTGCAGA GGTACGCGGC
GTTGCGAACG GATGTGGCGT TCACGTGTCG GAAGCACGGT GAGTCGCGAG CGACGTTGCA
CACTCCCGTG GCGCAATCGC GCGTCGAGCG GTTGCAGGCG ATTTACGGTC CCACGGTGGC
GAGAGATTTG AAGAAGCTCG ACTTCGACAG CGAGCTGTCC AAGAAAAAGT TTGATTTCAA
GCTGCAAGTG GACGGTTTAG TGAGCGGTGG GAATTATCAT TCAAAGAAGA CGACGTTCAT
TTTGTTCATC AATTCGCGTT TAGTGGAGTG CGCGCCGCTC AAGCGCGCGT GTGAGTCGGT
GTACGCGGCG ATACTCCCCA AGGCTGAGAA GCCGTTTGTA TTCATGCACC TCCGCCTGCC
GTTTGAAGAC GTCGACGTCA ACGTGCATCC CACGAAACAG GAGGTGCACT TTCTGCACCA
AGAAGCCATT GTGGAGTTGA TTCAGTCCAA ACTAGAGAAG ATTCTTCTCG CGACGAATTC
GTCGCGAACA TTCACCGTGC AAACACTGCT TCCTGGCGCG GAGAAACTGG CAAAGAAGGA
TGACGAAAAC GACGCCGAGC GAAGCGGCGA CAAGGAAAAT AGCGAAAAAG CGGACGAACC
GCCGGCGTCG CAGGCGAAGA CGATGCGGAC ACAGCGCGAA CGCGCGGGTG GTGATCACAA
GCTCGTTCGC ACGGATGCGA ATTTAGCAGC GGGGAGTTTG GACGCGTACT TGCAGCGAGC
GATGAATTCC GAGGGACGCG AACACGAGAA AATAGAAGAG GTTCGACGCG CGGTGAGAGA
GCGTCGAGGA CAGCGCACGG AACCCGAAGA CACGTACGTG TGCGAGTTGA CGTCTATTCG
CCAGCTTAAC ACCGAAATCG CCAATCGCGC GCACAAGGAG CTCGGCGACG TGATTAAAAA
TCACACACTC GTCGGCGCCG TGGACGCGCG CAAAGGCGTG TGGTTACTTC AGCACCAAAC
CAAGCTCTTC ATGGTGGACG CCGTAAAGCT CACCGAGGAA ATGTTCCATC AAATGGCTTT
GAAGAACTTC GCCAACTTTG GGTACCAATC GCTGCAAGAT CCCGCGTCTT TGGCCGAACT
CGCGCTGTGC GCGCTGGAGG ATAAATTCGT CGACGACGAA GAGTGGGACG CGAGCGATGG
CTCCAAGGAG GAAGTCGCAG AGAAAATCGC AGAGATGCTC GTCGAAAAGG CGGACATGCT
CAAGGAGTAT CTCGGCGTCG TCATCGACAA GGAACGGCGT CAGATCACCG GAGTGCCGTC
GATGCTTCCC GGGTACGCGC CGGAAATCGG CAAACTTCCC GAGTTCGTCC TCGCCCTCGC
CGAAGACGTC GATTGGACGA GTGAAAAAGA GTGCTTCGAA ACCTGCGCTC GAGTCATCGG
CGCATTTTTC GCCATGGACT GCTCTTTCCA CGATCCGAAA GCCGAAGAAG GCGACGCCGA
GTCCGACGCT CGTCGCGTCG CTCGCCTCTG CGTCTTTCCC GCGATGAAGC GCCGTCTCGC
CCCGCCTCGT CGTTTCGCCG ACGACGGCAC CGTCATTCAG ATCGCGTGCC TCGAGCAGTT
GTACAAAATT TTCGAGCGCT GTTAGTCGGT CCGCGTCGTC GTCGTCGCCC TCGAACGGCG
TCGGCGTTCC GAATCGTCTC GCGGCGCGTC GCCCGGTCGT AAAGCTTGAA ATCGTGAACC
AACCAGCCGT TCGATTCAAC CGACGCGCGT CGACCGCGCG CGCCTCGGTC GTCACTCCCG
CGCGCGCCAC CGGCTCGACG GACGGACCGA CGCGACCGAC GCGACCGACG CGCGCGTTCG
AATCGTGCAT GCGCGCCGCG CGTCTCGTCG CCGCGCGCGT CGCCGCGAGA ACGCCGACGT
CATCGCGTCG CCTCGCCGCT CGAAAACTCA AAACGACGAC CCGCGCGCGC GACGCCGCGA
TCGCCGAGCG AGCGAGCATG ACGACGGGCG AACGCTCGAA CGCGTCGAAG CTCGCGGCGG
TGCGAGAGGC GATGGCGAAG CGAGGGGTGC GAGCGGTCGT CGTGCCGTCG CAGGATCCGC
ACTTTAGGCG CGTCGGCGAA GCGAAGGCGA ACGAACGAAA CGAGGAACGA CGACGCGCGC
GACGGGAAAG ACTGACGAAC GGGCGAGGGC GTGTTTTTGT GGGGAACGCA GTGAGTACGT
GGCGGCGTGC TTCGAGCGAC GACGATGGTT GAGCGATTTT ACGGGGTCGG CGGGGACGGT
GGTGGTGACG GACGCGGCGG CGTTGTTGTG GACGGATGGA CGGTATTTCG TGCAGGCTGA
AGACGAGCTG AGCGAGGACT GGACTCTGAT GCGAAGTGGG GTGAAGGATG TGCCGGACGT
GAAGAAGTGG TTGTGCGCGG AGGAGGCGGG ACTGGCGTTT ACCGGAGCCA AGGTGGGCAT
CGATCCAAAC GTGCACTCGG TGAGCGAGGC GCGAGGTTTG AGAGAAGCGT TGAGCGCGTG
CGGGATCGAG TTGATGAGCG TCGAAGAGAA CTTGGTAGAT TTGGTTTGGA GCGATCGTCC
ACCGTTCCCG AAGACGCCGC TCAGAGTGCA CCCGATGGAG TACGCGGGGA AGAGCGTGGC
GGAAAAATTG GAAAACCTTC GAGAAAAAAT GAAGGAAAAC GACGCGCAGA AGCTCGTCGT
GAGCTCGTTG GATGACGTCA TGTGGCTATG CAATGTTCGA GGCGGTGATG CACCGTGTAA
TCCGGTGACG TTGTCTTACG TCTTGGTGGG TGAAAACGAC GCTTCGTTTT ACGTCGACAC
GGACAAGGCG ACGCCTGAAG TCGTGGCGCA TCTCGCCGAG GCAAACGTGA CGATCAAGCC
GTACGAAGAC ATGGCCAAAG ACGTGTATGC CGCGGCACAG CGCGGTGAGC GACTCTGGAT
GGACGTCGAT AAGGTCTCCA TCGCCATGCT CGAACAGGCT GAAGCCGGAG CCGCCGAAGC
GCCCAAGGAT GCGAAAAAGG TGAAGACGGA GAGCGCGCCG TCCGCCATCA AGGAGGGCAC
GTGTCCGGTC CCGATCGCAA AGGCGGTGAA GAATGAGGCC GAGATGGCCG GTATGGTCGA
AGCCCACCTC ATGGATGGCG CTGCGATGGC TGAATTCTGG TGCGCGATCG AGCGAGACGT
CGCCGAGGGG CGCGCCATTG ACGAGTACGA AGCTGGCGAG AGGGTCTTGG CGTGCCGAGC
CAAGCAAAAC GGTTTCTTCG AAGAATCGTT CCCGACGATC GCGGGTGAAG GTCCTCATGG
CGCCGTGGTG CACTACCGTG CTTCGAAAAA GAGCGCGAGG GCTATCGGTA AGGACAGCTT
ATTACTCTGC GACAGCGGCG GCCAGTACGC GTGTGGCACG ACGGATGTCA CTCGAACGGT
GCACTTCGGA ACGCCCACCG CTCATCAAAA GGAGTGCTAC ACGCGCGTGC TCCAAGGTCA
CATCGCACTC GACCAAATGG TTTTCCCTGT CGGCACGAAA GGTTTCGTTC TCGACGCCTT
TGCGCGATCG CACCTGTGGG CCAACGGCTT GGATTACCGT CACGGCACCG GCCACGGCGT
CGGCGCGGCG CTCAACGTGC ACGAAGGTCC GCAAGGAATC TCTCCGCGTT TTGGAAACAT
GACGCCCCTT ATGCCAGGAA TGATCTTGAG CAACGAGCCG GGGTATTACG AAGACGGTGC
GTTCGGTATC CGCATCGAGA CGCTTCTGCA AGTGAAGGAG GCGAAGACTG CGCACAACTT
CGGAGACACT GGATTTTTAT GCTTTGACGT CTTGACGTTG ATCCCGATTC AAACGAAACT
CATGGACTTG AGCATTATGA GTGAAAAAGA AATCGCGTGG GTGAACGCGT ATCACGAAAA
AGTTTGGCAA CAAATTTCCC CGCGAGTGTC GGGGGAGACT AAAACGTGGC TCGAACGCGC
GTGTGCAAAG ATTTCCAAGT AG
 
Protein sequence
MAKRGVRAVV VPSQDPHFRR YVAACFERRR WLSDFTGSAG TVVVTDAAAL LWTDGRYFVQ 
AEDELSEDWT LMRSGVKDVP DVKKWLCAEE AGLAFTGAKV GIDPNVHSVS EARGLREALS
ACGIELMSVE ENLVDLVWSD RPPFPKTPLR VHPMEYAGKS VAEKLENLRE KMKENDAQKL
VVSSLDDVMW LCNVRGGDAP CNPVTLSYVL VGENDASFYV DTDKATPEVV AHLAEANVTI
KPYEDMAKDV YAAAQRGERL WMDVDKVSIA MLEQAEAGAA EAPKDAKKVK TESAPSAIKE
GTCPVPIAKA VKNEAEMAGM VEAHLMDGAA MAEFWCAIER DVAEGRAIDE YEAGERVLAC
RAKQNGFFEE SFPTIAGEGP HGAVVHYRAS KKSARAIGKD SLLLCDSGGQ YACGTTDVTR
TVHFGTPTAH QKECYTRVLQ GHIALDQMVF PVGTKGFVLD AFARSHLWAN GLDYRHGTGH
GVGAALNVHE GPQGISPRFG NMTPLMPGMI LSNEPGYYED GAFGIRIETL LQVKEAKTAH
NFGDTGFLCF DVLTLIPIQT KLMDLSIMSE KEIAWVNAYH EKVWQQISPR VSGETKTWLE
RACAKISK
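
The sequences above are printed in blocks of 10 characters, six blocks per line. A minimal sketch of how such text could be turned back into a contiguous string, and how a GC fraction could be computed from it, is shown below. The helper names are hypothetical, and the example runs only on the first line of the gene sequence, so its result is not expected to equal the 62% reported for the whole gene.

```python
def ungroup(text: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a nucleotide sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "CGCCGCGCGT CGCGCGCCGA CGCCGAATCG CGCCGACGTA GCACATCGCC GTCATCGCCT"
seq = ungroup(first_line)

print(len(seq))          # 60
print(gc_fraction(seq))  # 0.75
```

Applying `ungroup` to the full gene or protein block gives a string whose length can be compared with the Gene Length and Protein Length fields.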