Gene OSTLU_19644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19644 
Symbol 
ID5003203 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp579522 
End bp581660 
Gene Length2139 bp 
Protein Length268 aa 
Translation table 
GC content67% 
IMG OID640418624 
Productpredicted protein 
Protein accessionXP_001419425 
Protein GI145350026 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0024] Methionine aminopeptidase 
TIGRFAM ID[TIGR00500] methionine aminopeptidase, type I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00530685 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.519725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATAC CTATTTTCCC CGGCACCAAC CACAGCTTGG TTTGTTTCAA TCGCCTGCGT 
TTTCGCGCCC GCTTCAAAAA CTCGCGCTGC AACTCGGTGC CGTCTCGCAC GACGACGAGC
GCCTTCTCCC CCTCCATCGC CCCGCAGTGC TCGGCGGTGA AGGCGTCGTA AAATATCGAT
GAATCAAACA CTCCGTCACG CGTGAGCGTC AAAACTTCGA TGCGTTGCCC CATCTGCGCC
ATCTCCTTGG CGCGTTGAAT CAACTTGGCG CCCGAGCTCC CAGATTTCAA CGGCGCCGCC
TCGTTCGTGA ACAAATACAC GCTCTTCCGC CCCGCTCGCT TCGGTCCGTT CTCGAGCATG
TGCGACGCCG TCCATAACCC CTTCGTCAGC GCGTCCTCTT GATTGAAAAA ATCGCCCGTC
CCATCGTCGT CCTCGGGTTT CAACTCGCCG AATCGCTCGC GAAATTTGTC GGCGCCCTTC
TCCCCGTTCG CGTATTCGCT CAATTCCAGC GCTCCCGCCG CGCTCGGGTT CTCCGCCCTT
CGCACCTCGC ACACGCGTTC CATCCCTATC CCACCCACGC TCTTCCCGGT GTTGTACGCG
CACACGCCCA GCACGTCGTC CGGCGCCACG ACGACGCGCG CGCGCGCGAA TTCAAAGCAC
GCCCGCGTCG CCGCCGTGAA CGCGCACGCC CCGTCCGCGC GCGTCGCGTC CGCCTCGAAC
ATCGCCGGCG AGCAATCGAT CAGCATCACC ACCGCGTCGC GCTGCGCGTC CGGGTCCCAC
GCGCGCGCGC CGTCGCCGTC GCCGTCGCCG TCTTCCGCCG CGCCATCGAA TTCGAAATCA
TCGTCCGAAT CGTCGTCGTC GCGCGCCATT TCGCCCGCGC GCGCGTCCGA CGGGCGCCGC
GCGACGTCCC CCTTCGCTCG CGCGCCGCCC TCGCGCGCGC GCGTCGCCGT CGCGCCGCCG
CGCGACGTTT TCTGCCGCAC TCTTCGACGC GCGCTTCGAC GCATCGTTTC ATTCGTCATG
CGCGTCGCCG CGCGTCGCGC CGACGCCCTC GCGTTCGCGC GCGCCGTCGC GCGTCGTCCT
CGCGCGTCGC TCGACCGTCT CCGGCGCCCG CGCTTCGGCG CGCGCGCGAC GGAGACGCGC
GCGAAGAAGA AGGGCCTGCT CGGCGACCTG CTCAACGTCG CGAGGGACCG CGCGCGCGAC
GAGGCGACGT GGTACAACGG GCGCGCGCCG CTTCGACCGG GGACGTACGC GCCGCAGAAG
ACGGTGCCGG CGTCGATCGA GCCGCAGCCG CCGTACGCGA GGGACGGACA CCTGCCCGAG
TACGACGACG GCGTCGTGCA GGTGCAGACG ACGGCGGCGG ACGTCGAGGG GATGCGACGC
GCGGGAAGAC TCGCGGCGGA GGTGCTGGAC ATGGCGGAAA AGATGATCAC GCCCGGGACG
ACGACGACGA ACGACATCGA CGAGGCGGTG CACGCGATGA CGATCGCGGC GGGGGCGTAT
CCGAGCCCGT TGAACTACGG CGGGTTTCCG AAGAGCGTGT GCACGAGCCT GAACGAGTGC
ATCTGTCACG GAATCCCGGA CGACACGGTG ATCCTGGACG GAGATATCAT AAATATTGAC
GTCACGGTGT ACCTGAACGG GTATCACGGC GACACGTCGA GGACGATCAT GGTGGGGAAC
GTGACGGAGG AGGTGCGGCG GCTGGTGGAG ACGACGGAGC GAGCGCTGGA CGCGGCGATC
GCGATTTGTA AGCCCGGGAC GCCGGTGAGG AAGATCGGGG CGACGATTCA TCAGATCGCG
GACGACGCCA AGTTCGGGGT GGTGGATAAG TTCGTCGGGC ACGGCGTGGG GAAGGTGTTT
CACAGCGGAC CGACGGTGCG GCATCATCGC AACAACGACC CGGGGACGCT GCGGGTCGGT
CAGACGTTCA CCATCGAGCC CATGCTGACG ATCGGGACGA CTCGAGACAA GATGTGGAAG
GACGGATGGA CGAGCGTCAC CGCGGACGGG AAGTGGACGG CGCAGTGCGA GCACACGCTG
CTCGTCACCG AGACGGGCGT CGACGTCTTA ACCGCGTCGC CGTATCGCGC GTCTCTGTCC
GAGGCGTGAC CGCGCGCCGG ACCCTCGTCG CGTCGTCGC
 
Protein sequence
MGIPIFPGTN HSLVQTTAAD VEGMRRAGRL AAEVLDMAEK MITPGTTTTN DIDEAVHAMT 
IAAGAYPSPL NYGGFPKSVC TSLNECICHG IPDDTVILDG DIINIDVTVY LNGYHGDTSR
TIMVGNVTEE VRRLVETTER ALDAAIAICK PGTPVRKIGA TIHQIADDAK FGVVDKFVGH
GVGKVFHSGP TVRHHRNNDP GTLRVGQTFT IEPMLTIGTT RDKMWKDGWT SVTADGKWTA
QCEHTLLVTE TGVDVLTASP YRASLSEA