Gene OSTLU_41838 details


Gene Information

Locus tag: OSTLU_41838
Symbol:
ID: 5005193
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 367767
End bp: 369751
Gene Length: 1985 bp
Protein Length: 626 aa
Translation table:
GC content: 59%
IMG OID: 640420614
Product: predicted protein
Protein accession: XP_001421129
Protein GI: 145353669
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
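
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the coding sequence ends with one stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 367767
end_bp = 369751
protein_length_aa = 626

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding bases needed for the protein plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span not accounted for by the coding sequence.
# Assumption: these are intronic, since the gene is marked as spliced.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)  # 1985 1881 104
```

The computed span matches the listed gene length of 1985 bp, and the 104 bp difference from the coding length is consistent with the "Is gene spliced" field being Yes.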


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0141589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGG GCGAACGCTC GAACGCGTCG AAGCTCGCGG CGGTGCGAGA GGCGATGGCG 
AAGCGAGGGG TGCGAGCGGT CGTCGTGCCG TCGCAGGATC CGCACTTTAG GCGCGTCGGC
GAAGCGAAGG CGAACGAACG AAACGAGGAA CGACGACGCG CGCGACGGGA AAGACTGACG
AACGGGCGAG GGCGTGTTTT TGTGGGGAAC GCAGTGAGTA CGTGGCGGCG TGCTTCGAGC
GACGACGATG GTTGAGCGAT TTTACGGGGT CGGCGGGGAC GGTGGTGGTG ACGGACGCGG
CGGCGTTGTT GTGGACGGAT GGACGGTATT TCGTGCAGGC TGAAGACGAG CTGAGCGAGG
ACTGGACTCT GATGCGAAGT GGGGTGAAGG ATGTGCCGGA CGTGAAGAAG TGGTTGTGCG
CGGAGGAGGC GGGACTGGCG TTTACCGGAG CCAAGGTGGG CATCGATCCA AACGTGCACT
CGGTGAGCGA GGCGCGAGGT TTGAGAGAAG CGTTGAGCGC GTGCGGGATC GAGTTGATGA
GCGTCGAAGA GAACTTGGTA GATTTGGTTT GGAGCGATCG TCCACCGTTC CCGAAGACGC
CGCTCAGAGT GCACCCGATG GAGTACGCGG GGAAGAGCGT GGCGGAAAAA TTGGAAAACC
TTCGAGAAAA AATGAAGGAA AACGACGCGC AGAAGCTCGT CGTGAGCTCG TTGGATGACG
TCATGTGGCT ATGCAATGTT CGAGGCGGTG ATGCACCGTG TAATCCGGTG ACGTTGTCTT
ACGTCTTGGT GGGTGAAAAC GACGCTTCGT TTTACGTCGA CACGGACAAG GCGACGCCTG
AAGTCGTGGC GCATCTCGCC GAGGCAAACG TGACGATCAA GCCGTACGAA GACATGGCCA
AAGACGTGTA TGCCGCGGCA CAGCGCGGTG AGCGACTCTG GATGGACGTC GATAAGGTCT
CCATCGCCAT GCTCGAACAG GCTGAAGCCG GAGCCGCCGA AGCGCCCAAG GATGCGAAAA
AGGTGAAGAC GGAGAGCGCG CCGTCCGCCA TCAAGGAGGG CACGTGTCCG GTCCCGATCG
CAAAGGCGGT GAAGAATGAG GCCGAGATGG CCGGTATGGT CGAAGCCCAC CTCATGGATG
GCGCTGCGAT GGCTGAATTC TGGTGCGCGA TCGAGCGAGA CGTCGCCGAG GGGCGCGCCA
TTGACGAGTA CGAAGCTGGC GAGAGGGTCT TGGCGTGCCG AGCCAAGCAA AACGGTTTCT
TCGAAGAATC GTTCCCGACG ATCGCGGGTG AAGGTCCTCA TGGCGCCGTG GTGCACTACC
GTGCTTCGAA AAAGAGCGCG AGGGCTATCG GTAAGGACAG CTTATTACTC TGCGACAGCG
GCGGCCAGTA CGCGTGTGGC ACGACGGATG TCACTCGAAC GGTGCACTTC GGAACGCCCA
CCGCTCATCA AAAGGAGTGC TACACGCGCG TGCTCCAAGG TCACATCGCA CTCGACCAAA
TGGTTTTCCC TGTCGGCACG AAAGGTTTCG TTCTCGACGC CTTTGCGCGA TCGCACCTGT
GGGCCAACGG CTTGGATTAC CGTCACGGCA CCGGCCACGG CGTCGGCGCG GCGCTCAACG
TGCACGAAGG TCCGCAAGGA ATCTCTCCGC GTTTTGGAAA CATGACGCCC CTTATGCCAG
GAATGATCTT GAGCAACGAG CCGGGGTATT ACGAAGACGG TGCGTTCGGT ATCCGCATCG
AGACGCTTCT GCAAGTGAAG GAGGCGAAGA CTGCGCACAA CTTCGGAGAC ACTGGATTTT
TATGCTTTGA CGTCTTGACG TTGATCCCGA TTCAAACGAA ACTCATGGAC TTGAGCATTA
TGAGTGAAAA AGAAATCGCG TGGGTGAACG CGTATCACGA AAAAGTTTGG CAACAAATTT
CCCCGCGAGT GTCGGGGGAG ACTAAAACGT GGCTCGAACG CGCGTGTGCA AAGATTTCCA
AGTAG
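
The gene sequence above is laid out in space-separated groups of 10 bases, so it needs to be joined before any computation. As a rough sketch, a GC fraction comparable to the "GC content" field could be computed as follows; the helper name `gc_content` is hypothetical, and only the first line of the sequence is used here for brevity.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a sequence, ignoring whitespace."""
    # Remove the spaces and line breaks of the grouped layout.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First line of the gene sequence above, in its 10-base grouped layout.
first_line = "ATGACGACGG GCGAACGCTC GAACGCGTCG AAGCTCGCGG CGGTGCGAGA GGCGATGGCG"
print(round(gc_content(first_line), 3))
```

Passing the full gene sequence instead of `first_line` would give the value to compare against the listed GC content.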
 
Protein sequence
MTTGERSNAS KLAAVREAMA KRGVRAVVVP SQDPHFRRYV AACFERRRWL SDFTGSAGTV 
VVTDAAALLW TDGRYFVQAE DELSEDWTLM RSGVKDVPDV KKWLCAEEAG LAFTGAKVGI
DPNVHSVSEA RGLREALSAC GIELMSVEEN LVDLVWSDRP PFPKTPLRVH PMEYAGKSVA
EKLENLREKM KENDAQKLVV SSLDDVMWLC NVRGGDAPCN PVTLSYVLVG ENDASFYVDT
DKATPEVVAH LAEANVTIKP YEDMAKDVYA AAQRGERLWM DVDKVSIAML EQAEAGAAEA
PKDAKKVKTE SAPSAIKEGT CPVPIAKAVK NEAEMAGMVE AHLMDGAAMA EFWCAIERDV
AEGRAIDEYE AGERVLACRA KQNGFFEESF PTIAGEGPHG AVVHYRASKK SARAIGKDSL
LLCDSGGQYA CGTTDVTRTV HFGTPTAHQK ECYTRVLQGH IALDQMVFPV GTKGFVLDAF
ARSHLWANGL DYRHGTGHGV GAALNVHEGP QGISPRFGNM TPLMPGMILS NEPGYYEDGA
FGIRIETLLQ VKEAKTAHNF GDTGFLCFDV LTLIPIQTKL MDLSIMSEKE IAWVNAYHEK
VWQQISPRVS GETKTWLERA CAKISK