Gene OSTLU_25439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25439 
Symbol 
ID5005009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp440803 
End bp444141 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table 
GC content59% 
IMG OID640420430 
Productpredicted protein 
Protein accessionXP_001421003 
Protein GI145353402 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.088116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGAA TTCGACGCGC GCCAGGCGTG GAGTGTCGAG CGACGAAGAC GAAGACGAGT 
AAACCGACGA TCGGGACGAA GACGGCGCCG AGCGATGCGT TGAGCCTGGC GAAAGACGTG
CGGACGTATT TGGAGTCTGA GAGTAACTTG GACGGAGAGG AACACTCGAG GGAGGTGGTG
ATCGCGCCGG TGAGCGGGGA GAAGATTGAG GCTCGAGCGC GAGGCGCGGC GAACGGGGAG
TCGACGTCGG CGACTACGGA GAAGGAAATC GTGATTGAGA TCGAGTACGA AACGGACGAA
AGCGCGATGG GAATCGCGGC GGATGAATCG TACTTTTTGT GCCCGGGGGC GTCGAGGCGG
CCGGCGTGTT GGTTTCCGTG CGTCGAGCGA GGGGACGTGC TGACGACGTT TGATTTCTCG
GTGAGCGCGC CGAGGGATTT GCAAGTCATC ACGTCCGCGC ATTGGGATCG CGTCGAGCGG
TGCAACGACA TCGATGAAGA CGAAGACGGC TTTAGGCTGC GACATCACTT CACGGGGATG
TATCCGACGT TTGCGCACGA AATGCGATTG ATTTGTGGCA GGTTCAAGGC GGTGTCGAGC
CCGATCAAAG GCGTGACTTT GTTCGCTCCC AAAGACGGCG ATTACGCCGA GCGCTTGGAG
ATCGCGGCCG CGGGTGTGAG TAAGGCTATC GTGGCGTACG AGGAATATCT CGGGCATCCG
TACCCTATTT CGTGCTTGAA TATCGTCTTC ATGCCGGACG AGTACGTGGG AGCGCGCGAT
AGCTTGGGCG CGTGCATGAA CATTCACTCG GCTAAATGGT TGATAGACCC GACGCTCAAC
ACGGCTTTGC TCGATGCGCG CGTGCATATC GCCACCGCCA TCGCGCGACA GTGGTTCGGG
GGCGTCGTCG TCCCGGCGGA TACCACGGAC TGTTGGGTCG TGGAAGGACT GGCGCAGTAT
CTCGCCGGCG CGTATGTGAA AAAGTTGACG GGATTGAACG AACTGTCGTT CAGGCGCATG
CGCGACATGC AATTGACGGC GCGGATGGAC GACGGTGAAT CACTCCCCCC TCTCGCCAGT
CGTGCGGCGC GCATTTGGCG CGCCGGACAG TACGCCGGCC CCGACCTTGC CGCCGGCGGG
ACGCCGAAGC CACTCTCGGC GAGCGTCGAG CGCGGGTTGC AAGCCAAGGC GGTGACGATC
ATTTACATGC TCGAGAAACG CCTCGGGCCG GATGTCATGC AGAAGGTGTT GAAATATTTC
GCAGGATTGC ATGTTCGTAG AAATAAGAAG GAGGGCGGGA CGCGCGCTGG TCCCTCCGCG
GAGGTGCTGT CGAGCAACGC GCGTTGGATT CACACGCTGC AACTCTTCGA TCACTGCCGC
GCGACGGTCA ACTTGGGCAA AGGTGAGGTG AACTCCTTCT TAGAACGCTG GGTGTATGGC
GCGGGCACGC CGAAGCTGTG CGTTGGATAC GTTGTGAAAC GTAGGAAAAA CGTCATGGAG
TTTGCGGTCA AACTCGAAGG GAGCGTCGCG GCGGCGGCGG CCGACAGAGC GGCACTCGCC
GTCGCCAGAA ACCATCGCAC TTCGGTCACC GTGCGCATGC GCGAAGAGAA CCGCCCGGAT
GCGAACGATC ACGTCGTATC GCTAGGACAT TCTGCTTGGC AGTTGATGGA GATTCCTCTG
CAACCGAAAA TCAAAGATCG TCGCCCGAAG ACAATCATCG AATCTGGAGG TGACCCCGAA
CTCATCGCCG CGATGGATTG TCCGGTACGA TACGTCCGCG TGGACCCGGA ATTCGAGTGG
ATGGGCAACA TAGAACAAAG TGCGAAGCAG GTCGGGTTAG AGTCGATGAT GGCGCAGATG
CTGGAGAAAG AAAAAGACAT CGTCGCGCAA ACCATCGCCG TGGAATTTCT CGGTCGCCGC
GTCGCCAACG GATCCGTGAG CGCAGTTCTC GTGCTGGATA AGTGTCTGAA CTCTGAGGAT
ACCTTCTGTC GCGTGCGCGC CGAAGCAGCG CTAGCGCTCG GCAAAAGTGC GAGCGAGAAA
ACGCAATGGG GCGGATTGAA CGCGTTGATT CGGCATTACA AAAAGTTTCA GTGCGACGCC
AACACGGGAA AGCCGAAACA AAACGACTTT AGAGACCTCG CGAAAGTCAT CGTTGACGAA
GCCGTCATCA CGGCGCTTGG TTACGTCATG GATAACGGAA TCACCCATCA GGATGCGCTC
GAGGCTATCG TTGCCCGACT AAAGTATAAC GACAACGAGG GCAATCCGCA GAGTGATGAC
GGATTCGTGG CGACGTGCAT CGCCGCGCTC GGCCGCTGCG TTCCCGCGGA TGGCAAGCAA
CTGCAAACCG TCATGCAGAC GATTCATTAC TACATTCGGC GTGATGACAG ATTTCCAAGC
GACGACCTTT GCGTGACGTG CGCTGGAATC AGAGCGTTAG GCTTACTCGC TTCGACGATC
GACTCGAAAG AGCTCCGCGC CGACGCCGAA CACGTCGCCA AACTTGGATT CGCGCTCGGT
AGACAATCCA CCGTGCTCGC TTCCGCGGAC ACGATGATGT ACCTTCGCTT CGTCGAAACT
AAGAGCGAGA TCGAAGCGCT CAAGTACATG CTTGAACGAT CGGCGCAAGA GTCTGCGGCG
ACAAAGGCTG CCATGTTGTG GAGCGCGTGC GAGTACTTGC AATCCGTCGC CGGGGCGGAT
TCTTTGAAGG GCGTCTCGAA GCCGATATTG GCCGACCTCC GTGATCTCGT GCTAAAGGGC
GGAAGCGAAA TTGCGAGCGC GGCGTTTTCG ATCCTCGCCA CGCTCGCTTC GCAAGATGAG
AGCCTGAGTG AGATTCGAGA ATCCATCGCC GAGGCGATTC GGCGCGCGGG CGATCAAGTC
CCAGTCGGCG TCGATGTCCA CGCCGCGCAC GCCGCGCCAA CGGCTGAGGA TGAAGTCAAG
CGCGCGGAGA AGGAAGCTAA GCGTGCTCGG AAGCGAGAAC GCGAGCGCTT GCGACAAGCC
GCGCGCGCCG AGGTAGACTT GAAAAACGCT GAACAGCGTA GAATCGACGC CGAGGCGGCG
TTTGTCGAAC AGCAAGCGGA ACACGTACAA GACGACAAGA CGACCGTCTC CGAGCGCACG
GCGACGATGA TGGGATCAGA AGGCGACGCC ACGCCTCAAG GTGGTACTCC GGTGGGTGGC
CTGAAGCGCG CGAGAGATCA GAATCCGAGC GTCACGCAGT TTGCCGTGGA GCCTGCGGCG
CCCGTCGCGA CGGAGGACGC CGCGCCCGCC GCGACGGAGG ACGCCGCGCC CGCGAAGAAG
CTCAAACTTT CGTTCAAAAT TAAAAAACCC ACGACGTAG
 
Protein sequence
MDGIRRAPGV ECRATKTKTS KPTIGTKTAP SDALSLAKDV RTYLESESNL DGEEHSREVV 
IAPVSGEKIE ARARGAANGE STSATTEKEI VIEIEYETDE SAMGIAADES YFLCPGASRR
PACWFPCVER GDVLTTFDFS VSAPRDLQVI TSAHWDRVER CNDIDEDEDG FRLRHHFTGM
YPTFAHEMRL ICGRFKAVSS PIKGVTLFAP KDGDYAERLE IAAAGVSKAI VAYEEYLGHP
YPISCLNIVF MPDEYVGARD SLGACMNIHS AKWLIDPTLN TALLDARVHI ATAIARQWFG
GVVVPADTTD CWVVEGLAQY LAGAYVKKLT GLNELSFRRM RDMQLTARMD DGESLPPLAS
RAARIWRAGQ YAGPDLAAGG TPKPLSASVE RGLQAKAVTI IYMLEKRLGP DVMQKVLKYF
AGLHVRRNKK EGGTRAGPSA EVLSSNARWI HTLQLFDHCR ATVNLGKGEV NSFLERWVYG
AGTPKLCVGY VVKRRKNVME FAVKLEGSVA AAAADRAALA VARNHRTSVT VRMREENRPD
ANDHVVSLGH SAWQLMEIPL QPKIKDRRPK TIIESGGDPE LIAAMDCPVR YVRVDPEFEW
MGNIEQSAKQ VGLESMMAQM LEKEKDIVAQ TIAVEFLGRR VANGSVSAVL VLDKCLNSED
TFCRVRAEAA LALGKSASEK TQWGGLNALI RHYKKFQCDA NTGKPKQNDF RDLAKVIVDE
AVITALGYVM DNGITHQDAL EAIVARLKYN DNEGNPQSDD GFVATCIAAL GRCVPADGKQ
LQTVMQTIHY YIRRDDRFPS DDLCVTCAGI RALGLLASTI DSKELRADAE HVAKLGFALG
RQSTVLASAD TMMYLRFVET KSEIEALKYM LERSAQESAA TKAAMLWSAC EYLQSVAGAD
SLKGVSKPIL ADLRDLVLKG GSEIASAAFS ILATLASQDE SLSEIRESIA EAIRRAGDQV
PVGVDVHAAH AAPTAEDEVK RAEKEAKRAR KRERERLRQA ARAEVDLKNA EQRRIDAEAA
FVEQQAEHVQ DDKTTVSERT ATMMGSEGDA TPQGGTPVGG LKRARDQNPS VTQFAVEPAA
PVATEDAAPA ATEDAAPAKK LKLSFKIKKP TT