Gene Information |
Locus tag | OSTLU_19807 |
Symbol | |
ID | 5005001 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 412655 |
End bp | 415317 |
Gene Length | 2663 bp |
Protein Length | 869 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420422 |
Product | predicted protein |
Protein accession | XP_001421145 |
Protein GI | 145353703 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
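
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based, inclusive positions; the record does not state this, but the assumption agrees with the listed Gene Length. The function names `span_length` and `coding_length` are hypothetical helpers and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein: 3 per residue, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the Gene Information table above.
gene_len = span_length(412655, 415317)   # 2663, matches "Gene Length"
cds_len = coding_length(869)             # 2610 = 3 * (869 + 1)
non_coding = gene_len - cds_len          # 53

print(gene_len, cds_len, non_coding)
```

The 53 bp not accounted for by codons is consistent with "Is gene spliced | Yes", but the record lists no exon boundaries, so this difference should not be read as an exact intron length.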
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.137348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.504611 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGG CGACGGAGAC GCCGACGGAG GCGCCGAAGA CGATATATCT GAAGGATTAC GAGCGACCGG CGTACGCGTT CGAAAGGGTG AATTTGGACT TTGAGCTCGG GGAGGCGACG ACGACGGTGA CGTCGACGAT TCGAGTTCGA CCGGCGAACG ATGCGAATGG GAAATCGTTG TTCCTGAACG GCGATGAATC GGTGGAGTTG GCGGCGATCG AGGTGGACGG CGCGAAATTC ACGACGTACG AACGGACGGG GAAGGGGATC ACGCTGCGCG CGCTGCCGAC GGAGGCGTTT GATTTGCGAG TGACGACGAC GATTAAGCCG CAAGAGAACA CCGCGCTCGA GGGACTTTAT AAGTCGTCTG GGAACTTTTG CACGCAATGC GAGGCGGAAG GTTTCCGACG AATCACGTTT TATCAAGATA GACCGGATGT GATGTCGATA TTCACGACGC GCATCACGGC GGATAAGACG AAGTATCCGG TGCTGCTCGG CAACGGAAAC TTGGTGGATT CTGGAGATTT GGAGGGTGGT AAGCACTTCA CCGTGTGGGA AGATCCGTGG GCAAAACCGT GTTACCTTTT CGCGCTCGTC GCGGGTGATC TCGGGATGGT GGAGGATAAA TTCAAGACGA TGACGGGCAA AGAAGTGACG CTTCGCATCT TCACCGAGAC GCACAACTTG GACAAGTGCG CGCACGCGAT GACGAGCTTG ATCAAATCCA TGAAATGGGA CGAAGACACG TATGGTTTGG AGTACGACTT GGAGCTTTTC AACATCGTCG CCGTGGACGA TTTCAACATG GGCGCCATGG AGAACAAGTC GCTGAACATT TTCAACTCGC GTCTCGTGTT GGCGACGCCG CAGAGCGCCA CGGATGCCGA TTACGCCGCC ATTGAAGGTG TCGTGGCGCA CGAATACTTT CACAACTACA CTGGAAACCG TGTGACGTGT CGTGATTGGT TCCAACTGTC CCTCAAGGAA GGGCTGACTG TGTACAGAGA CCAAGAGTTC AGCGCGGATA TGAACTCGCG AGGCGTCAAG CGCATCGGCG ACGTCTCGCG CCTTCGCATG GCGCAATTCG CCCAAGACGC GGGTCCGATG GCGCACCGGA TTCGCCCGGG AATCGTACAT TAAGATGGAT AACTTTTACA CCGTGACAGT TTACGAAAAG GGAGCGGAAG TCGTGCGCAT GTACGAGACG CTCCTCGGCA AGGATGGGTT CCGCAAGGGG ATGGATTTGT ACTTTGAACG TCACGACGGT CAAGCGGTGA CGACGGAGGA TTTCTTCGCC GCCATGTGCG ACGCCAACGG TGCGGACTTG TCCACGTTCA AGCCCTGGTA CTCCCAAGCG GGTACGCCGC GCGTCACCGC GAACGGGTCT TACGACGCCG CCGCGAAGAC GTTCACCCTC GAATGCTCGC AAGTGGTTCC GAAAACGCCC GGTCAAGACT CCAAGGTTCC GGTCTTGTGC CCGATCGCCG TTGGTCTCGT TGGTCCCGAC GGTGCAGACA TGAACCTCAC GATCGACGGC AAATCCCACG GCACGACGGC GGTGCTTCGT TTCGATCAAG CCTCGGCGAC GTACACTTTC ACCGGCGTCG ACGCCAAGCC CGTGCCGAGC ATCTTGCGCA ACTTCAGCGC GCCCGTGCGT TTGACGACCA ACTTGACGCA AGACGACTTG TTGTTCCTCA TGGCGAACGA CTCGGACGCG TTCAACCGAT GGGAGGCTGG GCAGACGCTG CTCAGAAACC TCTGCCTGGA TCTGATCAAG GGCGGCGAGC AGTCATTCAA 
GATGAACGAC GCCATCACGG CGGCGATGCG CACGATTCTT TCGGGCGCCA AGGCTGCCGA CGCGGACAAG GCGTTCATCG CGCGCGCCAT GATGGTGCCT TCCGAGGGCG AGCTGAGCGA CATGCTCGAA GAGGGCACGG TGGATCCCGC CGCCGTGCAC GCCGCTCGCG ACTTTGTCAT GAAGACGCTC GCCACGGAGC TTCGCGCTGA GTTGGAAGCC ACGGCGCAAG CGAACAGCGC CGCGGTGTAT TCGAACGAAC CCGCCGATCG CGCCGCGCGA TCGCTGAAAA ACGCGTGCAT CGGATATTTG TCGTATTTGG ACGCGCCGGA AATCGCCGCG ATGACGTACG AGCGCTACGT CGCCGCGGAC AACATGACGG ATAAGATTGC CGCCCTGAGC GCGCTCAGCG GCAAAGACTG CGACGAACGC ATCAAAGCCA TCGATGCGTT TTACGCCGAG TGGTCGCACG ACCCGCTCGT CATGAACAAA TGGCTCAGCA TCCAAGCCGC GTCGTCGCTC CCGAACAACC TCGCCAACGT TCGCGCGCTC GCCGCCGGCT CCGCCTTCGA CATCAAGAAC CCCAACAAAG TGTACTCCCT CATCGGTGGT TTCTGCGCCT CTCCCACCAA CTTTCACGCC ATCGACGGCT CCGGTTACGA ATTCCTCGCC GACATCGTCC TCGAGCTCGA CGATCTCAAC GGCCAAGTCG CCTCTCGCAT GGTGTCCGCG TTTACGCGTT GGCGCAAATT CGAGCCGACG CGCGCGTCGG CGATGAAGGC GCAGCTCGAG CGCATCGCCG CCAAGACGGG TCTGAGCGAA AACGTCTTCG AGATCGTCTC CAAGTCGCTC GAGTGATCGC GCG
|
Protein sequence | MTTATETPTE APKTIYLKDY ERPAYAFERV NLDFELGEAT TTVTSTIRVR PANDANGKSL FLNGDESVEL AAIEVDGAKF TTYERTGKGI TLRALPTEAF DLRVTTTIKP QENTALEGLY KSSGNFCTQC EAEGFRRITF YQDRPDVMSI FTTRITADKT KYPVLLGNGN LVDSGDLEGG KHFTVWEDPW AKPCYLFALV AGDLGMVEDK FKTMTGKEVT LRIFTETHNL DKCAHAMTSL IKSMKWDEDT YGLEYDLELF NIVAVDDFNM GAMENKSLNI FNSRLVLATP QSATDADYAA IEGVVAHEYF HNYTGNRVTC RDWFQLSLKE GLTVYRDQEF SADMNSRGVK RIGDVSRLRM AQFAQDAGPM AHRIRPGIGA EVVRMYETLL GKDGFRKGMD LYFERHDGQA VTTEDFFAAM CDANGADLST FKPWYSQAGT PRVTANGSYD AAAKTFTLEC SQVVPKTPGQ DSKVPVLCPI AVGLVGPDGA DMNLTIDGKS HGTTAVLRFD QASATYTFTG VDAKPVPSIL RNFSAPVRLT TNLTQDDLLF LMANDSDAFN RWEAGQTLLR NLCLDLIKGG EQSFKMNDAI TAAMRTILSG AKAADADKAF IARAMMVPSE GELSDMLEEG TVDPAAVHAA RDFVMKTLAT ELRAELEATA QANSAAVYSN EPADRAARSL KNACIGYLSY LDAPEIAAMT YERYVAADNM TDKIAALSAL SGKDCDERIK AIDAFYAEWS HDPLVMNKWL SIQAASSLPN NLANVRALAA GSAFDIKNPN KVYSLIGGFC ASPTNFHAID GSGYEFLADI VLELDDLNGQ VASRMVSAFT RWRKFEPTRA SAMKAQLERI AAKTGLSENV FEIVSKSLE
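
The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that clean-up plus a GC-percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example uses only the first two blocks of the gene sequence, so its result is not expected to equal the 59% reported for the full gene.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the display whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(displayed.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGACGACGG CGACGGAGAC")
print(len(fragment), gc_percent(fragment))  # 20 65.0
```

To check the full record, the whole "Gene sequence" field (which wraps across two lines here) would be pasted into `clean_sequence` in the same way.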