Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26022 |
Symbol | |
ID | 5004217 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 103960 |
End bp | 106875 |
Gene Length | 2916 bp |
Protein Length | 924 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419638 |
Product | predicted protein |
Protein accession | XP_001420077 |
Protein GI | 145351419 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.812469 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGACCCGACG CGATGACGGC CACCGCCGAA CCCAAAGAAA TCAGGCTCAG TGATTACGCC CCGTTCCCGT ACGCGTACGA TGAGGTGCGG CGCCGACGCG ACGCGACGCG ACGCGACGCG ACGCGATGCG ATCTGAAACT CTGAGAAATA TTAAAGAAAA CCGACGACTG ACGCGCGCGC GCGCGCAGGT AACGCTCGAT TTCGCGCTCG ATGGTGAGTA CGCCACCGTC ACGGCGATGT CGGTGATTAC GCCGATCGAA GGACGCGACC GCACGCGAGG GCTGGTGCTG AACGGGAAGA TGCCGTTCTT TGAGCTGCTC GGCGCGCGCG TGAACGGCGA GACGTTACCC GCGGATCGGT ACTCGATTGA GGCGGATGGG GACGACACGC TGATGATAAT CAAAGACACG CCCGACGTGA GGTTTGAGTT GGAGATCACG ACGAAGTTTA AGCCGCAGGA TAACACGGAA CTGAGTGGGT TGTACAAATC GAGCGGAACG TTTTGCACGC AGTGCGAGGC TGAGGGATTT CGGTCCATCA CGTTCTATCC CGATCGTCCG GACGTCATGA GCGTGTTTAC GACGAAGATC ACGGCGGATA AGGCCAAGTA TCCGGTGTTG CTTTCGAATG GAAATTTGAT TGATTCAGGC GACGCGGCGA ACGGCGCGCA CTTTGCGACG TGGAAAGACC CTTGGCGAAA GCCGTGCTAC TTGTTCGCGC TCGTCGCGGG CGATTTAGCG GTGGTCGAGG ACACGTTCAC GACCATGAGT GGTCGCGAAG TGGCGCTCAA GATTTACGCG CAGGCAAAGA ACATCGATCG ATGCGATTTC GCCATGGCAA GCTTGAAGCG AGCGATGAAG TGGGACGAGG ATCGCTTTGG TTTGGAGTAC GATTTGGATC TGTTCAACAT CGTCGCCGTG GACGACTTCA ACATGGGCGC GATGGAAAAC AAGTCTTTGA ACATTTTTAA TTCTCGCTTG GTGTTGGCGT CGGAAGAGAG CGCGACAGAT GCGACGTTTG AGCGCATCGA AGGAGTCATT GGTCACGAAT ATTTTCACAA CTACACGGGC AACCGCGTCA CCTGCCGCGA CTGGTTTCAA CTGTCGCTCA AGGAAGGTCT GACTGTGTTC CGAGACCATG AGTTTACGTC CGATTTGCAC TCTCGCGCGG TCAAGCGTAT CGCAGACGTG AGATACTTGC GTGCGGCGCA GTTTGCGGAG GATGCGTCGC CGCTCGCGCA TCCAGTTCGA CCCGAGGCGT ATCAAAAAAT TGACAACTTC TACACCCTCA CTGTATACGA GAAGGGTTCG GAGCTCATTA GAATGTACTC GACGCTCTTG GGCAAGGACG GGTTTCGCAA GGGCATGGAC TTGTACTTTC AGCGTCACGA CGGTCAAGCG GTGACGACGG AGGACTTTTT CCAAGCCATG TCGGACGCAA ACTCGACGAA CATTGAAAAG CTCAAGCGTT GGTACTCTCA AGCGGGCACG CCGGCGCTCA ACGCCGAAGG CACGTACGAC GCCGTCACGA AAACGTACGC GTTGACTTTA ACGCAAACGT TACCACAGAC GAATGATGTG AAGGGCGCGG CGGATAAGAA ACTACCGCAG CTCATTCCCG TCGCTGTCGG TTTGCTCGGC GAAGATGGCA AAGATATGGT GCTTGATGGC GATATCAAGT GTGAAGGCGA CGCCGAGGCG ACTTTGGACG AAACGAAGAC GACGGCGGTG TGTCGGTTGA CTGAGTTCAA ACAGACGTTT ACCTTCACCA ACATAACGTC CAAGCCAGTG CCGAGCGTTT TGCGCGGGTT CTCTGCGCCA GTCAAGCTCA CGATGACTCC AGAACTCACC ACGGATGAGT TGTTGTTCTT GCTCGCCAAC GATAGCGATG AATTCAACCG ATGGGAAGCG GCGCAGAAGA TTGCGACGAG TATTTTGATT CGCTTGTGCA AAAAGCACAA CGATGACAAA GCGCTCAAAA TTGAAGATGT GGATGTTACT TCGGATCCGT CATGGGCGAT ATACTCCGCG GCTTGCTGTC TGATCGTGAA GGACGCGACC GCCAATCGCC TCGATCGTGC GTGGGTCGAG GAAGCACTGA ATTTTCCTGG ACCTAGCCAG TTGATTCAAG ATCTCGCCCC TGGTGTGAAT CCGGTGAATA CGTATAGGGT GTGCAAGGCA TTTGCTCGCG CGTTTGCCAA GGAATCTCGC GTCGAGCTCG AGGCGGCGTT GGCGACTTGC GATGCCGAAG CCGCCGGTTT GGCGTACGAC GTGGACGGAC CGCAGGTCTC GCGACGTGCC TTGCGTGGAT ACGCGATTCG TATGTTGGGC TCTATCGGTG GTGATGACGT GTCTAGCTCC ATCGCAAGCG CGTACTCGAG CGCGAAGAAC ATGACAGATA CCGTCTCCGC GCTCACTGGT CTGTGCGGAC ACGACGATTC TTCCGCCGCG AAGAAGAAGG CGTTTGATGA CTTTTTGAAC AAGTGGAAGG ATGACAACAA CGTCAGTTGC ACTTGGCTTC GCATGGTTGC TTCTGATGCC GGTAAAGGAG GCGCCAACGC CATCGACGAA ATGAAGCGAC TCATGGCGAG TGACGTATAC GACGCGAAGA ATCCGAACAA ATTTTACTCC CTCATTGGCG GCTTCGCCGG CGGCAACATC GAAGGTTTCC ACGCCGCCGA CGGCAGCGGA TACGAATTCG TCGCCGATGT TTTGCTGCAA ACCGACGCCA TTAACCCGCA GGCATCTTCG AGAATGGCTT CACCATTCAC GAAGTGGCGG TTGTACGACG AAAATCGCCA AAACCTGATG AAGGCGCAAC TCGAACGATT GCTTGCGCAA AAGTTGTCGC CGAATCTCTT TGAAATCATC TCGAAGGCGA TTAAGGGCTG AACGAAACGA CTTTAATTAA ACGATG
|
Protein sequence | MTATAEPKEI RLSDYAPFPY AYDEVTLDFA LDGEYATVTA MSVITPIEGR DRTRGLVLNG KMPFFELLGA RVNGETLPAD RYSIEADGDD TLMIIKDTPD VRFELEITTK FKPQDNTELS GLYKSSGTFC TQCEAEGFRS ITFYPDRPDV MSVFTTKITA DKAKYPVLLS NGNLIDSGDA ANGAHFATWK DPWRKPCYLF ALVAGDLAVV EDTFTTMSGR EVALKIYAQA KNIDRCDFAM ASLKRAMKWD EDRFGLEYDL DLFNIVAVDD FNMGAMENKS LNIFNSRLVL ASEESATDAT FERIEGVIGH EYFHNYTGNR VTCRDWFQLS LKEGLTVFRD HEFTSDLHSR AVKRIADVRY LRAAQFAEDA SPLAHPVRPE AYQKIDNFYT LTVYEKGSEL IRMYSTLLGK DGFRKGMDLY FQRHDGQAVT TEDFFQAMSD ANSTNIEKLK RWYSQAGTPA LNAEGTYDAV TKTYALTLTQ TLPQTNDVKG AADKKLPQLI PVAVGLLGED GKDMVLDGDI KCEGDAEATL DETKTTAVCR LTEFKQTFTF TNITSKPVPS VLRGFSAPVK LTMTPELTTD ELLFLLANDS DEFNRWEAAQ KIATSILIRL CKKHNDDKAL KIEDVDVTSD PSWAIYSAAC CLIVKDATAN RLDRAWVEEA LNFPGPSQLI QDLAPGVNPV NTYRVCKAFA RAFAKESRVE LEAALATCDA EAAGLAYDVD GPQVSRRALR GYAIRMLGSI GGDDVSSSIA SAYSSAKNMT DTVSALTGLC GHDDSSAAKK KAFDDFLNKW KDDNNVSCTW LRMVASDAGK GGANAIDEMK RLMASDVYDA KNPNKFYSLI GGFAGGNIEG FHAADGSGYE FVADVLLQTD AINPQASSRM ASPFTKWRLY DENRQNLMKA QLERLLAQKL SPNLFEIISK AIKG
|
| |