Gene OSTLU_26022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26022 
Symbol 
ID5004217 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp103960 
End bp106875 
Gene Length2916 bp 
Protein Length924 aa 
Translation table 
GC content55% 
IMG OID640419638 
Productpredicted protein 
Protein accessionXP_001420077 
Protein GI145351419 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.812469 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGACCCGACG CGATGACGGC CACCGCCGAA CCCAAAGAAA TCAGGCTCAG TGATTACGCC 
CCGTTCCCGT ACGCGTACGA TGAGGTGCGG CGCCGACGCG ACGCGACGCG ACGCGACGCG
ACGCGATGCG ATCTGAAACT CTGAGAAATA TTAAAGAAAA CCGACGACTG ACGCGCGCGC
GCGCGCAGGT AACGCTCGAT TTCGCGCTCG ATGGTGAGTA CGCCACCGTC ACGGCGATGT
CGGTGATTAC GCCGATCGAA GGACGCGACC GCACGCGAGG GCTGGTGCTG AACGGGAAGA
TGCCGTTCTT TGAGCTGCTC GGCGCGCGCG TGAACGGCGA GACGTTACCC GCGGATCGGT
ACTCGATTGA GGCGGATGGG GACGACACGC TGATGATAAT CAAAGACACG CCCGACGTGA
GGTTTGAGTT GGAGATCACG ACGAAGTTTA AGCCGCAGGA TAACACGGAA CTGAGTGGGT
TGTACAAATC GAGCGGAACG TTTTGCACGC AGTGCGAGGC TGAGGGATTT CGGTCCATCA
CGTTCTATCC CGATCGTCCG GACGTCATGA GCGTGTTTAC GACGAAGATC ACGGCGGATA
AGGCCAAGTA TCCGGTGTTG CTTTCGAATG GAAATTTGAT TGATTCAGGC GACGCGGCGA
ACGGCGCGCA CTTTGCGACG TGGAAAGACC CTTGGCGAAA GCCGTGCTAC TTGTTCGCGC
TCGTCGCGGG CGATTTAGCG GTGGTCGAGG ACACGTTCAC GACCATGAGT GGTCGCGAAG
TGGCGCTCAA GATTTACGCG CAGGCAAAGA ACATCGATCG ATGCGATTTC GCCATGGCAA
GCTTGAAGCG AGCGATGAAG TGGGACGAGG ATCGCTTTGG TTTGGAGTAC GATTTGGATC
TGTTCAACAT CGTCGCCGTG GACGACTTCA ACATGGGCGC GATGGAAAAC AAGTCTTTGA
ACATTTTTAA TTCTCGCTTG GTGTTGGCGT CGGAAGAGAG CGCGACAGAT GCGACGTTTG
AGCGCATCGA AGGAGTCATT GGTCACGAAT ATTTTCACAA CTACACGGGC AACCGCGTCA
CCTGCCGCGA CTGGTTTCAA CTGTCGCTCA AGGAAGGTCT GACTGTGTTC CGAGACCATG
AGTTTACGTC CGATTTGCAC TCTCGCGCGG TCAAGCGTAT CGCAGACGTG AGATACTTGC
GTGCGGCGCA GTTTGCGGAG GATGCGTCGC CGCTCGCGCA TCCAGTTCGA CCCGAGGCGT
ATCAAAAAAT TGACAACTTC TACACCCTCA CTGTATACGA GAAGGGTTCG GAGCTCATTA
GAATGTACTC GACGCTCTTG GGCAAGGACG GGTTTCGCAA GGGCATGGAC TTGTACTTTC
AGCGTCACGA CGGTCAAGCG GTGACGACGG AGGACTTTTT CCAAGCCATG TCGGACGCAA
ACTCGACGAA CATTGAAAAG CTCAAGCGTT GGTACTCTCA AGCGGGCACG CCGGCGCTCA
ACGCCGAAGG CACGTACGAC GCCGTCACGA AAACGTACGC GTTGACTTTA ACGCAAACGT
TACCACAGAC GAATGATGTG AAGGGCGCGG CGGATAAGAA ACTACCGCAG CTCATTCCCG
TCGCTGTCGG TTTGCTCGGC GAAGATGGCA AAGATATGGT GCTTGATGGC GATATCAAGT
GTGAAGGCGA CGCCGAGGCG ACTTTGGACG AAACGAAGAC GACGGCGGTG TGTCGGTTGA
CTGAGTTCAA ACAGACGTTT ACCTTCACCA ACATAACGTC CAAGCCAGTG CCGAGCGTTT
TGCGCGGGTT CTCTGCGCCA GTCAAGCTCA CGATGACTCC AGAACTCACC ACGGATGAGT
TGTTGTTCTT GCTCGCCAAC GATAGCGATG AATTCAACCG ATGGGAAGCG GCGCAGAAGA
TTGCGACGAG TATTTTGATT CGCTTGTGCA AAAAGCACAA CGATGACAAA GCGCTCAAAA
TTGAAGATGT GGATGTTACT TCGGATCCGT CATGGGCGAT ATACTCCGCG GCTTGCTGTC
TGATCGTGAA GGACGCGACC GCCAATCGCC TCGATCGTGC GTGGGTCGAG GAAGCACTGA
ATTTTCCTGG ACCTAGCCAG TTGATTCAAG ATCTCGCCCC TGGTGTGAAT CCGGTGAATA
CGTATAGGGT GTGCAAGGCA TTTGCTCGCG CGTTTGCCAA GGAATCTCGC GTCGAGCTCG
AGGCGGCGTT GGCGACTTGC GATGCCGAAG CCGCCGGTTT GGCGTACGAC GTGGACGGAC
CGCAGGTCTC GCGACGTGCC TTGCGTGGAT ACGCGATTCG TATGTTGGGC TCTATCGGTG
GTGATGACGT GTCTAGCTCC ATCGCAAGCG CGTACTCGAG CGCGAAGAAC ATGACAGATA
CCGTCTCCGC GCTCACTGGT CTGTGCGGAC ACGACGATTC TTCCGCCGCG AAGAAGAAGG
CGTTTGATGA CTTTTTGAAC AAGTGGAAGG ATGACAACAA CGTCAGTTGC ACTTGGCTTC
GCATGGTTGC TTCTGATGCC GGTAAAGGAG GCGCCAACGC CATCGACGAA ATGAAGCGAC
TCATGGCGAG TGACGTATAC GACGCGAAGA ATCCGAACAA ATTTTACTCC CTCATTGGCG
GCTTCGCCGG CGGCAACATC GAAGGTTTCC ACGCCGCCGA CGGCAGCGGA TACGAATTCG
TCGCCGATGT TTTGCTGCAA ACCGACGCCA TTAACCCGCA GGCATCTTCG AGAATGGCTT
CACCATTCAC GAAGTGGCGG TTGTACGACG AAAATCGCCA AAACCTGATG AAGGCGCAAC
TCGAACGATT GCTTGCGCAA AAGTTGTCGC CGAATCTCTT TGAAATCATC TCGAAGGCGA
TTAAGGGCTG AACGAAACGA CTTTAATTAA ACGATG
 
Protein sequence
MTATAEPKEI RLSDYAPFPY AYDEVTLDFA LDGEYATVTA MSVITPIEGR DRTRGLVLNG 
KMPFFELLGA RVNGETLPAD RYSIEADGDD TLMIIKDTPD VRFELEITTK FKPQDNTELS
GLYKSSGTFC TQCEAEGFRS ITFYPDRPDV MSVFTTKITA DKAKYPVLLS NGNLIDSGDA
ANGAHFATWK DPWRKPCYLF ALVAGDLAVV EDTFTTMSGR EVALKIYAQA KNIDRCDFAM
ASLKRAMKWD EDRFGLEYDL DLFNIVAVDD FNMGAMENKS LNIFNSRLVL ASEESATDAT
FERIEGVIGH EYFHNYTGNR VTCRDWFQLS LKEGLTVFRD HEFTSDLHSR AVKRIADVRY
LRAAQFAEDA SPLAHPVRPE AYQKIDNFYT LTVYEKGSEL IRMYSTLLGK DGFRKGMDLY
FQRHDGQAVT TEDFFQAMSD ANSTNIEKLK RWYSQAGTPA LNAEGTYDAV TKTYALTLTQ
TLPQTNDVKG AADKKLPQLI PVAVGLLGED GKDMVLDGDI KCEGDAEATL DETKTTAVCR
LTEFKQTFTF TNITSKPVPS VLRGFSAPVK LTMTPELTTD ELLFLLANDS DEFNRWEAAQ
KIATSILIRL CKKHNDDKAL KIEDVDVTSD PSWAIYSAAC CLIVKDATAN RLDRAWVEEA
LNFPGPSQLI QDLAPGVNPV NTYRVCKAFA RAFAKESRVE LEAALATCDA EAAGLAYDVD
GPQVSRRALR GYAIRMLGSI GGDDVSSSIA SAYSSAKNMT DTVSALTGLC GHDDSSAAKK
KAFDDFLNKW KDDNNVSCTW LRMVASDAGK GGANAIDEMK RLMASDVYDA KNPNKFYSLI
GGFAGGNIEG FHAADGSGYE FVADVLLQTD AINPQASSRM ASPFTKWRLY DENRQNLMKA
QLERLLAQKL SPNLFEIISK AIKG