Gene OSTLU_41444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41444 
Symbol 
ID5002517 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp699525 
End bp701075 
Gene Length1551 bp 
Protein Length311 aa 
Translation table 
GC content62% 
IMG OID640417938 
Productpredicted protein 
Protein accessionXP_001418551 
Protein GI145348215 
COG category[R] General function prediction only 
COG ID[COG1439] Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGT GGTCCAAAAT CGTGCGCGAC GATCCCGCAC CGGCGCCGGT CGACGCCGCC 
GAAGCGTCCG CGGCGGCGGC GGCTGAGGCG ACGCTCGAGA GCAAACTGCG CGACGCCGCG
ACGCTGAAGG CGGTCGTCGA CGCCAACGCC GTGTTCAAGG GCTACGCGCT GACCGACCCG
AACGTGCTGT GCGTGACGAT CGCGGAGGTG CTGGATGAGA TTCGGGACGC CAAGGGACGA
GACGCGGTGG CGGCGAGCGC GGGCGCGCTC GAAGTCGCCG AACCGAGCGA AGAAGCGATC
GAAGCGGTGA AACGGTTCGC GAGTAAGACC GGAGACGTGC ACGCGCTGTC GAGGGTAGAC
ATGAAGCTCA TCGCGCTGGC GTATGACTTG GAGGGAAGAT GTCACGGGGT GGAACATTTG
AGGACGGAGC CGGCGCCGCC GAGGACGCAC GCTAAAAAGA CGAATCGGTT CGAGAAGCAG
CCGGGATGGG ATTACGTGCC GAACGCGGAC GATTGGGCGG AGTTGGACGA TATGAATAAG
CTCCAGGAGG AAGCCGAACG CGAGATGCGG GAAAAGATGG CCAAGGTTTC GATCGAGCAG
GCGGCGGAGG AGGAATTGCG AAAGGAGCGC GAGGCGGAGA CGGCGGTGGC GAGGGAACGA
CGCGCCGCCG AGGAGGAACG CGTGCGAGCG TTGAAGGAGA AGGCTGCGGA GGCGTTGGTG
GCGCAGGAAC ATATAGTTAA GGACGTCGAG GGCGATACCG ACGAATGGGC GCCGGTCATT
TGTCGAACGA CGCGCGTGCG TCGCCAAAAG CGAGAGGAGC GCGCGCGTTT AGCGGCGGAG
GAAGCCGAAC GTCGAGCGAC GGCGAACGCG GAAGTTGAGG GAGCCACGCC CGAGGAACTC
GAAGAGCAAA GCAAACGAGC GACGGACTTC TTCACCTCTC GTGGTGAAAT CGAGGCGAAT
GTGGAGGAGG AAGAGGACGA CGACGCGTCC ACTCGTAGCG ACGACGACGA GGAAGTCGAA
CTCGAATCGT GCGTCTCCTC CGTAACGGCT GATTACGCCA TGCAAAACGT CATTCTGCAG
ATGGGCCTTA AACTCGTCGC GCCCGACGGC ATGCGCATCG AGCACCTACG GCGATGGGTT
CTTCGCTGCC ACGCGTGCAA CGAAATCACC CGCAATCTCA CTCGTATGTT TTGTCCCAAG
TGCGGCAATC AAACGTTGCA AAAAGTCGAG CACACCGTCA CTCGCGACGG CGTCGAACAA
TTCGGCGTTC GTAAAAAGTT TGTTTTACGC GGCAGCAAGT ACACCTTGCC CGCGCCCAAG
GGTGGTCGCA ACGCAAAGAA AATAATCTTA CGCGAAGACC AACTCATGAG CGTGCGGCTG
ACTAAGAAAC AAGTAGGCGA AGACGTCTTC GCCGCCGAGT ACAACGAGGA ATCGTACGCC
GACGCCAAGC ACTTCGCCAG TCAGAAGACG GCGTACGAAA TCGGCGGCGG CGACGTCCGT
CGCAACCCGA ACGAACGTCG TCACGTCGCC ACGAACAGGC GTCGAAAGTA G
 
Protein sequence
MSAWSKIVRD DPAPAPVDAA EASAAAAAEA TLESKLRDAA TLKAVVDANA VFKGYALTDP 
NVLCVTIAEV LDEIRDAKGR DAVAASAGAL EVAEPSEEAI EAVKRFASKT GDVHALSRVD
MKLIALAYDL EGRFELESCV SSVTADYAMQ NVILQMGLKL VAPDGMRIEH LRRWVLRCHA
CNEITRNLTR MFCPKCGNQT LQKVEHTVTR DGVEQFGVRK KFVLRGSKYT LPAPKGGRNA
KKIILREDQL MSVRLTKKQV GEDVFAAEYN EESYADAKHF ASQKTAYEIG GGDVRRNPNE
RRHVATNRRR K