Gene OSTLU_31583 details

Gene Information

Locus tag: OSTLU_31583
Symbol:
ID: 5001901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 175457
End bp: 178092
Gene Length: 2636 bp
Protein Length: 851 aa
Translation table:
GC content: 64%
IMG OID: 640417322
Product: predicted protein
Protein accession: XP_001417700
Protein GI: 145346451
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0143437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.733987
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAAC AGACGATGAA GAGGACGAAG AGCGCGAGAG GGGCGCCGCG AGGCGCGCGC 
GCGGCGGCGA AGCGCTCGGA AGACAGCGAA GAGGTCGAGG GCGGCGCGCT GAAGCGCATC
GTCGCGGCTG GTACGGTTGC GTTTTCCGTG CTCGCGGGCG TGGAGGGGGC GATCGAGCCG
GCGCGCGCGG CGGGCTCGGC GACGGAGATC GTGCAACTCG CGCTCGACGC CGTCGACCCC
GTCGAGGACC CTGACGCCAA GGCTGAAGCG CCCGAGCGAG TCAAGGCGGA TACTTCCTCG
CTCGAGGGTG CGCTTAAGGC ACAAGTGCAG TCGCGAAAGT CGACGGTGAA GGAAGCGGGT
AAGAAAGCGG CCAAGGCTGC CGCCGCGCCG GCGAAGAGCG GCGCGCCCGA AGGCGCCATG
TCTCCGAACG CGAAGGATTA CAAGTCTGAA ATCGGCGAAA GCCTCGCTAC TCTGGATTAT
GACGCCATTA TCAAGAAGAC GGACGACTAT TTTGTCTACC GCTACGATCG CGGCATCGAT
GAATCTCAAA TCATCGATCT CGACGACGAA GACGACTCTG CGACGCGCGG TCCGAAGGGC
ACGAAGAAAC GAGTCGCGGT GGCCAAGTCC ACCACCGCCC CTTCGTTCAC CTTGCCGAGC
TTCACCGCGC CGAGCTTCGA TGCGCCGAGC TTCTCGATTC CGACGTTTGA AGTGCCGAGC
ATCGAAATTC CGGCTATTCC GGGTGTAGCC GAACCGGCGA CCAAGAAGGA CACCTCCGCC
GAAGACGCCG CGAAGGCTGA CGCCGCGGCC GCGAAGAAGG CCGAGGCCGA GGCCGAGGCT
GCGAGGAAGG CCGAGGCCGA CGCCGCGACC GCCAAGAAGG CCGAGGCCGA CGCCGCGGCC
GCCAAGAAAG CTGACGCTGA AGCCGCCAAG AAAGCTGACG CCGAAGCCGC CAAGAAGGCC
AAGGCTGACG CTGAAGCCGC CAAGAAGGCA GCGGCGGAAG CCGCCAAGGC TGAAGCTGCC
GCGAAGAAGG CTGAAAGCGC GAAGAAGCCG ATGGCTGCCG CCCCGGCCGC CGGTAGCAGC
GATCTTGGTT TCGATTTCGG CTCTCTCTCT CAATACATGG AATCCGCTCC GGCGGCGCCG
AAGGTTGACA AGAAGGCTGA AGCCGCCGCT AAGAAGGCTG CCAAAAAGGC CGCTGCTGAG
GCCGCGAAGA AGGCTGCCGA GGAAGAAAAG AAGGCCGCCG CTGCCGCGAA GGCCGCTGCG
AAGGCTGCTC CGAAGCCGAT GGCTGCCGCC CCGGCCGCCG GTAGCAGCGA TCTTGGTTTC
GATTTCGGCT CTCTCTCTCA ATACATGGAA TCCGCTCCGG CGACGCCGAA GGCACCGAAG
GCTGACAACG CGTCCGCCGC CGCTGGTCAA AAGGCTGCCG AAAAGATCGC CAAGCAACAG
AAAGAAGCCG CCAAGAAGGC TGAGGCCGCC GCGAAGAAGG CTGAGGCCGC CGCGAAGAAG
GCTCAAGCGC AAGAGGAAGC CGCCGCCGCG CGCGCCGCCG CCAAGGCTGA GATGGCCGCG
AAGAAGGCGG CCGGGAAATC CGCTGAGAAA CCGACTTACA GCAAGCGCAC TGTTGAAAAG
AAGGCGAAGC CGACGTTCAC GAAGTCTGCC AGAGATGGTA AGTTCGCTCC CTTCGCTGGG
ACTTACAAGA CGACCGTCGT CGAGAAGGAG GCCCTCCCGG GCGTGCCCGT CGACTTCGAT
GCCATCGTCG ACGCCCAAGA ACCCAAGGCG GAAGCGATTC TCGCCAAGGC CAACGATAAA
TCTGGCGATT TCTTGAACAT TTCCGGTGAA GCTGGCTTCG CCATCGCGGG TACGATTGCG
TTGGTTTACG AGACGGAGGA CAAGAAGTTC CGCGAGCAAG CGAAGAATGC CAAGATGCCG
GCGCCGACGA AGACGAATGC TCCCTCCGGT GAAAGCACCA CCGAAGGTTG GTTCGATGCA
GCGCTCAAGA AATACATGAA CAAAGATGGT TCCGCCCCGA AGCCAAAGCC CGTCGCCGCC
GCGCCGAAGC CGGTCGCCGC CGCGCCGAAG CCGGCCGTGC CGAAGTCTGA TCCGGTGAAG
AACGCCAAGG AGGCGCAATC GTGGATGGAC AAGTGGTCCG CGAGCAAGCC TAAGCCGGCT
GCCGCCGCGG CGCCGGCGCC GGCGCCTGCC GCGCCGAAGC CCGCGGCGCC GAAGTCTGAT
CCGGTGAAGA ACGCCAAGGA GGCACAGTCG TGGATGGACA ACTGGGAGCG CAAGGTCAAG
CCGACCGCCG CCGCCGCGCC GACGCCGGTC GCCGCGCCGG CGCCGACGCC GGTCGCCGCG
CCGGCGCCGA CGCCGGTCGC CGCGGCGCCG AAGCCCGTCC CGACGCCGAC GGTCTCGACC
ACCACCACTC GCACGGTCAC CTCGGACAAC CTGACGGCCG AGCAACGCGC CGCCGCCGAA
GCCTGGCTCA AGAAGTGGCG CGAGGACGGT CGCCCGACGG ACGAGACCAA ATTCGACGAG
GCCAAGACGT GGTTGAAGCA GCACAACTTC GACTGAGCGA TTACAAACAA CGAGTGAATC
GTTCACATTG ACTGTTCATG AAGACATTAA TAAACAAACG CACACGCACT AAGCTA
 
Protein sequence
MDEQTMKRTK SARGAPRGAR AAAKRSEDSE EVEGGALKRI VAAGTVAFSV LAGVEGAIEP 
ARAAGSATEI VQLALDAVDP VEDPDAKAEA PERVKADTSS LEGALKAQVQ SRKSTVKEAG
KKAAKAAAAP AKSGAPEGAM SPNAKDYKSE IGESLATLDY DAIIKKTDDY FVYRYDRGID
ESQIIDLDDE DDSATRGPKG TKKRVAVAKS TTAPSFTLPS FTAPSFDAPS FSIPTFEVPS
IEIPAIPGVA EPATKKDTSA EDAAKADAAA AKKAEAEAEA ARKAEADAAT AKKAEADAAA
AKKADAEAAK KADAEAAKKA KADAEAAKKA AAEAAKAEAA AKKAESAKKP MAAAPAAGSS
DLGFDFGSLS QYMESAPAAP KVDKKAEAAA KKAAKKAAAE AAKKAAEEEK KAAAAAKAAA
KAAPKPMAAA PAAGSSDLGF DFGSLSQYME SAPATPKAPK ADNASAAAGQ KAAEKIAKQQ
KEAAKKAEAA AKKAEAAAKK AQAQEEAAAA RAAAKAEMAA KKAAGKSAEK PTYSKRTVEK
KAKPTFTKSA RDGKFAPFAG TYKTTVVEKE ALPGVPVDFD AIVDAQEPKA EAILAKANDK
SGDFLNISGE AGFAIAGTIA LVYETEDKKF REQAKNAKMP APTKTNAPSG ESTTEGWFDA
ALKKYMNKDG SAPKPKPVAA APKPVAAAPK PAVPKSDPVK NAKEAQSWMD KWSASKPKPA
AAAAPAPAPA APKPAAPKSD PVKNAKEAQS WMDNWERKVK PTAAAAPTPV AAPAPTPVAA
PAPTPVAAAP KPVPTPTVST TTTRTVTSDN LTAEQRAAAE AWLKKWREDG RPTDETKFDE
AKTWLKQHNF D