Gene OSTLU_31150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31150 
Symbol 
ID5001506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp392750 
End bp396115 
Gene Length3366 bp 
Protein Length1121 aa 
Translation table 
GC content54% 
IMG OID640416927 
Productpredicted protein 
Protein accessionXP_001417492 
Protein GI145346014 
COG category[S] Function unknown 
COG ID[COG5594] Uncharacterized integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.034455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGC CGACGCAAGA AAACTACGGC GCGACGCCGA CGAGCGAGCT CACGAAGGTG 
CAACCGCCGC GAACGCGAAC GCCGTGGGAC TCGGCGGCGT GGATCACGGC GACGCTGTTG
GACGAGCAAG GCCAACGGTG TCCGTGCCCG CCGGGATGCG CCGGGGATTG CGCGGTGCGC
TCGCATTGGT TGTTTAATGA GAAGGATAAC GACCTGCGAA CGATTCACAC GCTGCAAAGC
ACGTCGAATC CGCCGATGGT GAGCTTGAAC GCGGGTAACG TGGCGCCGTG GGGTAAGGTG
GCGCAGGTGA ACGCGTCGAG TTATCGCGTC ATGGGTGGGT GTCCGCCGTC GGATCAGATG
GACTGCATCG CGTGCGATAC TGGTGCTGGA TACTATCCGG TGGTGTACGC TGGCGACTTT
TCTATCGAAG GTATCAAGCA AACGGGCGCG GGTATTTCGT GCTATTATCT GGAGATTTGT
GACAACGACG ACTCCTGCGA TACGCAGATC GCAATGTCCG AAGTGGGGAC GCTGGTAGGG
TCTTACGCGG GTCTGTTTTC ATTCCTTCTC ATCCTCTTTG TTATGTTGCG TAAAATCGTA
TGGCTCCGGA TGGCGATCGA CTCGGCGGCG TGGGTTGAAA AGGGCAACCC CAAGTACCCG
CCGCGAGAAA TTCAGATTCC CAAACCGAGA GACGACAATG CATGGTCGTG GCTTTATGAC
GCTTATCACC GCGATAATAA TTGGATGAAA GAGTTCACCA CCCCAGACGA GTACATGCTC
GTCCGCTGGT TCAAGTTATC CAGTCGCTTC TTCTTCACCG CCGGCGCCGT ATGTTGTCCA
ATTCTCATGA GTTTGTACGC CGCCGATACC GTACCAAGCG CAGATGCGGG CTCGAAGATA
CTTACAACTC TCGAAAAGAG CGGTATCGCG AAATACACAC TGCTCAATGC TCGAACCGAA
TCTTCTTTCG CGGCCGCCAT GGCTTTCACC TGGATCACAA GTCTCTTCCT GATTTCTTTG
ATTCGCGTCG AGTCGCGTAA GTACGTCCAC ATGATGTGGA CGGTGGATCC CGATAAGACT
GGTATTAAAG CGAACGCAAT CCTCGTCAAG GATATGCCCT TGTTAACCAC GGCTCCGGCA
CCGAAAAAGT TTGAACAGCT TAACACCGGC AGCGTCAAAG ATATTCTTAA AGTCAAAAAG
AGTGTGCGAG GTTCTGTGAA GAAACTCGAC AAAATCTTCG ACGACGAAGA AGTCGGCTGC
CTGGGGAGAT TTAAGCTTTT ACTCAACGAT GGCACCGTGC AATCTTCCGG TGATTCCGCC
AAGCTTCGGC TTTTATACAA GGAGGAGTCG ATGAACCTTG TCATCAGCAA GTTTGAGAGC
GTCCTCGGCA AAGACTGCAT CGCTTTCAAA ATGCTCGCCT CGGATACTCG AAAGCTTGAT
AGCGCCGCGA AAGCCTGGGC GAATGCCCGC GAGCACGTCA CACAGAACAT GCAAGCGATC
GCCGACTTAC AAGAGACCGA AAAGACTGGA GGACTCAGCT GGGGGGAATC GACTCAACTT
GCGAAAGCGT TGAAAGATAT GGATTCGTTG AAGAGAGCCG AGGCGCAACG CTTCGATGTT
TTCATCTCCA CTCGCGATGA ATACATCAAC AACCACCGAC CCGCGTGTAG TGCGGTCGTG
GTATTTGCGA GACAAATGGA CGCCGTCATA GCCTCTCAAA TTCAAATTGA CGACGTTCCC
GGGCAGTGGG TCACCGAACC GGCGCCAGGC AACTCTGATG TCGTGTGGCA CAATCTCTCC
TTGACATCCG TCGAACGCGC AAAAAAGACG ACTCAGGCTT TTTTTATCGC CGTGGCCATT
TCTCTATTCT TCATGTACCC TGTCAATATC GCTGTCGCGG CTGTCGCCGA CGTCAAGGAC
TCTCTCGTGA GCGTGTTTGG CGAGTCTATT TACAACATCA TCCTGTCAAT TGTGTTGACG
GTGTTCCTCG TCGTCGGTCA CATTTTAAGC TTGGTCGTGA GTCGGCAAAC TGGCTACGTC
TCGGTCAGCG CTATGGACTC ATTCGGTGCG TCTATGTACT TTTGGCTTCT CATTCTCAAC
TTGGTCTTTT CCAACCTGAA CACGACGCCC CTGTGGAAGG ACGTCCTCGT GTGGATGCAA
AAGCCGCACT TGTTCACGTA CCAGTTCATC TTGAGGTTGA TGAATACCAG TACTTTCTTC
CTCCAGTTCG TCATGCTTCG TACCGCGACT TCACCGGTTC TCGAGTTGAT CCATCCTCCG
GTACTCCTCG GCTTCGTCAC AAAGTGCTTG CTATACCGCA GTCGAGCGCG CACGTGGCCG
GCTTTCGCAA AGAGACTCAT CTGGGCTCAA CCGACGCCCA CGCCGAGCCA TCGGGTTCCC
GCGCAAACGA TGTTGGTGTT CTTCATCGGC ATAATTTACA CCGTCGTCGC ACCAGTTTTA
CTTCCGGTGT GCGGCGTCTT CTTCGGTTTT TTCTACATTT TCTGGAAGCA CAACATGGTC
TATCACTACA TCCAACAGTA CTCCGCGGGG ACGTCCATGT GGGCGTGGCT CGTCGGAAAG
ATGTATTTCA GCCTCGTGTT CAGCCAAATC ATGGTCGCTT TCGGTCTTCC GACGCTCGGC
TTCAACACGA TGAAGTATCG TGTCTTCATC ATACCTCTCG TTTTATTCAC CCTTCTCGAA
TGGTCGCGCG TGAATACCAT CCTCAACGAT GCGTTCAGGG TGCCGGTACA CGCGGCTGGC
GCAGCGTTGA AGCGCAGAAG CGGCAAGCAT GAAGAAGACT CGGACGACGA ATACTTCGCT
TCGAGATCGG CTTCGAGATC GGATGTTCCA GCGCTCGAGG AAGACGCTCG GCAAGAAATC
TTCGTGAGCA CGACGCGAAT CGGTGATTCA AGGAAGAGGA AGGTTATGCG CGGCATCGTG
CCGGTGGAAG AAATGAAAAA GAGCACACGC CGGCAAAAGA ATATTCTCGA AAACACTCGA
GAGCAAGGAA AAGCTGAGAT CGAGTACAAA GTGAAGAAGG GAATCTGGCA AACGTACGCG
CCGAGCGTGT TGTGGCCACT CGCCGCCGAG AAGTCTGCCG GTTCCATTTT CTTGCGTCGC
TGGAAACAAA TCAAGGCGCG AAAGCAAGTC GAAGCAGACA TGTTGGCCGC CGTGGCGCAC
TTACCCGACG ACGACCAGAG AAAGAAGGCA GTCATCAAAG ATTTGGCGCT CAGGAAACAA
GTGCGCGACT CCGCCGCGGA AGCCATTCTT CGAAAATCCA CCTCGCCGTA CGTGAAGCAA
CTTCGACAGA AGCGCGCCGC CGAGGAAGCC GAGAAAGCCG ACACTCGAGC TGGCTCGACC
AAGTAG
 
Protein sequence
MATPTQENYG ATPTSELTKV QPPRTRTPWD SAAWITATLL DEQGQRCPCP PGCAGDCAVR 
SHWLFNEKDN DLRTIHTLQS TSNPPMVSLN AGNVAPWGKV AQVNASSYRV MGGCPPSDQM
DCIACDTGAG YYPVVYAGDF SIEGIKQTGA GISCYYLEIC DNDDSCDTQI AMSEVGTLVG
SYAGLFSFLL ILFVMLRKIV WLRMAIDSAA WVEKGNPKYP PREIQIPKPR DDNAWSWLYD
AYHRDNNWMK EFTTPDEYML VRWFKLSSRF FFTAGAVCCP ILMSLYAADT VPSADAGSKI
LTTLEKSGIA KYTLLNARTE SSFAAAMAFT WITSLFLISL IRVESRKYVH MMWTVDPDKT
GIKANAILVK DMPLLTTAPA PKKFEQLNTG SVKDILKVKK SVRGSVKKLD KIFDDEEVGC
LGRFKLLLND GTVQSSGDSA KLRLLYKEES MNLVISKFES VLGKDCIAFK MLASDTRKLD
SAAKAWANAR EHVTQNMQAI ADLQETEKTG GLSWGESTQL AKALKDMDSL KRAEAQRFDV
FISTRDEYIN NHRPACSAVV VFARQMDAVI ASQIQIDDVP GQWVTEPAPG NSDVVWHNLS
LTSVERAKKT TQAFFIAVAI SLFFMYPVNI AVAAVADVKD SLVSVFGESI YNIILSIVLT
VFLVVGHILS LVVSRQTGYV SVSAMDSFGA SMYFWLLILN LVFSNLNTTP LWKDVLVWMQ
KPHLFTYQFI LRLMNTSTFF LQFVMLRTAT SPVLELIHPP VLLGFVTKCL LYRSRARTWP
AFAKRLIWAQ PTPTPSHRVP AQTMLVFFIG IIYTVVAPVL LPVCGVFFGF FYIFWKHNMV
YHYIQQYSAG TSMWAWLVGK MYFSLVFSQI MVAFGLPTLG FNTMKYRVFI IPLVLFTLLE
WSRVNTILND AFRVPVHAAG AALKRRSGKH EEDSDDEYFA SRSASRSDVP ALEEDARQEI
FVSTTRIGDS RKRKVMRGIV PVEEMKKSTR RQKNILENTR EQGKAEIEYK VKKGIWQTYA
PSVLWPLAAE KSAGSIFLRR WKQIKARKQV EADMLAAVAH LPDDDQRKKA VIKDLALRKQ
VRDSAAEAIL RKSTSPYVKQ LRQKRAAEEA EKADTRAGST K