Gene OSTLU_25777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25777 
Symbol 
ID5006390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009372 
Strand
Start bp74986 
End bp78287 
Gene Length3302 bp 
Protein Length841 aa 
Translation table 
GC content55% 
IMG OID640421811 
Productpredicted protein 
Protein accessionXP_001422333 
Protein GI145356221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.203689 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00367561 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTTCAACGCG CGACGATGGT GAGACGTGGC GTTGCGATCG CGATCGTTGG TGTCGCCGCC 
GTGGTGGTCG CCCTTGGCAC CGGCCTCGGC ATCGGGCTCA GGGATGACGA CGCGACCCCG
GCCCCGGCCC CGACCCCGAC CACCGTCAGC GGGCTGTCGG GCGCGGCGAA GACGAACGCG
CTCAAGGCGG CGTTGACGCC GCGTAAAGTC ACCATGATCC CGTCAGAGGG TGTTTCGACG
ACGTCTTCTG GTTCGTCTCG AATCGATGGT CGTTCGCTCG TCGAGGTGAC GCCTCTCGAA
GTGTCTTCGT TCTCTTCGAC GTCCGACTAC GCCACGGCGC CGCCGGCGGA GACGTTCGTC
GACGCTGTAA GTACCGAAAT TTTCCAGATA CCAAACATGG TTCTTTGCTA CGTCGCCTCG
GTGAACTGGA CGGCGAACCT GAACACTGGC CCATACATCG CTGAGATCGA TCCTTTCCAC
TGTGACAACA GCGACGGCGA CCGAATCAAG GGTAAGGTTA CGTACCGTTG GGTCGTCAAC
GCCACGGGCC CAGACATCGA CGACGCGAGC GATACCCGTG ATTTCAAGAC GAACGTCTGG
GTAGCCCTCA GCGATACCCC AGAAATGCCG AACATCGACA CGGAAATGAT CGTCAAGCGC
GATGACGTCG CCATCAAGTC TGACAAAATC TTGGTGAAAA ACTTTACCTT TACGTACCAA
TCGCCGACTG GCTCGACCGC GTCCCAACGC ATCAAAGGTG TGGTTCGTCG CGAGTGCGAA
ACCACCGACG CGGATTCGGA TTGCACGGCG ACCGGGGTCA CGTGGTGGGA AGAGGTGACA
CGAGGCAGCA ATTCATTCAC CGCGGGTGCT AAATCGCGCG CAGAAAACGA CATCACGAAG
GCCTCTTTCC AGACCATCGA CTTTGATAAT GGTTCGACAC AAGCGGGTCA ATTGGTGACT
ACCGCTACGC TCGTCAAGAC GAAGGCGTAC GGGCAGTCAT TCTGCGATGA ACTCGAGAAC
AGATCGTTCT TCGGTGAGCG CTACGGCGCG TACGACAGCA ACGGGGCGAA GGTGAACCTT
CAAAGCTACG TGCATCTTCA GGCTACAGGC AGCGACGGCA AGACGTATAA TGCTGATCTG
CACTATCCGG GCAATCTCTA CATTTCCGAT TACGACTACA TTTCCGACAC CGCGCTCAGT
ACCGCCGAGA AGACTATCGC TCAAACCGCG TTCACCGACG GGAACGTAGT AGAGGAAATT
GTCGATTGGG ATGATCCGAT ACTGAACGTG AACAAGAGGC TCAAAGTCTC ACGCGGGGTG
TTGTACAAGT TCACGGCGAC GGTGCGAAGC GCAAGCGACT ACCAAGGGGC GACGATCACA
GGCTGGTTCT GGGATGACAC TAACGCTGCA GAAGTTGAGG TCAAGTTCAG GTATGACTCC
GACGCGAACG CACTCGTGTT GTCGTCGAAG CGAACTTGGA ACGGTGGATC GCTGAACACG
AACTCAATCA CGACTCCGCG TGCGCTCACG ATGGCTGACA TTGACAACCA AATCATACGC
TGCTGGGGAA GCATCTTCGG CCGAGGCTCT GCGAGTTTGC TTAACCTGAC GCACTTCAAA
GTGTCTGACG CGCAAACCGT CCCCCCTGGG TCGCTCAGCA GCCACCTCGC ACTCACGTGC
GGCGAGCGGT GTATCGACCA GACGAAGCTA TCGAGTGCGA GCAGTCACGA CAGCCAGTTC
TACGGCTCGC CGAACGGATA CAACGACGCT CGAAGCACTT ATAAAAAGTA CGTCTTCGAC
AAAGATACCG GCTCGTTGAA GGAAGACGTT TCCGGCACCC TCGGCGCTGA AGTCGTAATG
GACCCAACCA ACTCATTCTT TGACTCGGGA GAAGTTCATA TGGTTCTCTT CGAAGCTACA
CCTGCGAACC TCGACACTTT GAGTTGCACC GCCACTACCG ACGTTTGCGA GGAACCCAAC
AAATTGGATG TGCACTACGA GTGGAGCTCA AGCAATTGGG AGGGCATGGC GTGGCTCGAA
GATCCTTCGG ACTCCGCGTC AAAGAAGTTC ATGGATGCGC CTCTCCAGCT TCAAGGCAAG
ATACCCACGG ACGCCGTGTT GCTGCGCTCT CCATCCGGTA CCGACTACTC CGGGGTGAAT
CTCAACGTGC GTTACGAAGG CGGTTGGTTG GGCGACGCGC CGTTCATTTG CTTCAATCCG
ATGACTGGTT CGCGCGCGGC GCCCGAGGTC GACCAGTATG GCAACGAGGA GTGCGACGAC
AACAATGGTT ACCATCGCCG TCCGGATGTC CTGATCCCCG ATGGCACCAC GTTCAAACAA
CCATCGACTG GCGACCGGTA CGTCTTGAAA CTCGAAAGTG GTGTCGAAAT GCTTGCTCCC
GCCGATGCGT CGGCGTGCTC CGGGATGTCG TACGACACGA GCATCACGGT GCCCACGGCG
GCGGACTACA CCGCGTTCAC CATGCCGACG AAGCCGTCCA TGACCGGGCT CAGCGTAAAA
GGCACGGATA AGGTTTCCTA ATCAATCAAG TCAACCAGTG CTTGTTTTCA AGATCGTGTC
CGAAACGCCG GCGTTATATC GGCGGGCGCG ACCCATCGAT GCGACTCAAA CGACGGCCAA
ACTTGAGTCT CTCGGCAGCA ATAAAATCAT TTAGCGCCAA CACGCGCGAG TCTTTAATTA
ATAAATATAA ACTTCTCCGT ACCGGTTATC ATGTGGAGCG CACGAGACCT ACGACCGACG
CGTATTCGGA AATGAAGCGT GGCTGCATGC CCAAGTTCAC TCGATGGTTT GAGCATTGCG
TCACTGTTGA ATTCCCTGAG AAGTGGGTTG GAAATAAAAT CCGCAACAGT GACATCTTCA
TCGAATACCA GACGTGGTTG CCAGCGGCTG CGCGTGGACA AGATAGCGCG ACGAAGGTCG
GCAATAAGCT CAAGGACTTC TTCAAGAAGG AAAAGGGCCA CAGGATCCCA ATGGAAGAGG
ACCACCTGAG GCAAGGTAGG GACGAGAAGG GCGTCTATTG GGAAATTGAC CGCGACGGGT
GCTTCGAGTG GCTGAAGAAC AATGGGTACA CGGGGGAGAC GGAGCTCGCG CCGGCGGTCG
TCTGGTGTTC ATACTAATTT TGTGATTCGT ACATTTGCAA TATGAAGACT CAACAAGACA
CCGAAAAGAC CATGGTTTGG ATCCTGACGA AATGACGAAT GACGAACTGT TTTCATAAAG
GTCTACACTG TTGCTTGTCA TAGATCTAAA ATAAAAATAA ACATTAAAAT ATTTTAACAT
TT
 
Protein sequence
MVRRGVAIAI VGVAAVVVAL GTGLGIGLRD DDATPAPAPT PTTVSGLSGA AKTNALKAAL 
TPRKVTMIPS EGVSTTSSGS SRIDGRSLVE VTPLEVSSFS STSDYATAPP AETFVDAVST
EIFQIPNMVL CYVASVNWTA NLNTGPYIAE IDPFHCDNSD GDRIKGKVTY RWVVNATGPD
IDDASDTRDF KTNVWVALSD TPEMPNIDTE MIVKRDDVAI KSDKILVKNF TFTYQSPTGS
TASQRIKGVV RRECETTDAD SDCTATGVTW WEEVTRGSNS FTAGAKSRAE NDITKASFQT
IDFDNGSTQA GQLVTTATLV KTKAYGQSFC DELENRSFFG ERYGAYDSNG AKVNLQSYVH
LQATGSDGKT YNADLHYPGN LYISDYDYIS DTALSTAEKT IAQTAFTDGN VVEEIVDWDD
PILNVNKRLK VSRGVLYKFT ATVRSASDYQ GATITGWFWD DTNAAEVEVK FRYDSDANAL
VLSSKRTWNG GSLNTNSITT PRALTMADID NQIIRCWGSI FGRGSASLLN LTHFKVSDAQ
TVPPGSLSSH LALTCGERCI DQTKLSSASS HDSQFYGSPN GYNDARSTYK KYVFDKDTGS
LKEDVSGTLG AEVVMDPTNS FFDSGEVHMV LFEATPANLD TLSCTATTDV CEEPNKLDVH
YEWSSSNWEG MAWLEDPSDS ASKKFMDAPL QLQGKIPTDA VLLRSPSGTD YSGVNLNVRY
EGGWLGDAPF ICFNPMTGSR AAPEVDQYGN EECDDNNGYH RRPDVLIPDG TTFKQPSTGD
RYVLKLESGV EMLAPADASA CSGMSYDTSI TVPTAADYTA FTMPTKPSMT GLSVKGTDKV
S