Gene OSTLU_17955 details


Gene Information

Locus tag: OSTLU_17955
Symbol:
ID: 5005226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 656
End bp: 2263
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table:
GC content: 60%
IMG OID: 640420647
Product: predicted protein
Protein accession: XP_001421364
Protein GI: 145354168
COG category:
COG ID:
TIGRFAM ID:
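
The coordinate and length fields in this record are related by simple arithmetic. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon (the record states neither, and the function names are illustrative only), the listed values can be cross-checked in Python:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(656, 2263)      # Start bp and End bp from the record
residues = protein_length(length_bp)
print(length_bp, residues)
```

Under these assumptions the results agree with the Gene Length (1608 bp) and Protein Length (535 aa) fields.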


Plasmid Coverage information

Num covering plasmid clones: 93
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCGA ACGACCCTGG TGGGCGAGAG CGTTCGAAGC GTCGATCGCG ATTATTCGGC 
GACGAGGATG TCGACGTCGA TGCCGTCCCG ATCGAGGTGA CGACGGCTAG CGACGCTGCG
CACGAGGAGA CGACGGCCGG CGAAGCTGCA CACGATGATG GTGTTCTCGA TGATGACGAA
ATCGGCGGAG TCGTCGATGA TGCTGTCGCA GAGGGTGAAG CGAACGTCAT GCACGACGCG
ATTGATACGG ATGACGACGT TCGCGAGGCT TCATCGTCCT CCGCGGTCGT CTTGGATGAC
GACGCCCGTT CAGACTCGGC GCGAGAGTGT TCGTCCGTCT CGGCGTCGGC GAGCGGAACC
GAAGCCGAGC GCTTGCGATT CGGAACGTCG TTGTCGCAGG TCTACCGATT CAAAGATGGC
GAGTACGTCG CCGTGGGCGC GGCCGGATTG GCTATTTTAG ACTGCGCGTC GGCGAGCAAG
CGAAGTTTGC TCGTTTACGA CGTCGAAAAA CGACCGCTCG TACGACGAGT CATCGACGCG
CGTCTGACGT GCGAGCTGCA GAGAGATAAT TACGTGACGA TTCATCACGA CGACGCGATT
TTATTCAGCG CTTTAACGCG TGACGAGAGC GATTGGCTGG CGGTGGCGGC GCAAATCACC
ATCGCGCAGT TTGTCGCGCG TCGCGAGAGC CTCGAGGCAA TCGTCTTAGA TATTAGCACA
GAGAAATCTT CTTCAAGCGG TGGTGCGACT TTGGAAAAGG GTGATAGCGT GCAGCTGAAC
TATGAAGCCG TCCGAGGTGG TTTACCGCCG GCGAGCGAGC CGTATGACTT TTTGGCGTTG
CAGTTGTTCG ACGAAACGTC GAAGCGCGTC GCGGGTTTGA AAGTGAAACG AATCGGCGAC
GACGATTCGG GCTTACCCGA AGCGCTCGCA CGCGGTATCA TCGGTTGTCG AAAGGACTGT
CGACGACTGA TCCTCGCGCC CGAGACTGAA ACAGACTTTG TGCTCTACGA CGTCACGGTT
TTGCGAATGA AGAAGCCTTC GGCTAAAGAT GGAAAAGAGG CAGCTGTTTC AGGTGCGAAG
AGCGTTGACG AACACCTCAC GGACGCAACC GCGATGAACG AACGAACGGC GCGAGTCGCG
CATTCGAGTG CCGAGGCCAC TGACGCAGCA AGCGCCGGAA CCACCACAAC CACCGGAACC
ACCGAGGACT CCGAACGCCG CGCAGTTCGT CGGAACGCCC GCCGCCGCCG CAATGGTGGA
TGTACAACCA GCCGCCCGCG GCGATGCCGT ACGCGATGCC CGCCGATGCC TACAGCGTTG
AACACACCCG ACCAGACGCT TGTATCCGAC GTCCGGCGTG CCGTCGCGTC GCTCGCCGAG
GATGTTGCGT CGCTCGCCGT ACGTACCCGT ACCGGTGGCG CCTGGATTCC TCCGAAGCCC
GAAGGCGAGT TCAAGCACGC CATCGAGGCG CTTCACAAGG CGCGACGCGT CGTCATGCTT
CCTAAAATTG AAGCGGATGC GCTGTCGATC GGCGACGTGA AGCAGCTGAC GGCTGAGGCC
GAACGACTGG GTGAAATGCA TGACGAACTT AGAGCAATCA GAAAGTAG
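
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCCGCGA ACGACCCTGG TGGGCGAGAG CGTTCGAAGC GTCGATCGCG ATTATTCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Running the same functions on the full 1608 bp sequence gives a value that can be compared with the GC content field of the record.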
 
Protein sequence
MPANDPGGRE RSKRRSRLFG DEDVDVDAVP IEVTTASDAA HEETTAGEAA HDDGVLDDDE 
IGGVVDDAVA EGEANVMHDA IDTDDDVREA SSSSAVVLDD DARSDSAREC SSVSASASGT
EAERLRFGTS LSQVYRFKDG EYVAVGAAGL AILDCASASK RSLLVYDVEK RPLVRRVIDA
RLTCELQRDN YVTIHHDDAI LFSALTRDES DWLAVAAQIT IAQFVARRES LEAIVLDIST
EKSSSSGGAT LEKGDSVQLN YEAVRGGLPP ASEPYDFLAL QLFDETSKRV AGLKVKRIGD
DDSGLPEALA RGIIGCRKDC RRLILAPETE TDFVLYDVTV LRMKKPSAKD GKEAAVSGAK
SVDEHLTDAT AMNERTARVA HSSAEATDAA SAGTTTTTGT TEDSERRAVR RNARRRRNGG
CTTSRPRRCR TRCPPMPTAL NTPDQTLVSD VRRAVASLAE DVASLAVRTR TGGAWIPPKP
EGEFKHAIEA LHKARRVVML PKIEADALSI GDVKQLTAEA ERLGEMHDEL RAIRK
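
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard genetic code, because the Translation table field of this record is empty; `translate` is an illustrative helper, not a database function.

```python
from itertools import product

# Standard genetic code (an assumption; the record lists no translation table).
# Codons are enumerated with bases in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
print(translate("ATGCCCGCGA ACGACCCTGG TGGGCGAGAG CGTTCGAAGC GTCGATCGCG ATTATTCGGC"))
print(translate("GAACGACTGG GTGAAATGCA TGACGAACTT AGAGCAATCA GAAAGTAG"))
```

The two calls yield `MPANDPGGRERSKRRSRLFG` and `ERLGEMHDELRAIRK`, which match the first 20 and last 15 residues of the protein sequence listed in the record. The last gene line starts in frame because the 26 preceding lines contain 1560 bases, a multiple of 3.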