Gene OSTLU_23972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_23972 
Symbol 
ID4999811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp1035865 
End bp1037760 
Gene Length1896 bp 
Protein Length629 aa 
Translation table 
GC content61% 
IMG OID640415232 
Productpredicted protein 
Protein accessionXP_001415671 
Protein GI145341138 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGA TGGCGACCCG CGACGCGCGC GCGACGGCGC GACCGACGCG CGACGCGATG 
CTCTGGACGA TCGAAGAGCT CGCGAGCGCG GGCGTCGACG CGACGACGCT GCGAAACCTG
CTGCCGGTGG GACTCGCGGG CGAGGGCGCG CGGGCGGCGC GCGCGCGGAA GCGCGTGCGA
CATTTTTTTC TGCTGAGCGC GTCGATCGCG GCGATCGAAG GGGCGGAGGG AGAGGAGACG
GAGAAGAGTC TGGAGGCGCA TCGAGACGCG ATGGCGCGCG CGAGAGACGC GTGCTTTCAG
GAGGACGACG GCGACGAGGA GGATTTGATC GGCGAGGATA CGGAGGCGTT TCGGGCGTAC
GTGAAGGCGG CGGAACGAGC GATCGAGGTG CTGGGGATGC GGGAGGCGTG CGAAGGGCTC
GATGAGCGCC GTTCGGCGCG CGACGAGACG CCGCGAGACG CGGTGCGACG GGTGGTCGAG
GAGTGGAAGG ACGCTTTGCT CGGCGTCCAC GGTCGGGACG TGGACGAGCA CTTGGATCGC
ATCGAGGACG CGTGGACGCG CAAGGCTGCG GATCGTTATC ACAGATTGTG CGAGGTTGAG
GAGTGGGACG AGCCGTCGCG CGCGTTGGTG AAAAAGGCGC TCGAGCTCTT GCGCGCCGAG
CTTTGCGGGA AAGCGACGAC GTTGACGACG GTGCAGGAGC ACTTGGCGAA CTCGTATTAC
GTGCCTTCGC GCGCAAAGCC AGGGTTCACG CTGGGTAAGC GTGATGATGG CGTGCGCGAG
TGGGTTGACG TCGGCGCGCG TGGCGTGAAG CGCCGAGCCG AGACGAACGC GGACGCTTTC
GTCTCGTCGC CATCGACCAA GCGAGTTCAT TCAGAAGAAA CATCTCCGGC CAAACCTCGT
TCGTCTCCGG GACGATTAGG CGCCTTGCTG TCGAAGACTA TTTCGTGGGT GCGCAAATCG
GTGGACGCGC CAGCGTTCAT CAAATCACCC TTCGGCAAAT CACCGCTCGG CAAGTCATCG
CGGATCACGG CGTCGCAAAC CATCGACGAC GTAGAAGAAA GACGTACAGA CGAAGTAGAA
ACTCTTGAAG ACGACACGCC CCCCAGCGAG CACGAATCAG AAGAAGAAAT AATGCCGACG
CAAGTGCCTA CGGGCTCGTA TACTGACGAA GATGACGAAG ACGAAGACGA AGATGAAGAC
AAAGACAAAG ACCCGGAGCC ATCTCCGACG CAAGTGCATA CGGGCTCGTA TACTGACGAA
GATGACGAAG ACGACGACGA AGATGAAGAT GAAGACCCGG AGCCATCTCC GGCGCCGACG
GAGCCGTCTC CGGCGCCGAC GCAGACAATA CAACCGGCTC CGCTCAAACG GCAAGGAAAA
GTGTATCCGA AGGCACAACG ACTGGTGTCG GTGAGAGCAG CACACGCGCG CTCACCTTTA
ACCGGCTCCG ACGACGAGTA CGACGAAATC GAAATCGAGG CGACGCCTGG CAATTACCTC
GTGCCCCGCG TCAATCGACT CCAGCCAGTG AAGACCAGTG TCAAGCAATC CCCAACTCGT
AAGAGAAAAT ACGAGAGACA AACAACAAGA CGCGCGCCTG GACGCCCGAA GAACTGGACG
CCCGAGGAAG AGACCGCCCT GATCGAGGGC GTGGAAAAGT TTGGCAGTGG CAAGTGGAAA
ACGATTTTAG CAGACGACGC GCGCGGTAAG AACGTTTTCG CCGCCAACGC CCGGACAAAC
GTCGATTTGG CGAAAAAATG GTACCATCTA CGCCCATCTC ATTTGAGCAA CATGTGGCGA
CAGCACGAGC AAGATCAAGA AATAGTGGCA CGCCAAGAGA AACCTAAGTT GGATTACATT
ATCGACGCAA TCCTAGAGGG CAGTCATTGA CTTCGA
 
Protein sequence
MATMATRDAR ATARPTRDAM LWTIEELASA GVDATTLRNL LPVGLAGEGA RAARARKRVR 
HFFLLSASIA AIEGAEGEET EKSLEAHRDA MARARDACFQ EDDGDEEDLI GEDTEAFRAY
VKAAERAIEV LGMREACEGL DERRSARDET PRDAVRRVVE EWKDALLGVH GRDVDEHLDR
IEDAWTRKAA DRYHRLCEVE EWDEPSRALV KKALELLRAE LCGKATTLTT VQEHLANSYY
VPSRAKPGFT LGKRDDGVRE WVDVGARGVK RRAETNADAF VSSPSTKRVH SEETSPAKPR
SSPGRLGALL SKTISWVRKS VDAPAFIKSP FGKSPLGKSS RITASQTIDD VEERRTDEVE
TLEDDTPPSE HESEEEIMPT QVPTGSYTDE DDEDEDEDED KDKDPEPSPT QVHTGSYTDE
DDEDDDEDED EDPEPSPAPT EPSPAPTQTI QPAPLKRQGK VYPKAQRLVS VRAAHARSPL
TGSDDEYDEI EIEATPGNYL VPRVNRLQPV KTSVKQSPTR KRKYERQTTR RAPGRPKNWT
PEEETALIEG VEKFGSGKWK TILADDARGK NVFAANARTN VDLAKKWYHL RPSHLSNMWR
QHEQDQEIVA RQEKPKLDYI IDAILEGSH