Gene OSTLU_4703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4703 
Symbol 
ID5002034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp242270 
End bp243388 
Gene Length1119 bp 
Protein Length373 aa 
Translation table 
GC content56% 
IMG OID640417455 
Productpredicted protein 
Protein accessionXP_001417961 
Protein GI145346986 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.629805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0151489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GAACGCTACG ACGCCGTCGT CGTCGGAGCC GGTGTCATCG GACTGGCGTG CGCGCGCGCC 
TTGTCTTTGC GTGGTATGCG CGTGTGCGTC ATCGATAAAG CGGAGAGCAT CGGCGCGGAG
ACGAGCTCAA AGAACTCGGA GGTGTTACAC GCGGGGATGC ATTACGTCCC GGGAAGCGCG
AAGGCAAAGT TTTGCGTCGA AGGGCGGCGA AAGATTGTGA AATACTGTGA GAAGAAAGAT
GTAAAGTGGG AAAACATAGG TAAACTCATC GTCGCGCGGG ATGAGAGTCA GATGGACGCG
CTCGAGGCGA TGATTACGAG AGCGCAGATG AACGACGTCG ACGACTTGGC GTTGCTCACG
TCGGAGCAAC TCACGGCGTA CGAGAAAAAT GTGCGAGGTT ACGCTGGGGT TTTGTCGCCT
TCGACGAGTA TCGTCGACGT TCAAGAGCTC ATGGAGTCGT TTCGTCGCGA TTGCGTGCGA
GATGGGCGAA CAACGGTGGC GCTCGGGGAT GAAGTGGTTG AGATCACGCG CGGGCTGAGC
AAAACGTTTC AGGTGAAGAC AAAGTCGCGT CGCGTAATTT CTACGCCGCG CTTAGTGAAC
GCCGCCGGGC TATACGCACA CCGGTTGTGC GATCGGTTGT TAGATGTTTA CGACATCTCC
GTGACGCCGC CACCGCCTTT GTATTTCGCG CGAGGGATGT ATTGTGAGCT GAAAAAAGGC
TACTCGGCGC CTTTTCAGCG GCTCGTGTAT CCTTTGCCGA GAGAAGGAGG TCTGGGTGTG
CACTTCACTC GCGATGTTTA CGACAAGTGC AAGTTTGGTC CTGACATTGA ATGGATAGAC
GACATCGATT ACACGATGAA CCCAGCGCGC GTGCGTTCGT TTTACGAGGC GATTCGCGAG
TACTGGCCTG GTTTGCAAGA CGGCGCGCTT CGACCGGCGT TCACGGGTAT CCGACCTAAG
CTCATCAACG AAGCAGGTGA CACTGATGAG CCCGGCGCGA CGACTGACTT TGTATTTCAA
ACTGAAAGTC AGCACGGTGC GGTTGGTTTG GTGCATCTGT TTGGCTTTGA GTCACCCGGC
TTGACGTCGA GCCTCGCGGT GGCAGAATAC GTCGCCGAT
 
Protein sequence
ERYDAVVVGA GVIGLACARA LSLRGMRVCV IDKAESIGAE TSSKNSEVLH AGMHYVPGSA 
KAKFCVEGRR KIVKYCEKKD VKWENIGKLI VARDESQMDA LEAMITRAQM NDVDDLALLT
SEQLTAYEKN VRGYAGVLSP STSIVDVQEL MESFRRDCVR DGRTTVALGD EVVEITRGLS
KTFQVKTKSR RVISTPRLVN AAGLYAHRLC DRLLDVYDIS VTPPPPLYFA RGMYCELKKG
YSAPFQRLVY PLPREGGLGV HFTRDVYDKC KFGPDIEWID DIDYTMNPAR VRSFYEAIRE
YWPGLQDGAL RPAFTGIRPK LINEAGDTDE PGATTDFVFQ TESQHGAVGL VHLFGFESPG
LTSSLAVAEY VAD