Gene OSTLU_18363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18363 
Symbol 
ID5005653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp268487 
End bp270169 
Gene Length1683 bp 
Protein Length560 aa 
Translation table 
GC content58% 
IMG OID640421074 
Productpredicted protein 
Protein accessionXP_001421615 
Protein GI145354698 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0904404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGAGT GGCCGGACGA AGGGTTGACG GTCGAGAGAT TGGTGGAGAC GCTCGTGTCG 
AGAGGGGCGC TTCGCGCGGA TGAGCGCGCG GCGCTCAGGC AGCGCGCGAA ACGCGTCGGC
GAGGATTGGA CGCGGTATGC GCAAGCCGAG GATCGCGCGG TCTTAAAGCT ACGAAGGAAG
TTGGTAAAGG GCGAGGACAT TCGCGCGAAT GCGTTGAAGC TGCCGAAAAT CGACTCGACG
TCGATAAAAG ATTGGACCAA GTCGGATTGG GAACGCTTCT TCAAGGTGGT TTTGGCGTCC
GTCATCGGCG GTGCCGCGTT ATTTGCGAGC GGAGGGACCG CGACACCGGC GCTAGTGGCG
GCAATGCACG GACTCGGGCT CGGTGCTGAA GCGTTTTCAG CGTTCGGTGG TTTGCAATGT
ATGCTCGGCG TCACCGGCGC GTCGCTGTGC GCGCAAAAGA TGGCGAATCG CACCAAGACT
GAGCTTGAAA ACTTCGACCT CATACCGCTT CGCGGCGCGC ATAAATCGTA TGCCATGCAT
ATATTCGTTC CTGGGTTTAC GCGCGACGAC CATGACTTAT TAGGCGCTTG GGGTGCGACG
AACAACCAAT ACGTCTCCGT CGTGCCGGAA TCCCGTTCCG TCGTCCCCGA CCTGGGCATC
GAGTTCACGA GTGGCGCAGA TGGATCGATC ATCGTACAGG CGAAAGACGA TTCAATCGCC
AAGCGTCATG GTGTTGTTTC TGGAAGCACT CTGCTGTCTT ATCGATCGGT TAAAAAACCT
GGCGAGCCGA GTGTCGTGCT CTCCGAACTC GTCGACATGC CGACGTCTGA CGAATTGTCG
CGAGTGCCGC GCCCAATCGA GATTCGACTG CAGCTCCCCG ATCGCGATGA TGAGCTGAAG
AAAGAAATGA GCGAGCTCGC GAACGAAATC AAGTCTCAAG TCGGTAATCA TAGCAAGGAA
GAACACTTGC CGACCGCTGA AGCGGCTATT CGACCCGAGC AACGGCGCTG GGGCAATCGC
ACGGGCGAGC AACTCGTGTT GAATTGGGAG CCGTCTACAC TCAATGAACT CGGCGCGTGC
ATGACATCCT GGAACGAGAC GTGCACCGTC AACTTTTACT TAACGCCCGC AGCGTTGGCT
AAGACTGCGC TCGGAGGGAT CGCTGACGCT ATCGCTTGGC CGGCGACGCT TCTCTCGAGC
GCGGGTTTCA TCGACGATCC TTGGGCTTTG GTCAAACTGC GCGGAAAAAT CGCGGGCGAA
GAACTCGCGC AGAGCTTGTT AGATGGCCAG CATGGTCATC GACCGGTGAC GTTCGTCGCG
TACAGCGCCG GTGCTTACGT CGTTCAGAGC TGCTTGCAAA AGTTGTACGA AGCCGGCGAC
AGAGGCAAGA ACATCGTCGA CCGCGCAATC TTCATCTCGG CGCCGATTTC TACGTCCAAG
GACGTTTGGC AGCCGATGCG TGAGGTCGTC TCCGGTCGTC TCGTTAACGT CCACTGCCAC
ACGGATTGGA TTTTGCTTCT CATGTGGCGC TTCAACATGC TCGATCCCAT GACCAGACTC
GCGGGCTTGT CCATCGTCAA GCGCGTGCCG AGCGTGGAAA ACTACAACAT TAAAAATCTC
CGTCACGCGC ATCTCCCCGA CGAAATCTCG CGCGTGCTCG AGGAAATCGA CCTTCAAGAG
TAA
 
Protein sequence
MREWPDEGLT VERLVETLVS RGALRADERA ALRQRAKRVG EDWTRYAQAE DRAVLKLRRK 
LVKGEDIRAN ALKLPKIDST SIKDWTKSDW ERFFKVVLAS VIGGAALFAS GGTATPALVA
AMHGLGLGAE AFSAFGGLQC MLGVTGASLC AQKMANRTKT ELENFDLIPL RGAHKSYAMH
IFVPGFTRDD HDLLGAWGAT NNQYVSVVPE SRSVVPDLGI EFTSGADGSI IVQAKDDSIA
KRHGVVSGST LLSYRSVKKP GEPSVVLSEL VDMPTSDELS RVPRPIEIRL QLPDRDDELK
KEMSELANEI KSQVGNHSKE EHLPTAEAAI RPEQRRWGNR TGEQLVLNWE PSTLNELGAC
MTSWNETCTV NFYLTPAALA KTALGGIADA IAWPATLLSS AGFIDDPWAL VKLRGKIAGE
ELAQSLLDGQ HGHRPVTFVA YSAGAYVVQS CLQKLYEAGD RGKNIVDRAI FISAPISTSK
DVWQPMREVV SGRLVNVHCH TDWILLLMWR FNMLDPMTRL AGLSIVKRVP SVENYNIKNL
RHAHLPDEIS RVLEEIDLQE