Gene OSTLU_16603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16603 
Symbol 
ID5003317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp678729 
End bp680738 
Gene Length2010 bp 
Protein Length669 aa 
Translation table 
GC content55% 
IMG OID640418738 
Productpredicted protein 
Protein accessionXP_001419241 
Protein GI145349650 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTGGT CGTACCTCAC CGGCGGTGCC GCGGCCGCGG CGCAGGAGCA ACGATCGACG 
ATCGACGCAT CCGCGAGCGA CATCGACCGC ATGAAAGCCG AACTCGCGCT CAAGGATGAA
CAAATCAAGC ACGAACAAGC GCGCGTGTTG GCGCTCGAGT TGCAGATTAA ACTTCGCGAG
CGCGACGAGC GCATCGAACG GCTCGAACGC GAGCTGAAAG AGAGCGCGGA GAAAGCGCCG
AGGTCGTCGT CGCACGGGAC GCCGCAGTCG TACCCGCGCG CCTTCGCGTC GACCGCGCGC
CCGCAAAACC GCGAGTCGGG AGGGACGAAA CGCACGAGAG ACTCGCGTGA TTACGCTTCA
ACCTCCTCGG ACGACAAAGG TGACGACGAC GGCGACGACG CGACGCGCTC GAGTCCAAAC
CATGCCGTAA AAACGAAGAC GTGTACCCCG TGGACGGTCG CGGAGGAGAA TTTCGTCATG
AACTACTTCG CTGGCGTCTG GGGCGGTAGT GCTGCGTCGC TCTTCCGGAA CCGGGACTTT
CTCGAGCGAT TGCGAGCCGT GAGTGGGAAC AAAAGGACGA AGGGAGCGCT TGCGGCGAAG
TGGTTTAAGG GTGGCCTCAG AAACGAACTC AAACGCAAAG CTGAAAAGTG CTTGCTGTAC
ACGGAGGAGG AGGGATTGAG GATACCTTGC TGGAGTAAGG AGGAGGAAGA TTTCTTCATG
AAGCATTTGA AGAAAAGTGG TTACCAAGTG AGCGAAGGAA GGTTCATCCA TAATTACCTG
ACTCAAGATT TCCTCGAGCG TCTGGCTAAA ATGAACGGGG GCATCAAACG AACTCAAAAG
GACGCTTACA ACAGGTTTTA CAACACAACT TTTGAAAAGT ACAAGAAGAA ATTACAAGCG
ACGGAGAAGG CGAGACAGGA AAACGGGGCG CTAACAAAGA AGCGAGAGAA GAAACTCATC
GAGAGTCGAC CGACTGTGAC CAATGCCGAC GGTCAAATCA CGGCAAATGG CCGTATCAAG
CTTCAGAAGA CGACCACGGC GAAGACCGGA CCAAACGGTG AGATCATCCT GGTGCCGATT
CCCGGCGTCA AGTTTTGCCA TCGTTGCAAG CGCACGAACA GGGAGGGCAT GGATTTCCAT
GAGCATAACT TCTCGACATG TATCAAGTGT CATGACCGCG TAAAAGAAAT CGAAGAAGCG
CGAATCGCGG CCGGACTACC GCGCCGGGGA AAAGGTTCGA GCCCGCGCAA CAAGCCCAAG
GAAACCAAGT TGCCTCCGAT CGGAGCAGAC GGCAAGCGGG ACTTGTTGGC GTACGAAAAT
AAAGAGAAAT GGTCCGAAAT GATCAAGGCA GTCGTGGACG GTGACGTGCA AGTGTGTGCC
GATGTCAGAG AAAAGTTGAT CAAGGATAAG CAGCTCTATC GCAATGTCGT CGCGTGGGAT
AAATCGATCT TCTCCACCGC GGCGTTTTCG AGCAAGAATC CGATTCGTGT CATCAGCTGG
GCGCTAGCGA ATGGGGCGTC GCGCGATGAA ATCAACGAAG ATGCGCTGAA ATGCGCTGTT
GAGCGTAAAC CGCACGATAC GAACGAGCCC ACCGACGGGT GTGCGTACGT CCCGGCGGTC
AAGGTTTTGA AACACTTGCA CAAGAGTGGG TTTCCGGCGA CGGAGGATGT TATACACACG
GCGTGCGCGT TCGGCGACGT CGATTGCGTG AGGTACTTGA AGGAGAACTA TGAATGCTGC
GATTTTGAGA ATATCTGGCG TGATTTCAAG AGATCTGACG GCGAAAACAA TGACATCATG
ATCGTCGCCG CGCAGGAAGG TCACGTAGAC GTTTTGAAGT ACTTGTACGA AAACGACTGC
GATTTTTCAG TGCAAGACGC AGAACACGCC ATGCGAGTGG CCACGAGCCG CAAGCCTCGA
CGCGCTGATG GGTTCGAAAA AGTCAAGGAG TGGATGGAAT CTACGGCCGA ATGGCGAGAA
TCGCAGGCTG AGAAAGAAAT AGAGGAATAG
 
Protein sequence
MAWSYLTGGA AAAAQEQRST IDASASDIDR MKAELALKDE QIKHEQARVL ALELQIKLRE 
RDERIERLER ELKESAEKAP RSSSHGTPQS YPRAFASTAR PQNRESGGTK RTRDSRDYAS
TSSDDKGDDD GDDATRSSPN HAVKTKTCTP WTVAEENFVM NYFAGVWGGS AASLFRNRDF
LERLRAVSGN KRTKGALAAK WFKGGLRNEL KRKAEKCLLY TEEEGLRIPC WSKEEEDFFM
KHLKKSGYQV SEGRFIHNYL TQDFLERLAK MNGGIKRTQK DAYNRFYNTT FEKYKKKLQA
TEKARQENGA LTKKREKKLI ESRPTVTNAD GQITANGRIK LQKTTTAKTG PNGEIILVPI
PGVKFCHRCK RTNREGMDFH EHNFSTCIKC HDRVKEIEEA RIAAGLPRRG KGSSPRNKPK
ETKLPPIGAD GKRDLLAYEN KEKWSEMIKA VVDGDVQVCA DVREKLIKDK QLYRNVVAWD
KSIFSTAAFS SKNPIRVISW ALANGASRDE INEDALKCAV ERKPHDTNEP TDGCAYVPAV
KVLKHLHKSG FPATEDVIHT ACAFGDVDCV RYLKENYECC DFENIWRDFK RSDGENNDIM
IVAAQEGHVD VLKYLYENDC DFSVQDAEHA MRVATSRKPR RADGFEKVKE WMESTAEWRE
SQAEKEIEE