Gene OSTLU_18085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18085 
Symbol 
ID5005569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp324011 
End bp325306 
Gene Length1296 bp 
Protein Length431 aa 
Translation table 
GC content60% 
IMG OID640420990 
Productpredicted protein 
Protein accessionXP_001421263 
Protein GI145353957 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.0963143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000212922 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGCAGG CGGGGACGTC GACGAAGGCG GGACGAGGCG CGGATGGGCG CGAGAGGTTC 
GTCGTCATTC GCGCGATGGG GCTGAGCGTG GACGGGAAGG TGGAGGCGGT GGTGGAGAGC
GAGGCGTTTC CGACGCTCGA GGCGGCGATC GGGCACAGCG AGTACTTGGC GCAGACGGTG
AGAGCGCGCG GGGGGGTTCG GGGGAACGTG GGACGAAACG GGGGGAAGAA AGGGATGAAG
GCGTCGCGAA CGACGATTTG CGAGTTGCCG GTGGACACGC AGGTGTGGCA CGTGACGCCG
ACGAATATTC CCGGGCCGGC GTATCGAAAC GAGGGGCAGC TCATGGTGAT GCTGCTGTCG
AGTCTGTGCG GACGCGTGGT GTCGAAATCG CGCAATGGGG TGTTGGATGA GTCGGAGTTC
GCGTACAACA ACGGCCACGG CGTGCGGTCG GCTCGCAACG GGGGCGACGC GGAGGCGCCA
CAAATTGCGG ACGGCATTCC TTACGTCAAA GCAAAGATTT TCGGCCAGAA AACGGCTGAC
CCATTCATGT TGCGCCCTAA CATCTCCAAG GCTGCGTTCA GAACGCTCCT CCTGCGCGCG
GTGGGTAAGT ACCTGGCCGA CGATCGCGAG CCGTACCCGG TGCTGGCGAA ATCAGACGTG
CAGCTTCAAG TCGCGCAAGT GGCTCACGGA GTCCAATACC ACGTTCTCGC GATTCGTGCG
CTTGGACATT CCGCCGCGGA CTTTTCGATG TCCATCGATG ATCAAGGTCG AACCTTTGTG
CGCGCCGATC CGGCGCAACC GCCAAACTCG AACCCGAAGT TGGGGCGGCC GTTCGAGCTG
TTGTGTCAGT TCCCTTCCCT CGTGCACTTG CAAACGTGCC GCTGCATTTA TCAAGACGAC
GTACTGTACA TCATTGTGTA TCCTCGTAAC GCCAAGTCGC GATCACTTCG TTTGTCTTCA
GCAGAAATTA GAAACACGTC TTTGCCGGAA AGCATTCGAA ACACCGCGAC GGGCGCCGGC
GTGGAAAAGG CCAACGGACC TTCGTTTCCG CGCGAAGTAG GGATCGAGGA TGGAACAAAT
AGCGCTAGAC CGGGCGATGA GATTGCCTTG GCTACCCTTG GTGGCGCTTT TGGGAAGAAC
GAAGATGACT CTGACTCCGA CAACACGGAT GACTTCTCGA ACGACGAGGC AGAGGAGCCA
CGTGCCAAGA CTGCGGCGCC CGAGGTTGAT GAGAAGACGA TAGATTTACC AGACGAGGAT
GTCGACAGCG AGGTGGAAGA GCCGATGCCC GAATAG
 
Protein sequence
MSQAGTSTKA GRGADGRERF VVIRAMGLSV DGKVEAVVES EAFPTLEAAI GHSEYLAQTV 
RARGGVRGNV GRNGGKKGMK ASRTTICELP VDTQVWHVTP TNIPGPAYRN EGQLMVMLLS
SLCGRVVSKS RNGVLDESEF AYNNGHGVRS ARNGGDAEAP QIADGIPYVK AKIFGQKTAD
PFMLRPNISK AAFRTLLLRA VGKYLADDRE PYPVLAKSDV QLQVAQVAHG VQYHVLAIRA
LGHSAADFSM SIDDQGRTFV RADPAQPPNS NPKLGRPFEL LCQFPSLVHL QTCRCIYQDD
VLYIIVYPRN AKSRSLRLSS AEIRNTSLPE SIRNTATGAG VEKANGPSFP REVGIEDGTN
SARPGDEIAL ATLGGAFGKN EDDSDSDNTD DFSNDEAEEP RAKTAAPEVD EKTIDLPDED
VDSEVEEPMP E