Gene OSTLU_89073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_89073 
Symbol 
ID5005397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp118692 
End bp121787 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table 
GC content56% 
IMG OID640420818 
Productpredicted protein 
Protein accessionXP_001421393 
Protein GI145354228 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.161283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.722598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCGA GCAAGTGCAA CGTGCCCGCG GCGGCGAAGG TGACGCCGGC TGACGTCGAA 
GGGATCAAGA CGGAGGCGGC GAAGAAGAAG GACGAGGACG CGCGCAAGGC GGCGGCGGAA
AAGATTACGG AGATTGCGAA CGCGGCTTCG TTCGCGGAGG AACCGTATTT GATTGATTTG
CTCGAGGTTG CGATCACGCT CGCGGGTGAT AACAAGTCTA GCAATGTGCG CGCGGCTGGC
GACGCGGCGG TGGCGGCCAT CGCGCCCAAG TTGAGCGAGT TCGCGGTTCG CCCGGCGTTG
CGAGCGATCT TTGTTGGTTT TCAGTCTCAG TTCTGGCAGT CCACCATGGC TGCTTTGCGT
GTGCTCGATG CTTTCGTCGA TCGCAACCGC AAGGCGGTCG CGGCGAACTT GCCGGAAATC
ATCCCGGAGC TCGCGCAAGT CATGGTGCAC ATGCGCTCGG AAGTCAAGGA GGCGTCCACC
GCGTCCATGG CCAAGGTTGC GACGTGCGTC GGTAACTTGG ACATCGAGCC TTTCATCCCG
ACCTTGATTG AATGCATCAA CAACGTCGAT GAAGTTCCGG AATGCGTGCA CAAGCTCGCG
GCGACGACTT TTGTTCAACA AGTTGAATCC CCGACGCTCT CCCTCCTGGG TCCGCTTCTC
CAGCGCGGTT TGTTCTTCCA GCAAACCACC CCGATCAAGC GTAAGTCTGC CGTCATCATT
GACAACATGT GCAAACTTGT CGAAGATCCG ATGGATGCCG CGCCGTTCTT GCCCAAGCTT
TTGCCGCTTT TGAAGCGCGC CATGGACGAA GTCGCCGACC CGGAGTGCCG CCAAGTGTGT
ACTCGCGCGT ACAAGACTTT GCTCCAAGCC GCGGGCAACG AAACCGGTGC CGAAGATGGC
CAAGAAGGCA AGGGTGTCTA CAACGAAGAA ACCGTCAGCG AGCAGTTCAT CGCGCTTTTG
GCCGAATCCT CCGGTTCTTC CAAGGAAGAC GTCACGAAGT TTGTCCAAGG CGACGGCGTC
AAGGTGTACT TTGACTACAT TTGCTCCTTG TCCGCCAACG CGTTGTTGGC GAAGAACTTC
GATCTCGACA CCTGGACCAA GTCTTGCGCG ACCTCTTACT TGAAGCTTTT CTTCTCCGAG
GCCGATGCCA TGACCAAGTC TCTCGTCGAA CGCGCTCACG CCGTGTACGA AGCCTCCAAG
AAGGTCTTCG TCGTCGAGGA CGAAGAAGGC GAAGATCTTT GCAAGTGCGA CTTCTCTTTG
GCGTACGGTG CGCTCATTTT GCTGAACAAC GCCACGCTTC ACATGAAGAA GGGTAAGCGT
TACGGTCTGT GCGGCCCGAA CGGTTGCGGT AAGTCCACTT TGATGAAGGC TATCAACAAC
GGCCAAGTCG AAAACTTCCC GCCGCCGGAG GAACTCCGCA CGGTGTACGT CGAGCACGAC
ATCCAGGGTG ATCAACACAC GATGAACGTC GTTGAGTTCG TTCTCTCCGA CTCTGTCATC
CAAGGCCACG GCACGAGCAA GGAATCTGTC GCTTCCACGC TCTCCTCTTT CCAATTCACC
GACGAGATGA TCAACGGCCC GGTTGTCGCC CTCTCCGGTG GCTGGAAGAT GAAGCTCGCG
CTCGCGCGCG CCATCCTCAT GAAGGCGGAC ATTTTGTTAC TCGATGAGCC GACGAACCAC
TTGGACGTGA AGAACGTCGC GTGGTTGGAA GAGTACCTCA ACTCGCAAAC GCAAGTTTCC
TCCATGATTG TGTCTCACGA TTCCGGTTTC TTGGATCGCG TGTGTACTCA CATCATTCAC
TACGAGAACC GCAAGTTGGT GACGTACAGA GGTAATCTCA CCGAATTCGT CAAGCAGTGC
CCGGCGGCGA AGAAGTACAC CGAGCTCTCC AACGACGAAC TCAAGTTCAT CTTCCCGGTT
CCGGGTTTCC TCGAGGGCGT GAAGAACAAG GACAAGGCCA TCGTCAAGGC GACCAAGTGC
TGGTTCAAGT ACCCGAACAC GACGCGTCAA ATCATCCAAG ACGCGACCAT TCAGCTCTCT
TTGAGCTCTC GTGTCGCGTG CCTCGGCCCG AACGGTGCTG GTAAGTCTAC CTTCATCAAG
CTTCTCACCG GTGAAGCCGA ACCGGACCAG GGTACCGTCT GGCGTCACCC GAACATGCGT
TACGCGTACG TCGCGCAGCA CGCGTTCCAT CACGTCGAGC AGCACTTGGA CAAAACGCCG
AACGAATACA TCCGCTGGCG TTTCTCCACG GGCGAAGACA AGGAAAACTT GACCAAGGTC
ACCGCGCAAT ACACCGAAGA GGAAGAGCGC ATGATGAAGG AAAAGATTCC GGTCCCGCAA
GAGGATGGTT CCATTCTCAA GCTCGTCGTC GAGAAGATTC TTGGTCGCCG CCAAAAGAAG
TCCAAGTACG AGTACGAGTG CCAATGGAAG GGTCTCTCCA TGGACTCCAA CTCTTGGATG
GAGCGCGAAA AGCTCGAAAA GTATGGTTTC ACCAAGTACC TCAACCGCGT TGACGAGCGC
GAAGCCGCTC GCGCGGGCTT GTACGCGCGC CCGCTCACGC AAGCGAACGT CGAGAAGCAC
TTGATCGACT TTGGTTTGGA TGCCGAATTC GGTACCCACA ACCGCATCAA GGGTCTTTCC
GGTGGCCAAA AGGTTAAGCT CGTGCTCGGT TCCGCCATGT GGCAGCAACC GCACATCGTC
GTCATGGACG AACCGACCAA CTATTTGGAT CGCGACGCCC TCGGCGCGCT CGCGTGCGCC
GTCAAGGAAT ACGACGGTGG CGTTCTTCTC ATCACGCACA ACTGCGAATT CGCCGATGCG
TTGAAGGAAG AAACGTGGAA CGTTCCGGGT AATGGTTTTG TTGAAATTGA AGGTAACAAG
TGGGGTCAAG GCAAGTCCGC TAAGGGTGCC AAGGTTGAAT TCGAAGTCCA AGAGGACACC
GTCGACGCGC TCGGGAACAA GGTTAAGGTC AAGGGACCGA AGAAGAAGTT GTCTCGCAAG
GAGATCAAGG CTATGCAAAA GACGAGAGCG GCTAAATTAG CCGCTGGCCA GGACATAACC
ACAGACTCGG ATTGGGACTT GGACCAGGTT AGTTGA
 
Protein sequence
MAPSKCNVPA AAKVTPADVE GIKTEAAKKK DEDARKAAAE KITEIANAAS FAEEPYLIDL 
LEVAITLAGD NKSSNVRAAG DAAVAAIAPK LSEFAVRPAL RAIFVGFQSQ FWQSTMAALR
VLDAFVDRNR KAVAANLPEI IPELAQVMVH MRSEVKEAST ASMAKVATCV GNLDIEPFIP
TLIECINNVD EVPECVHKLA ATTFVQQVES PTLSLLGPLL QRGLFFQQTT PIKRKSAVII
DNMCKLVEDP MDAAPFLPKL LPLLKRAMDE VADPECRQVC TRAYKTLLQA AGNETGAEDG
QEGKGVYNEE TVSEQFIALL AESSGSSKED VTKFVQGDGV KVYFDYICSL SANALLAKNF
DLDTWTKSCA TSYLKLFFSE ADAMTKSLVE RAHAVYEASK KVFVVEDEEG EDLCKCDFSL
AYGALILLNN ATLHMKKGKR YGLCGPNGCG KSTLMKAINN GQVENFPPPE ELRTVYVEHD
IQGDQHTMNV VEFVLSDSVI QGHGTSKESV ASTLSSFQFT DEMINGPVVA LSGGWKMKLA
LARAILMKAD ILLLDEPTNH LDVKNVAWLE EYLNSQTQVS SMIVSHDSGF LDRVCTHIIH
YENRKLVTYR GNLTEFVKQC PAAKKYTELS NDELKFIFPV PGFLEGVKNK DKAIVKATKC
WFKYPNTTRQ IIQDATIQLS LSSRVACLGP NGAGKSTFIK LLTGEAEPDQ GTVWRHPNMR
YAYVAQHAFH HVEQHLDKTP NEYIRWRFST GEDKENLTKV TAQYTEEEER MMKEKIPVPQ
EDGSILKLVV EKILGRRQKK SKYEYECQWK GLSMDSNSWM EREKLEKYGF TKYLNRVDER
EAARAGLYAR PLTQANVEKH LIDFGLDAEF GTHNRIKGLS GGQKVKLVLG SAMWQQPHIV
VMDEPTNYLD RDALGALACA VKEYDGGVLL ITHNCEFADA LKEETWNVPG NGFVEIEGNK
WGQGKSAKGA KVEFEVQEDT VDALGNKVKV KGPKKKLSRK EIKAMQKTRA AKLAAGQDIT
TDSDWDLDQV S