Gene OSTLU_43066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43066 
Symbol 
ID5005458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp501888 
End bp503783 
Gene Length1896 bp 
Protein Length631 aa 
Translation table 
GC content54% 
IMG OID640420879 
Productpredicted protein 
Protein accessionXP_001421483 
Protein GI145354420 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0251063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.287268 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTC ACATCAGCGG TATTCAGCTC TACGGCGGAC GCGATGAGTT GATTCAGGAA 
GGGGTTTTGA AGTTAGTGTT CGGGACGAAG TACGGTTTGG TCGGTCGAAA TGGATGCGGC
AAGTCGACGC TGTTGCGAGC GATTTCCGAA CGCGTCATCA AGTTGCCAGA ATTTTTGCAC
ATCATTCACG TCGAGCAAGA GGCGTCACCG GACGAGCGGT CCGCGTTGCA GACCGTCGTG
GAGACCGACA CCGAGCGTCT GTACTTGCTC AATCTAGAGA AGCGAATGTT GGACGAAGAA
CTCGATCAAA TCGATGGCAT CGACTTGAAC GAGGTGTACG AGCGCCTGGA CGAAATAGAC
GCCGACACGG CGACGGCGCG CGCGGGACAA ATTCTGGGCG GTCTCGGTTT CGATCCCGAG
GAGCAAATGA AGGCGACGAA AGAATTTTCC GGTGGTTGGC GCATGCGCAT CGCGCTCGCG
GCGGCGCTGT TCATGACGCC GGATTTGTTG TTGCTGGACG AACCGACGAA TCACTTGGAC
GTGCACGCTT TGACGTGGCT CGAAGAGTTT TTACGAAAGT GGGAAAAGAC GGTCCTCATC
GTGTCGCACG ATCGCGGCTT TTTGAACGAT TGCACGACGG CAACAATTTT CTTGCACCAC
AAAAAGTTGC GCTACTACGG CGGATCCTAC GATACCTTCC TCAAGGTGCG CGCTGAGCAC
CGAGCCAACG AGGAGGCGAT GCAGCGAAAC CAAAGTTTGC GAGAGTCATC TCTGAAGCAA
TTCATTCAGC GATTTGGGCA AGGTCACAAG AAAATGGTGC GTCAGGCACA ATGTCGCATG
AAGATGCTCG AAAAATTACA GAGCGAGCGG GTGGACGTGG ATTACGATGA TCCGTATCTA
CGCATCAATT TCCCATCCGC GTCGCCGTTG CCGCCACCGT GCATCTCCGT GATGAACGTC
GCGTTCGGTT ACGAAGGCTA TCAAACCTTG TACCAAAACT TAGACTTTGG GTTAGACATG
GATAGCCGAG TCGCAATCGT CGGACCGAAC GGGGCTGGTA AATCGACGTT TTTGAAGCTT
CTCGAGGGTG ACATTTTGCC CACCAAAGGT TGGATTAATC GTCACACCAA GCTTCGGCTA
GCGCGTTTCT CTCAACATCA CTTGGAGACG ATGAACTTGG AGGAAGATTG CGTAGCGCAC
ATGAAGAGAC TAGACAGCGA AATGCCTATA GAGACGGCGC GAGCGTACTT GGGTCGATTC
GGCCTGTCTG GCGAGCTCGC GACAAAGCCG ATTAAAGTTT TGTCCGGCGG GCAAAAGTCA
CGTCTAGCAT TTGCCGAACT CGCGTGGAAG CAACCTCACA TTCTCCTTTT GGATGAACCT
ACTAACCATC TTGACTTGGA AACGATCGAG TCACTTGCCA TGGCGCTCAA CAACTTCGAA
GGCGGCGTCG TGTTAGTCTC GCACGACGAG CGTCTCATTT CGCTCGTCGT CGATGAAATT
TGGATTGTCA CGAAGGGCGA CATGAAATCC AATCCTCCGG TCCCAGGAAG CGTACAAGTT
TTCAATGGAT CGTTCGACGA TTACAAAGCC AAGTTGCGCG AAGAGTTTTC GGGCGGCAAC
TTGTTGTCGG AGAAACGAAA AGCCGAAAAG AAGAGGGGAG GTGGGCGTGC GCCCGAACCC
GAACCCGAAC CCGAGAAGGC GCCCGCACCG GCGAAGCCTC CCGGAAAGAT CATGATGGAC
AGTGCGTTCA CAAAGTCCAC GTCAAACGAT ATCTCCAACG AACAAGATCG CCCGACTTCG
TCCCAGGTGA AATGGGTTCC GCCTCACTTG CGAGGCGCCG CGCAACAAGA CCAAGAGAAC
GCGGGCAGTG CCGACCAAGC GTGGGGCGAA GAATGA
 
Protein sequence
MDIHISGIQL YGGRDELIQE GVLKLVFGTK YGLVGRNGCG KSTLLRAISE RVIKLPEFLH 
IIHVEQEASP DERSALQTVV ETDTERLYLL NLEKRMLDEE LDQIDGIDLN EVYERLDEID
ADTATARAGQ ILGGLGFDPE EQMKATKEFS GGWRMRIALA AALFMTPDLL LLDEPTNHLD
VHALTWLEEF LRKWEKTVLI VSHDRGFLND CTTATIFLHH KKLRYYGGSY DTFLKVRAEH
RANEEAMQRN QSLRESSLKQ FIQRFGQGHK KMVRQAQCRM KMLEKLQSER VDVDYDDPYL
RINFPSASPL PPPCISVMNV AFGYEGYQTL YQNLDFGLDM DSRVAIVGPN GAGKSTFLKL
LEGDILPTKG WINRHTKLRL ARFSQHHLET MNLEEDCVAH MKRLDSEMPI ETARAYLGRF
GLSGELATKP IKVLSGGQKS RLAFAELAWK QPHILLLDEP TNHLDLETIE SLAMALNNFE
GGVVLVSHDE RLISLVVDEI WIVTKGDMKS NPPVPGSVQV FNGSFDDYKA KLREEFSGGN
LLSEKRKAEK KRGGGRAPEP EPEPEKAPAP AKPPGKIMMD SAFTKSTSND ISNEQDRPTS
SQVKWVPPHL RGAAQQDQEN AGSADQAWGE E