Gene OSTLU_38443 details


Gene Information

Locus tag: OSTLU_38443
Symbol:
ID: 5001766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 589432
End bp: 592392
Gene Length: 2961 bp
Protein Length: 986 aa
Translation table:
GC content: 51%
IMG OID: 640417187
Product: predicted protein
Protein accession: XP_001417814
Protein GI: 145346684
COG category: [R] General function prediction only
COG ID: [COG1444] Predicted P-loop ATPase fused to an acetyltransferase
TIGRFAM ID:
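
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 589432
end_bp = 592392
gene_length_bp = 2961
protein_length_aa = 986

# Assumption: coordinates are 1-based and inclusive, so the span
# covered by the gene is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop
# codon, so the protein has one residue fewer than the codon count.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.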


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.856005
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.626674
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAAGA AAGTGGACGC GCGCATACGC ACGCTCGTGG AGAATGGGGT GAAGAACAAC 
GAACGCACGA TGTTCGTGCT CGTGGGCGAT CGAGGACGAG ATCAAATCGT CAACTTGCAT
TACATGCTGA GTAAAGCGGT GGTGAAGTCG AGGCCGAACG TGCTGTGGTG CTATAAGAAG
GATTTGTACC TGTCTTCGCA TAAAAAGAAA CGAGCGAAAC AGATCAAGAA GATGCAAAAC
TCTGGCTTGA TGGATTCGGA GAACGAGGAT CCGTTCTCAC TGTTCGTGGC GTCGACGAAT
ATTCGGTACT GTTATTACGC GGATACGCAG AAAATTTTGG GGAACACGTA CGGGATGGCA
GTGTTGCAAG ATTTCGAGGC GCTGACACCG AACTTACTGG CACGAACGAT CGAGACCGTG
GAGGGTGGAG GGTTGATTGT GCTGCTGTTG TCGAACATGG AAAGCTTGAC GCAGCTGTAT
AACATGACGA TGGACGTGCA CTCGCGTTTT CGCACAGAGG GCCACCAAGA CGTGGTCGCT
AGATTTAATG AGCGATTCAC GTTGAGTTTA GGGGCGTGTC AGACGTGCAT CATGATGGAT
GATGAGTTGA ATATTTTGCC AACGAGTTCG CATATTAGAG GCATAGAACC AGTTGAAGAA
AAGCATGCGG GAGAGGCCCA AGAGCTTACC GATCTGAAGG AGAGCATGGA GGAAGTTGAG
CCGGCTGGCC CGCTTGTGAA GTGCTGTAAG ACTATGGATC AAGCCAAAGC AGTGGTGACG
TTTCTCGACG CTGCGAGTGA GAAGACCCTC CGATCAACAG TTGCGCTCAC CGCCGCGCGC
GGTCGCGGTA AATCTGCTGC CATGGGTATA GCTGTCGCAG GAGCCGTCGA AATGGGTTAT
GCAAACATTT TTGTCACTTC TCCCAGTCCT GAAAACTTGA AGACATTCTT TGAATTCATT
CTCAAAGGCT TCGATGCGCT TGGATACAAA GAGCACTTGG ATTACGATCT CGTAGAGTCC
ACCAACCCCG CGTTTGGAAA GTGTATTGTG CGTGTCAACG TCACGAGACG TCATCGACAA
ACGATTCAGT ACATTTTGCC TCAACACGCA GAGCGTGCGA CGCAAGCCGA GTTACTCGTC
ATCGATGAAG CAGCCGCCAT TCCGCTGCCT ACGGTAAAAG CGTTGCTCGG GCCGTATCTC
GTGTTCCTTT GTTCTACAAT CAACGGTTAC GAAGGCACAG GGAGGGCGTT GAGCATCAAA
CTCATCGGTA ATTTGCGAAG GGAAGCGGCA ACGTCACAAC AGGCGAACAA GGATGGGAAG
CTGGCGACAG GTGGATCTCG ATTGCTCAGA GAAGTAGCTC TTGCTGAACC TGTTCGTTAC
GCTGCTGGAG ATAGGATCGA AAAATGGTTG AATGATTTGT TGTGTCTCGA TGCGGCGGAT
GCATTAACGC CGCTCATCCA CGCGCTGCCT CCGCCCAGTG CATGCGAACT TTATGAGGTG
TCGCGTGATA CCTTATTCAG TGCGCACGCC GCAAGCGAGC AGTTCTTAAA GAAAATGATG
GCACTATATG TTTCTAGTCA CTACCGCAAC ACGCCGAACG ATTTGCAGCT CATGTCTGAT
GCGCCGGCTC ACCGCCTCTT TGTTCTCTTG GCGCCGGTGG ATGAAACACG AAACATGTTG
CCAGAGATTC TTTGCGTCAT TCAGGTAGCT TTAGAGGGTG CGATCTCGCA GAAGAGTGCG
CACGCAACAC TGGCTGCGGG GTTATCTCCA CAAGGTGACC TCATTCCGTG GACCATGGCA
TCGCAATTTC AGGATGAAGG CTTTCCTTCC TTCACGGGGG TGCGCATTGT TCGCATCGCC
GTGCATCCGG ACTTGCCTCG TCAAGGTTAC GGTTCTCGTG CGTTGCAGCT CATCCACGAT
TATTACGAAG GAAAATTGGC TGATTTACGC GAAGAAGAAG TGTCGAACAT CGACGCCAAG
ACGAATACTG AGCACTGGCA GCACACAGGC AAGCTCACAG AGGAGACACT CAAGCCTCGC
GAGAATTTGC CGCCGCTTTT GACAAACCTC TCAGAGCGCA AGCCAGAGAG GGTGAACTGG
ATCGGCACCG CATTCGGACT CACGTCCGAA CTTTACAGCT ACTGGAGCAA GGGTGGCTAT
AAGCCTGTGT ACCTGAGACA AACGTCAAGC GAGACTACGG GCGAACACTC GTGCATCATG
TTACGCCCTT GCTTTCCAGT TGGCGAGGAG GAAAGCCCGG ATGGTCACTG GGTGGATGCG
TTCTACGAAG ATTTCCGCGT GAGATTCACT TCTTTGATGG GATCTGCTTT TAGAGACCTT
GCTCCAGGTC TATGTTTGTC TTTGATGGCG CCAAAGCTAA ACTGGGACGA AAAACAAGGC
GCGGGGCGCG ACTCCGTCTT GAAGGCGGAC AATGAGATTT TGAGCCCGCA CGATTTACGA
CGCATCCAGA AATACTCAGC CGCTTTGGTG GACCATCATT TGATTGGTGA TTTGGTCCCC
CCACTCGCGC GCGCATACTT TGCAAAACGC ATTCCCGTGA CGCTTTCGTA CACGCAAGCT
GTGATTTTGC TCATCCTTGG TCTGCAACTG AAGACCATCG ACGACGGAAT GAAAACTTTG
GACTTGCCCG GACAACAAAT AATGGCATTA TTCAACAAAG CTATTCGTCG GATACACGGC
TCACTTCTCA AGGCACGCGA AGCCGATATC GAGAACTCGC TGCGTTCTGT GAGTATGCCA
GATTTGAGAC CCCATGCGGT TGGATTAGAT GAAGAATTAG ATGAAGGGTA CGTCGACTTC
AACTTTTTTT CGCGTTCGCC CGACTCGTGT CTCGCTCCAA AGATGATGGA AGTGATCGAA
AACAATATAG TTCCAGAATT TCACGCTCTT TGGCGCCACC GTTTAGGCGG GGTTATAAGT
CATAGAGTTC TATGTTGGTA A
 
Protein sequence
MRKKVDARIR TLVENGVKNN ERTMFVLVGD RGRDQIVNLH YMLSKAVVKS RPNVLWCYKK 
DLYLSSHKKK RAKQIKKMQN SGLMDSENED PFSLFVASTN IRYCYYADTQ KILGNTYGMA
VLQDFEALTP NLLARTIETV EGGGLIVLLL SNMESLTQLY NMTMDVHSRF RTEGHQDVVA
RFNERFTLSL GACQTCIMMD DELNILPTSS HIRGIEPVEE KHAGEAQELT DLKESMEEVE
PAGPLVKCCK TMDQAKAVVT FLDAASEKTL RSTVALTAAR GRGKSAAMGI AVAGAVEMGY
ANIFVTSPSP ENLKTFFEFI LKGFDALGYK EHLDYDLVES TNPAFGKCIV RVNVTRRHRQ
TIQYILPQHA ERATQAELLV IDEAAAIPLP TVKALLGPYL VFLCSTINGY EGTGRALSIK
LIGNLRREAA TSQQANKDGK LATGGSRLLR EVALAEPVRY AAGDRIEKWL NDLLCLDAAD
ALTPLIHALP PPSACELYEV SRDTLFSAHA ASEQFLKKMM ALYVSSHYRN TPNDLQLMSD
APAHRLFVLL APVDETRNML PEILCVIQVA LEGAISQKSA HATLAAGLSP QGDLIPWTMA
SQFQDEGFPS FTGVRIVRIA VHPDLPRQGY GSRALQLIHD YYEGKLADLR EEEVSNIDAK
TNTEHWQHTG KLTEETLKPR ENLPPLLTNL SERKPERVNW IGTAFGLTSE LYSYWSKGGY
KPVYLRQTSS ETTGEHSCIM LRPCFPVGEE ESPDGHWVDA FYEDFRVRFT SLMGSAFRDL
APGLCLSLMA PKLNWDEKQG AGRDSVLKAD NEILSPHDLR RIQKYSAALV DHHLIGDLVP
PLARAYFAKR IPVTLSYTQA VILLILGLQL KTIDDGMKTL DLPGQQIMAL FNKAIRRIHG
SLLKAREADI ENSLRSVSMP DLRPHAVGLD EELDEGYVDF NFFSRSPDSC LAPKMMEVIE
NNIVPEFHAL WRHRLGGVIS HRVLCW
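
Both sequences above are printed in blocks of ten characters separated by spaces, so they need to be cleaned before use. The sketch below is not part of the original record. It strips the block formatting, computes GC content, and translates the first line of the gene sequence for comparison with the start of the protein sequence. The record leaves the translation table field empty, so the standard genetic code is an assumption here, and the helper names (clean, gc_percent, translate) are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code, laid out in the conventional
# T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = clean(
    "ATGCGGAAGA AAGTGGACGC GCGCATACGC ACGCTCGTGG AGAATGGGGT GAAGAACAAC"
)
print(translate(first_line))  # MRKKVDARIRTLVENGVKNN
```

The translated fragment corresponds to the first two blocks of the protein sequence (MRKKVDARIR TLVENGVKNN). Passing the whole cleaned gene sequence to gc_percent and translate would allow the same comparison against the listed GC content and full protein sequence.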