Gene OSTLU_41510 details


Gene Information

Locus tag: OSTLU_41510
Symbol:
ID: 5005078
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 93162
End bp: 94446
Gene Length: 1285 bp
Protein Length: 383 aa
Translation table:
GC content: 61%
IMG OID: 640420499
Product: predicted protein
Protein accession: XP_001421048
Protein GI: 145353498
COG category: [K] Transcription
COG ID: [COG5095] Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA)
TIGRFAM ID:
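
The reported gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, but it matches this record: 94446 - 93162 + 1 = 1285. A minimal sketch of the arithmetic, using a hypothetical helper name that is not part of the database:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(93162, 94446)
print(gene_length)  # 1285
```

A 383-residue protein needs 383 x 3 = 1149 coding bases (not counting a stop codon), fewer than the 1285 bp span, which is consistent with the record marking the gene as spliced.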


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.0135123
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0116799
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGC CGAGCCGCGT GCACGTCGAT TCCGTGCGCG CGATCGCGGC GACGATCGGC 
GCGCCGCCCG TGGACGCCGA CGCCGCGCGC GCGCTGGCGA GCGATTGCGA GTACCGGCTG
CGACAGGTGT TTCAAGACGC GATGAAATGC ATGCGGGCGT CGAAACGAAC GACGCTCTCG
GCCGAGGTGC GCGCGAAGAC GCGCGCGAAG ACGCGATGCG AACGACGACG ATGACGAGAC
GCGCGCGCGA TCGACGCGGG ATGGACCGAT GGACGACGGA CTGACGAATC GGACGACGAC
GACGTCGATC GAACGCGCGC AGGACGTCAA CGCGGCGCTG CGATTGAGAA ATTGCGAACC
GCTGTATGGA TTCGGGGCGG GAACGAGCGA TTATGAATAT AAACAGACGC GCGAGGATCC
GGATGTGTTT TACGTGGAGG ACCGAGAGAT CGACATGCGG GAATTGCTGA CGAGGAAGTT
GCCGAGACCG CCGATCGAGG TGAACTTGGT GCCGCACTGG TTGGCGGTGG AGGGAGTGCA
GCCGATGATT CCAGAGAACC CGATGGTGCC CGCGGCGGAA CCGGTGGCGA TCGAACCTCC
GCGTGGGATG AAACGGCCGC GGCCGCGAGC GATGGGGGCG AAAGAGAATG GTGGGGATCC
GGACGCGGGA GGATTGTTAC CCGTGGTGTC GCACACGCTG AGCCGGGAAT TGCAGTTTTA
TTTCGACAAG GTCACGGCGC TGATTCGACA GGCTGGACGC GCCGATGCGA GCGACCGTGA
AGTTGAATTG CTCTCCACCG CGCTGCGATC GCTGAGCGCG GACGTCGGTC TGCACAATTT
GATGCCTTAC TTTACGCAGT TCATCACCGA GGAGACGACG CAAAACCTGA GGGATTTGCC
GCGGTTGCGA GTGTTGATTC AGATGATTCG GGCGTTAATT TCCAACCCCG ACATAAACGT
GGAGCTGTAT TTACACCAGC TCATGCCGAG CGTGGTGACG TGCGTGGTGG CGAAGCGTTT
ATGTCAAAAT TTGGACGAAG ACCACTGGTC GCTGCGAGAC GACGCGGCGT ACACCATGGC
GTTCATTTGC GGCAAGTTCG GCGACGCCTA TCCAAGCATT CGACCTAGGA TCACGCGCAC
GTTGTTGCGG GCGTTATTGG ATACCAAACC AATGACCACG CACTACGGCG CGATTCGAGG
CTTGCACGCT CTCGGTCCCA AGGTCGTGCG AGAGACGGTG ATGCCCAACT TGCGCTCGTA
CTTGAACACG CTCGAGCCGT TGCTC
 
Protein sequence
MSAPSRVHVD SVRAIAATIG APPVDADAAR ALASDCEYRL RQVFQDAMKC MRASKRTTLS 
AEDVNAALRL RNCEPLYGFG AGTSDYEYKQ TREDPDVFYV EDREIDMREL LTRKLPRPPI
EVNLVPHWLA VEGVQPMIPE NPMVPAAEPV AIEPPRGMKR PRPRAMGAKE NGGDPDAGGL
LPVVSHTLSR ELQFYFDKVT ALIRQAGRAD ASDREVELLS TALRSLSADV GLHNLMPYFT
QFITEETTQN LRDLPRLRVL IQMIRALISN PDINVELYLH QLMPSVVTCV VAKRLCQNLD
EDHWSLRDDA AYTMAFICGK FGDAYPSIRP RITRTLLRAL LDTKPMTTHY GAIRGLHALG
PKVVRETVMP NLRSYLNTLE PLL
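
Both sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before the text is used for length checks or composition statistics. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the Gene Information section. The function names are hypothetical, and the record does not say how its own GC figure was calculated.

```python
def strip_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 10-base block of the gene sequence shown above.
first_block = "ATGAGCGCGC"
print(gc_fraction(first_block))  # 0.7

# Last line of the protein sequence shown above: two full blocks plus "PLL".
protein_tail = strip_blocks("PKVVRETVMP NLRSYLNTLE PLL")
print(len(protein_tail))  # 23
```

Passing the full gene sequence to gc_fraction, and comparing len(strip_blocks(...)) against the reported Gene Length and Protein Length, gives a quick consistency check on a copied record.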