Gene OSTLU_33344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33344 
Symbol 
ID5003634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp65923 
End bp67338 
Gene Length1416 bp 
Protein Length471 aa 
Translation table 
GC content58% 
IMG OID640419055 
Productpredicted protein 
Protein accessionXP_001419480 
Protein GI145350151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACG TGTGTCTGTA CTCGCCGGCG ACGATCGAAC GCGCGAGATC GGCGCTGCCG 
CGAACGGCGA GCACGTGGCG AGGGACGGTG AGCGTGGCGG TGTTGGCGGA TTTGAAAACG
CCCGGCGACG CGCTCGACTT GAGCGCGATG GCGAGCGAGT TGGAGGGAGA CGCGGGACGA
ATCGCGATCA CGATGGTGGA GGCGCTACCG GAGTACGAGA ATAGATTTCC AGTGAACTTT
TTGCGCAACT TGGCGCGGGA GAAGTGCGTC GCGGAGCTCG GAGCGAAGTA CGTGTTGGCA
CACGACGTCG ATTTCGAAGT CTTTGTTGCG CCCGACGAGG ACGCGTTTTT GAACGACGTG
CGAAACGTTT TGGGGAAGCG GAGCGGCGAA AAGGTACGCC GCGCGCTGGT CGTGCCGGCG
TTTCAACTTC ATGCGGTGTG GTCACAACGA GTGAACGCGA AGAGAGATGC GATTCTGAAC
GCGCGGCGGG TGCAAAGACA AACAAAGTCT CGCCGCAACG AAGGCAAAGA CGACGACGAG
TCCATCGACG CCGTCGTGGA CAGTATCGCA ACCAAGGGTG ATGTACGACG ACTAAATCCG
ATGTCTCTAT ACGAGACGCT CGTTGAAAGA GACCGAACTC GAACGGTCGA CTCAAAAGCG
ACCGCAATAA ATCTGACGCT TCCGTATTCC ACGCGCGAGC GCCTCGATCG CCTCGTTCGC
GAGCGTCGAC TTGCGAACGG TTTCCAAATC AATTACTTTC CGATCGCGCA CGCGCCGACA
AACTACACGG CATGGTTTGA GAACACCACC ACTGGTGCGG ACTCAACGTA TCGCGTCGCG
ACTCCGAAGC ATCCGTGGTA CTACGAGCCT TACGTCATCG TTCGCGCAGA TCTTGCGTTG
CCTTTCGATG AGTCTTTCGT GCAGTACGGC TTCAACAAGA TTTCATTCGT TCACGAACTC
GCCGCGGCGG GATTCGACTT CCACGTTACG AAGAACGCTC ACACTGTCCA TACGAACACA
CACCCGACGC GTGCAATGGC AAACATGCAA GGACAAGACT TGGCGCGTTG TCGCGCGCAC
CCTGCAGCGT CGAACGATTT TAGAATCGCT CGAGTCGGGC ACTCATGCAT TCCAGCTTTT
TTACGCCGAA TGGAGTGCGC GTACGGTTTT ACCTTGGATG ACTTAGAATT CGGTGGCGTA
TCGAATGCGC CGCCGCCCGA TGATTTGCTT TTTCGCCTAC AGTCGGATGA TAACATCGTC
TGCTTTGGGG GATGCATCAC GGATTTAGAA GATGCGCCGC GCACGCCGGC TACCGTCACC
GTGCGAGGCG GACGATTCGT CGGCGTGACG CAAGGCTCAG ACGCTCGTCG GCGTAAACGG
GGCCCTTGTG AGCGCTTTGA CGTAGCTTTA CAGTAG
 
Protein sequence
MSDVCLYSPA TIERARSALP RTASTWRGTV SVAVLADLKT PGDALDLSAM ASELEGDAGR 
IAITMVEALP EYENRFPVNF LRNLAREKCV AELGAKYVLA HDVDFEVFVA PDEDAFLNDV
RNVLGKRSGE KVRRALVVPA FQLHAVWSQR VNAKRDAILN ARRVQRQTKS RRNEGKDDDE
SIDAVVDSIA TKGDVRRLNP MSLYETLVER DRTRTVDSKA TAINLTLPYS TRERLDRLVR
ERRLANGFQI NYFPIAHAPT NYTAWFENTT TGADSTYRVA TPKHPWYYEP YVIVRADLAL
PFDESFVQYG FNKISFVHEL AAAGFDFHVT KNAHTVHTNT HPTRAMANMQ GQDLARCRAH
PAASNDFRIA RVGHSCIPAF LRRMECAYGF TLDDLEFGGV SNAPPPDDLL FRLQSDDNIV
CFGGCITDLE DAPRTPATVT VRGGRFVGVT QGSDARRRKR GPCERFDVAL Q