Gene OSTLU_14501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_14501 
Symbol 
ID5000809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp226023 
End bp227180 
Gene Length1158 bp 
Protein Length385 aa 
Translation table 
GC content63% 
IMG OID640416230 
Productpredicted protein 
Protein accessionXP_001416878 
Protein GI145344728 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.222713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTC GCGAGGCCAT CTTGGCGCGC GCCGCGGCGC TCGCAGAATC TTCCTCCGAC 
GACGCCGACG CGTCGACGCG CGACGTCGAC GGCGTTCAAC CGCTCGGTCG ACGACGGACG
TTCGTCCCGC TCGAAGAACT CACGCGCGAG CGCAAAGACG CGCCGTTTCG TCCGCCGGGA
GATCGCCGGG CGGCGGCGCG GGCGGCGGAC GACGGCGCGC GCGGGACGGC GGCGCCGGCG
CGGGCGCCGT CGACGGCGCC GTCGACGCCG AACGCGGTCG ACGCCGAGGC GTGGGCGAAG
GAGGATGCGA AATTATTGAT GCATTTACCG GAGAGCGAAC GGCCGATGGT GACGCAGGCG
TTTTCGATGC GAGCGCCGGC GAAGGTGCCG CGAGCGGTGC GACAGCGATT TCTGAACGCG
CTCGCGGCGA GAAAGCTTCG CAGTCGGGCG AACGTCGGAG TCAGGGAGCG CGTGAGCGAA
GCGTGGATAT CTGCACATCC AGATGAGCGT AAATCGGCGG TCGAGGAGGC GGTGAAGGAA
GAGATGAAAT TGCACGCAAA GGCGACGAGC AAGGCGACGT ATAGGAATTT GAGCGCGCAA
CTGATGTTGC GCGGTGGTGA ATCGGCGAAA GCGGAGCCTG GAGCGCTGAA GCAAGACTAC
GAGTGCTCGG CGGCGAACGA TATCGATCTC ACGCGATGTT CAACCGTGAG AGGAGATGAA
GCGGTCGAAT TTTTCATCGC TGCTTGCAAG CATCGCGACG GCTCGGCGCA TCGTGCTGCC
AACGTGGACG AAGTCAAAGT CGAGGTTGGC GGAGAAGAGG ACGTCGACGC GGAAGAGAGC
GAAGCGCTCG ACGAAGCGTG CGACGCCGCG ACGCGTATCG ATGAGCCAAT TAACGAAGTC
GCCGGTAAGA AGATAACTCA CTCAGCAGTA GAAGTCGCGG TGAGGAAGCT GTGTTGCGAT
TACGTGCAAC TTCTCGTCGA CACGGGCGCG TGCGAGGCAG CTTTGGCGAG CATCGTGGAG
CAAAAAGTCG TCAAAAAAGT GATGACGCGA CACCAAAACG ACCGCGATGA CTCGTTTCTG
ATCAAAGAGA GCGCGAGCAT TCGGAAGCTC GTCGCGAGCC AGCTCAAGCA CGAAAACGCG
AGACGGGCAA CAACGTAG
 
Protein sequence
MSRREAILAR AAALAESSSD DADASTRDVD GVQPLGRRRT FVPLEELTRE RKDAPFRPPG 
DRRAAARAAD DGARGTAAPA RAPSTAPSTP NAVDAEAWAK EDAKLLMHLP ESERPMVTQA
FSMRAPAKVP RAVRQRFLNA LAARKLRSRA NVGVRERVSE AWISAHPDER KSAVEEAVKE
EMKLHAKATS KATYRNLSAQ LMLRGGESAK AEPGALKQDY ECSAANDIDL TRCSTVRGDE
AVEFFIAACK HRDGSAHRAA NVDEVKVEVG GEEDVDAEES EALDEACDAA TRIDEPINEV
AGKKITHSAV EVAVRKLCCD YVQLLVDTGA CEAALASIVE QKVVKKVMTR HQNDRDDSFL
IKESASIRKL VASQLKHENA RRATT