Gene OSTLU_94457 details

Gene Information

Locus tag: OSTLU_94457
Symbol:
ID: 5002165
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 213730
End bp: 215070
Gene Length: 1341 bp
Protein Length: 386 aa
Translation table:
GC content: 54%
IMG OID: 640417586
Product: predicted protein
Protein accession: XP_001418183
Protein GI: 145347461
COG category:
COG ID:
TIGRFAM ID:
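
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that Start bp and End bp are 1-based and inclusive. It also assumes that any bases in the gene span beyond the coding bases are intronic, which would be consistent with the record marking the gene as spliced.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 213730
end_bp = 215070
gene_length_bp = 1341
protein_length_aa = 386

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that do not code for protein.
# Assumption: these are intronic, since the record lists the gene as spliced.
noncoding_bp = span_bp - coding_bp

print("span:", span_bp, "coding:", coding_bp, "non-coding:", noncoding_bp)
```

Under these assumptions the span computed from the coordinates matches the listed Gene Length, and the difference between the span and the coding bases is what one would expect an intron to account for.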


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGAACG AGGACGGCGA TGTCGCCGCG GGGAGCCGGC CGAAGCGCAA GGCGAGCGCA 
TCGTTGCGAG CGCGGACGTC GCGAGGGAAA CAAGCGCCGT CGAGCGCGCA CTACGTGGGA
TACGTGCAAG ACGACGAGAC GCCAGAGATG ATTATGAAAA AGTTTGAAGA GATGGAACGG
ATTCGACAAG CCACCAAGGC GCAAGTCGGT GAGGATGGTG ATAAGGAGAA CAAGGAAGGC
GCGACAAACG GTGCGGGTGG AGACGCGGTC GGGACCAACG GCGACGAGCA CGAAGGGTTG
GACGAGGAAC AGTTGAAGGA GTTGTTCAAA AATACGTCTA CGTTTACGGT GAAGGAGGCC
GTGATGGATT CTAACGCGCT TTTCGGTGAT ATGAGGATCG CGAACGAAGA TGGGATGTAT
TTCTCGGACG ATGAAGAGTT GCAAGATGAA TTCTGGGAGG CGCTGACCGG TAAGAAGCGT
AGAGGTAAGA AATCCAAGGG GCCGAGAGTG CCGAGAGCGC CGAGAGCGCC GCGCGAACCA
AAGGCGGCGC AGTCACACAT GATCACCGCG TACAATAGCG ACCAAGGCTT GTTCTTGCGC
AAGAAGAAGT TTGTTGATCC TTTCGAGCCT GTCATCATCA AGGTCCCGGC GCACCCGATT
CCAGTGAGCT ATGGACGAGT AATTCAGCCG TACGAACCAA AATCGGTGCG AGAGGCGAAA
CTGAAACAAG TTCCCGATTG CGTGCACATG CAAACGAACA TCAAGAAGAT GGAATATGAA
TCGCTCGGTA AGGACTATTT AGGTGTGCTC ATGAACCCGC CGTGGGATAT TGAAGATTCC
CCAGATCGCG GCGACGTGAC GTTGGAGGAC ATCGAAGCCA TTCCGCTTGA AAAACTCACG
CCACTCGGTT TCATCTTTAT TTGGGTTGAG AAGGAAAATT TGTCCAAGGT TTGCGACATC
ATGGACCGAA AGAACTTTGT CTACGTAGAG AACTTGACGT GGGTACAACT CAAGCCGAAC
AACACGATCG TTGAGTCCTC TGCGCGCTAT CTTGGTCGCT CGCACAGAAC AATGCTCATC
TTCAGACGAG ACGTTCGCGA CAAGCGCTTC ATTGAAGGGA AGAAGATTGA GTTGCGACAC
CAACGTAACT CGGATGTGAC TCTCGATATT GTGCAGACCA CGAAAACTGG TCGACGTGTT
GTCCCTGAGC ACGTGTACAA GTCCATCGAA ACTCTTTTAC CGACGGCGTA CGAACCTGGA
ACGCCTGGTA AGCTCCTCGA ATTGTGGGCC GAACCGGGCG CGCGACGCGC GGGTTGGACT
TCCGTGGCGG ATACTCCTTA G
 
Protein sequence
MANEDGDVAA GSRPKRKASA SLRARTSRGK QAPSSAHYVG YVQDDETPEM IMKKFEEMER 
IRQATKAQVG EDGDKENKEG ATNGAGGDAV GTNGDEHEGL DEEQLKELFK NTSTFTVKEA
VMDSNALFGD MRIANEDGMY FSDDEELQDE FWEALTGKKR RVSYGRVIQP YEPKSVREAK
LKQVPDCVHM QTNIKKMEYE SLGKDYLGVL MNPPWDIEDS PDRGDVTLED IEAIPLEKLT
PLGFIFIWVE KENLSKVCDI MDRKNFVYVE NLTWVQLKPN NTIVESSARY LGRSHRTMLI
FRRDVRDKRF IEGKKIELRH QRNSDVTLDI VQTTKTGRRV VPEHVYKSIE TLLPTAYEPG
TPGKLLELWA EPGARRAGWT SVADTP
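
The sequences above are printed in blocks of ten characters, sixty per line. Before computing anything from them, the spaces and line breaks have to be removed. The sketch below shows one way to do that and to compute a GC percentage. The function names `clean_sequence` and `gc_percent` are hypothetical helpers introduced here for illustration, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces and line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, in its printed layout.
first_line = "ATGGCGAACG AGGACGGCGA TGTCGCCGCG GGGAGCCGGC CGAAGCGCAA GGCGAGCGCA"

seq = clean_sequence(first_line)
print(len(seq))                   # number of bases on the first line
print(seq.startswith("ATG"))      # whether the fragment begins with a start codon
print(round(gc_percent(seq), 1))  # GC percentage of this fragment only
```

Passing the full gene sequence through `clean_sequence` in the same way would give a string whose length can be compared with the Gene Length field, and whose GC percentage can be compared with the GC content field.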