Gene OSTLU_88888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_88888 
Symbol 
ID5005066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp44646 
End bp45794 
Gene Length1149 bp 
Protein Length382 aa 
Translation table 
GC content60% 
IMG OID640420487 
Productpredicted protein 
Protein accessionXP_001421036 
Protein GI145353472 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0000824445 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0739348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGG TCGCGGCGGA TGCGGAGGCG AAGGAAGCGG CGGAGCGCGC GAGAGTTGAG 
GAGGAAAAGC GGCGGCAAGA GGCGGAGGAG GCGGCGAGGA AAAAGGCGGA GGAGGACGCC
GAGGCGGCGA AGAAGCGAGC GGAGGAAGAG GCGAAGGAAA AGGCGCGCGA AGAGGAAGCG
AAGCGCAAGC AGGCGGCGAG CGATTACGTC AGTGGTAAAG CCACGGGAGA AATCCCGCGC
GTCGCCGCGG CGGCGGAGGC GTTGGAGCAG GAAACGAACT TGGCCAAGAC GCTCGTCGAG
GCGCGCGCGA TGGTGGCCGA GTATCAGTCG CATCCGACGG CGAAATTGGA ACGCCGGAAA
CTCACGAACA CCATCGTGGT GCACGTGCAA CAAATCGCGG CGACGAAGGA GCAAATCAAT
AAGAAGAGTC GAGACATCAT GATGCTGCTG GTGCAATTGC AGGAGCCTCA AAAGACGTTC
GCGTTGATGA GCATCGCGAA AAAGATGCTC TCGCAGTGCG ACGTGCAGGT GGCCAAACTC
AATCGCTACG CGTTTGCGCT CGCCGAAGTC GCGGTGAGTA TCGCGATCGA CGTACCGAGG
TTTGGTGTCT TGCTCGTCGC CCTCATACAC GAGGTTTGCG TCAACGCGGT GCCGAAGTAT
TACCCGTTTG TTCCGGGACG TTACGCCACC GACGACGAAT ACTACAGTCT CATGGGGTAC
GTCAAAAACG ATGAAGGCAC GGCGTTCGAA ACCACGGATT CCTACGTCGA TCGCATGACG
GGTAGCATGC TCTTTTACGC CGCGTTTTTA CAAGTCGACG CGCCGAATCA CCCACACGGC
GTCGACGCCG CGTGGCGATG GCTCGCGCGT CTGTTGAACA GATGTCCGCC CAATCGCCAC
ACCGCGGTGG CTCTGGACTC ATTCCTCAAA ATCGCCGGCT TTCGCATGTA CGCGGCGTAT
CGCGGTCAGT TCGTCAAAGT CCTCGAACTC ATCCATCGAG AGTTTCTTCC AAAGTTGGAC
GCCAAGAACG ATCCCGACAT TCGGCCCGTG TCGTCGCGCA TCGCGACGTA CCTACAGGAG
AGTCTGTACA CGAAATCTCC CGAAGGCCGC GACATGCCCA ACACCGACAC CAGCTCGCAC
ACGTTTTGA
 
Protein sequence
MRAVAADAEA KEAAERARVE EEKRRQEAEE AARKKAEEDA EAAKKRAEEE AKEKAREEEA 
KRKQAASDYV SGKATGEIPR VAAAAEALEQ ETNLAKTLVE ARAMVAEYQS HPTAKLERRK
LTNTIVVHVQ QIAATKEQIN KKSRDIMMLL VQLQEPQKTF ALMSIAKKML SQCDVQVAKL
NRYAFALAEV AVSIAIDVPR FGVLLVALIH EVCVNAVPKY YPFVPGRYAT DDEYYSLMGY
VKNDEGTAFE TTDSYVDRMT GSMLFYAAFL QVDAPNHPHG VDAAWRWLAR LLNRCPPNRH
TAVALDSFLK IAGFRMYAAY RGQFVKVLEL IHREFLPKLD AKNDPDIRPV SSRIATYLQE
SLYTKSPEGR DMPNTDTSSH TF