Gene OSTLU_32593 details

Gene Information

Locus tag: OSTLU_32593
Symbol:
ID: 5002640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 312796
End bp: 313916
Gene Length: 1121 bp
Protein Length: 350 aa
Translation table:
GC content: 55%
IMG OID: 640418061
Product: predicted protein
Protein accession: XP_001418900
Protein GI: 145348941
COG category:
COG ID:
TIGRFAM ID:
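
The listed gene length is consistent with the start and end coordinates if they are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). A minimal Python sketch of that arithmetic, with a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(312796, 313916))  # prints 1121
```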


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0289606
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GAACACCGCG ACGCGTTCGT CTCGACCCGA CGACGCGCGT TCGATACGCA CGCGTAAATG 
TTCGCATGGA CGTATGATCT CCCGCTCGGT GCGTACGCGT TCGGTGGGAC CGTGCGCGAG
GCGAGCGAGC AGCGAACGCA AAAGATTGCC GTGATTGTCA TCGGGTACGT GCTCGAGATT
TTCCTCGTCT TGGGCTACGC TCGAGCGGGC AAGAAGCGCA TGCATCCTAA ATTTAGACTG
AATCGTCGAC GAGCGAGGCG CCTGTTGGTG CACGTGGCCT TCGGAACGAT GGAGCTGACG
TTGGGCGCGC GAGCGTTGCT GGTCGGCGGC GGCGCGAGGG GTACGATGCG CGCGAGCGCG
GCGTGCGTGT TGGTGATCGC GCTCACGGCG ATGCTTCAAA TCGACGCATC GTTTGGGACA
CCGTCTATTA TAACTCCTGC ATTGCACATT ATAAACACCG CGCACATCGA AAAAGCGTTT
CGAGTGCTGA TTGGAGCCGG CGCGCTTTCG TCAGACTCGA GCGGCGCCGT ATTGGAAAAT
TTCATCGACC AGCTCGTTCT CGCGCAGGGG TTCGTGTATT CCAGAATGTT TATATTTGCG
TTCACGAATG TTCCGGGTAT TAAGGAGCAC CGGTACACGA CAGGGGCAAC TCTTGCACTG
TTTTTAGTCA TACCAGCAGT GTACGGTCCC GTGGGTGCCC TCATTGCCAT AGTCATCATT
GCCGCGTATT CCAACGCGTG CGGATCCGAG GAAAGCGGAA AAGAACGCAA TTTTGCAGCT
ATCTGTGGTA GAGTGAATTC GTTGGTACGT TTGCGAATCG GTAAGACGCC GTTACGGACA
GCTTTTGATG TGCTGGATAA GGATGGTAAT GGCACGTTAG ATGGGAGCGA GCTCCGTGAG
CAATTGTTCA CGTGGGGCGT CGACGCACGA GACATAAAAG AGCTTTTCGA GCGCATCGAT
ATCTCGAATC GCGGATCACT CTCCTTTGAG GAAATTCTGA ATGACACGGC GGGGCAAGCG
ATGCTTGAAT ACATCAGCGT TCTACTACAT GACAATGAAT CGCTCGATGG CGCGCCGCAT
TCGTCTTCAC GAGCATCGAA ATTAGCGTAA TTAATAGTCA T
 
Protein sequence
MFAWTYDLPL GAYAFGGTVR EASEQRTQKI AVIVIGYVLE IFLVLGYARA GKKRMHPKFR 
LNRRRARRLL VHVAFGTMEL TLGARALLVG GGARGTMRAS AACVLVIALT AMLQIDASFG
TPSIITPALH IINTAHIEKA FRVLIGAGAL SSDSSGAVLE NFIDQLVLAQ GFVYSRMFIF
AFTNVPGIKE HRYTTGATLA LFLVIPAVYG PVGALIAIVI IAAYSNACGS EESGKERNFA
AICGRVNSLV RLRIGKTPLR TAFDVLDKDG NGTLDGSELR EQLFTWGVDA RDIKELFERI
DISNRGSLSF EEILNDTAGQ AMLEYISVLL HDNESLDGAP HSSSRASKLA
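
The two sequences can be related with a short script. The Python sketch below strips the ten-residue grouping, computes a GC fraction, and translates from the first ATG. It assumes the standard genetic code (NCBI translation table 1), since the record leaves the translation table blank, and all function names are hypothetical. In the listed gene sequence the first ATG starts at base 58, and translation from there reproduces the start of the listed protein.

```python
# Standard genetic code (NCBI translation table 1). This is an assumption:
# the record does not fill in its "Translation table" field.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


def translate(dna: str, offset: int = 0) -> str:
    """Translate codon by codon from offset until a stop codon or the end."""
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines of the gene sequence listed above.
head = clean("""
GAACACCGCG ACGCGTTCGT CTCGACCCGA CGACGCGCGT TCGATACGCA CGCGTAAATG
TTCGCATGGA CGTATGATCT CCCGCTCGGT GCGTACGCGT TCGGTGGGAC CGTGCGCGAG
""")
start = head.find("ATG")  # 0-based index of the first ATG
print(start, translate(head, start)[:10])  # prints: 57 MFAWTYDLPL
```

Passing the full gene sequence to `clean` and `gc_fraction` gives a value that can be compared against the GC content reported in the record.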