Gene OSTLU_29430 details


Gene Information

Locus tag: OSTLU_29430
Symbol:
ID: 5006584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 218575
End bp: 220722
Gene Length: 2148 bp
Protein Length: 715 aa
Translation table:
GC content: 59%
IMG OID: 640422005
Product: predicted protein
Protein accession: XP_001422686
Protein GI: 145356950
COG category:
COG ID:
TIGRFAM ID:
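The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(218575, 220722)
print(length)                     # 2148
print(protein_length_aa(length))  # 715
```

Both results match the Gene Length and Protein Length values reported in the record.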


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.400875
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCGC GCGCGCCGAC GACGCGTAAA ACCCCTCGCT CGCCCGCGCG CAAAGCGTCG 
CCCGCGCGCG GACGCCCGTC GACGCCGAAA TACAACGACG ATGTCGTGCA GAGTCTGTTA
CAAATCTTGA CTTCGGGACA AAAATCGAAA AGAACCAAGT CCCCGCTCAC GCTCGAGCGC
GCGGAGGCGC TGTGGGAGCT CGCGAAAACG AACCGCGAAC CGAATGTCGC GGCGACGCTG
ACGGCGATCG CGAACGGTGG CGGAAGGGGC GCTGGAGGCG TGGTGAAGGT GGAGGACGAC
GCCGCGCGGT GGTTGACGGC GCGAATTTCG AGCGCGTTCG GCGAAGGCGA GGATGGTGGA
GAACGAAGTA GCGCGAGAAA GAGCGGCACG AAACGGAATT TGTCGACGCG ATTTGAGGAT
GGGAGCGATG TGGAGGACGG GGAAGAAGCG GCGTCGGAGG AGAGAGAGAG ACGGGGCGGG
ACGTCGCCGC GCGGCGAACG CGCGGCGAAG CGAAGACGAT TGACGGTGAC GGAGCGTCTG
ATGCGGCGCG TGACCTTGGC GACCACGCGG AGCACGTCGA AGGAGTTGAG AGTAATCGAT
GGTGTGTCAT ATGATCGAGC GGCGTGCGCG ATCGTGGATG AGAGCTTGAA GAAGAAGGGC
AAGGTAGATT CAATGTGCGC GCGGGCTGTG GAGCTGAGTG TCAAGGCGAC GGGGTCGAGG
TATGGAATCA CACCTGCGGG ATTGCAGACG CTTCGAATGA TCGTAGAGGG TGGCGTGGAT
GGGCATTATA CGTTCCAAGT CGATGCAGCC GCGCGAAAGC ACCTGGGCGC GTTGATTGCG
GACGAAGAGA GCCTCGGGCC AGAGATACTC TTGCCGACGG TGAAGCGGCG AAAAGTCGCC
GCCGCACTGC AGAGGCTGAG TAATTTCGCG CGCCAAGCGC AGTATAGGAC GATTGACGGG
GTCAAGTACG ACGAGAACGC ATGTCACATT GCCGATGTCG CGACGCAAAC TCAAGGCAAG
ATCGGATTGG CGTGCGCGAA AGCGATCCAC GCCAGCATCG AAGATGGCGG CTCGCATTGG
GGCATCACTG ATATCGAGCT TCGCACGATT GAGATGATCA TCGAAGGCGG TCGCGCTGAC
TATTCATACG TCACAGACGC CGCGGCGATG AAGTATCTCA AAGATTTAGT CGTGAATGAG
CGGAGTAAAC GAGAAGACGC CGAAATCGAG CGGGTCGCGG CTGTAAGCGA AGCGCGCGGG
AACGCCGCCG GCGGAGTTTT GGCGCACGTT AGGAAACTCG TTCGACCCAA GTTTTATCGT
TACATCGACC GTCAGAGATA CGACGAAAAG GCGTGTGAGA TTGCAGATGA CTGCATGGAG
CGCATCGGTC GAATCAATTT GGAATGCGCG AAAGAGATTC ACGAGAGCAT CCTCGACGGT
GGCTCGCGGT GGGGCATCAC GTCGACGGAA TTCGCGACGA TTCAGCTCTT CCTCGACGGT
GGCAGAGACG ACGTGCTGTA CATTTGCGAC GACGATGCGC GGATGTATCT TTTGGATATT
TTAGGTACGA AGGCGATCGA AGAGACGGAG AATGTCACGC CGCCCACGAA AGTGCAGGTG
CGACGACAGT CGCCGGCAAG CGCGCTGCAA TCGGCGCTGA AAAATCCGCC GCCATCCGCA
CTGAGACACG CCGCGGCGAA ATCTGTGAAA AAGAAAGGGG TGAACTTTAC CCCCGCCAAG
CGTTTCGAAC TTCGCACGTA TATTCCGAGT CCGACGAGTG GGAACACGTC GGAGGAGGAA
ATCGACGACG ACGATGACGA CGACGAAGTG GAAGTTTTGA CGGAGCAAGA CGAAGAAATG
GTCGAAGAAG AAGCAGACGA AGTCGACGAA GACGAACAGA TCGTCGAAGT CGCGACGACG
AAACGGTCGA GCGTGCGAAT GATCACGTAT TCTAGACCGA GCATCATGAC GGCGTTTTAC
CGCAGCATGC AATTTTGGAT GACGCCCGTG AGCGTTCTCG ACACCGCCGC GGCGTGTTTA
GCGACAGCCG CCTCGTTTCT CGTCGCCCAT ATTTTCATCA TCGCCGCTCT GGAAGCGACG
ATCACGCCTT CGTCCATCAA GTCGTTGAAC ATGGAAGCCA AACTGTAA
 
Protein sequence
MPARAPTTRK TPRSPARKAS PARGRPSTPK YNDDVVQSLL QILTSGQKSK RTKSPLTLER 
AEALWELAKT NREPNVAATL TAIANGGGRG AGGVVKVEDD AARWLTARIS SAFGEGEDGG
ERSSARKSGT KRNLSTRFED GSDVEDGEEA ASEERERRGG TSPRGERAAK RRRLTVTERL
MRRVTLATTR STSKELRVID GVSYDRAACA IVDESLKKKG KVDSMCARAV ELSVKATGSR
YGITPAGLQT LRMIVEGGVD GHYTFQVDAA ARKHLGALIA DEESLGPEIL LPTVKRRKVA
AALQRLSNFA RQAQYRTIDG VKYDENACHI ADVATQTQGK IGLACAKAIH ASIEDGGSHW
GITDIELRTI EMIIEGGRAD YSYVTDAAAM KYLKDLVVNE RSKREDAEIE RVAAVSEARG
NAAGGVLAHV RKLVRPKFYR YIDRQRYDEK ACEIADDCME RIGRINLECA KEIHESILDG
GSRWGITSTE FATIQLFLDG GRDDVLYICD DDARMYLLDI LGTKAIEETE NVTPPTKVQV
RRQSPASALQ SALKNPPPSA LRHAAAKSVK KKGVNFTPAK RFELRTYIPS PTSGNTSEEE
IDDDDDDDEV EVLTEQDEEM VEEEADEVDE DEQIVEVATT KRSSVRMITY SRPSIMTAFY
RSMQFWMTPV SVLDTAAACL ATAASFLVAH IFIIAALEAT ITPSSIKSLN MEAKL
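
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as a GC-content check. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample string is just the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
sample = clean_sequence("ATGCCCGCGC GCGCGCCGAC")
print(len(sample))                   # 20
print(f"{gc_fraction(sample):.2f}")  # 0.85
```

Running the same two functions over the full gene sequence gives a value that can be compared with the GC content field in the record.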