Gene OSTLU_27987 details

Gene Information

Locus tag: OSTLU_27987
Symbol:
ID: 5006121
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 61485
End bp: 64962
Gene Length: 3478 bp
Protein Length: 1125 aa
Translation table:
GC content: 63%
IMG OID: 640421542
Product: predicted protein
Protein accession: XP_001421956
Protein GI: 145355413
COG category: [R] General function prediction only
COG ID: [COG5307] SEC7 domain proteins
TIGRFAM ID:
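
The start, end, and length fields above are related by simple inclusive-coordinate arithmetic. The following is a minimal sketch of that check, assuming both coordinates are 1-based and inclusive (the record does not state this explicitly, but the numbers agree with it). The helper name `inclusive_length` is hypothetical and used only for illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = inclusive_length(61485, 64962)
print(gene_length)  # 3478, matching the reported gene length

# Assumption: the coding region is the 1125 codons of the protein plus one stop codon.
coding_bp = (1125 + 1) * 3
print(coding_bp)  # 3378

# The remaining bases would be non-coding within the gene span.
print(gene_length - coding_bp)  # 100
```

Under these assumptions, the 100 bp difference between the gene span and the coding length would be consistent with the gene being annotated as spliced.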


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0599057
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.101897
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCCG ACGACGAGGC GACGCGTCGC GCGGCGAAGA CGGCGCTGAC GCAGATAATC 
AACGCGGCGT TTAAGCGCGC GGAGCGAGGG TTCGACGCGA TCGCGCGGGG AGATGCGGTG
GAGGACGACG GCGCGGACGC GGACGCGACG CGAGACGTGT CGCTGTTGTT GACGACGCTG
TGTAAAATCG CGGCGCGAGA GGGCGCGGTG GACGTCGACG CCTATCTCGC GCACTCCAAG
GCGTTGGCGC TGGATATCTT GAGGCAGCTC ATGGACGGTC CGCGCGCGAC GGTGTGGCTC
GAGTGCTTTC ACGCCGAGCT TCGACAACCG CTCTCCATCG CGTTGATGCG AAACGCGTTG
CTTCAGGTAC CTCGAGGCTC GGAGGCGGAG CAGAGCGTGG GAATTCTGGT TTCGATCGCG
CGCATGGCGT ACGGCACGCT CGTCGTTCGC GCGCGAGCGA CGTGGAAGCA GCAAGTCGCG
GCGTTGTATC CAATCATGTC GCTGCATCCG CTCGAGAGCG GCGACGCGAG CGCGGCGATG
CGAGTCTCGG CGCTTCGTCT CGTGCGAAGA CTGGCGTCGG ATTCCCAAGT GTTGGTGGAT
ATGTTCGTCA ACTACGATTG CGATTTACAC GCGGCCAATC TGTACGAACG CACCGTCATG
GCGCTGGCGC AGTCGGCGCA AGTGGCGGAC GTTTTGGAAC GCGACGCCGT GTTGACGTGC
TTGTTTAGCA TTTTACGCTC GTTGCAGTCG TGGCACGCTC GCGGAGAGAA CGGCGTCGAC
GACGCGTCGG TGGACATCGA CGATAACGAT GCCGACGTCT CGATGGAGGA CGAAGACGGA
TTCGACGGCG AGCTTCGACC GGCGGTTTCG AGACGAGCCT TGCGCAAGCT CAAATCGGGA
GGCGGCGCGA CGACGCCCAT CGGGAGCATC GCGGCGGCGG CGGCGGCGGC GACCGTCGCC
GTCGTCGCCG GCGATGATGG ACCTTCCACG CCCACGTCGC CGACGCATCG CGAAGAAAAA
ATCGCGACAG CTCCGACGTC GCCGTCAGAC TCCCCGTCGT CGCCGTCGTG TGTAAATTCC
CCGTCCGCCG CGGTGGAATC CGAAGCCGAA CGATTTCAAA AGGCGAAAAA GACGAAGGCC
TCCATGGAAA AGGCTGTCGA AGCCTTCAAC GTCGATCCTT CGACGCAGAC GCTCAGAGTC
GCGGCGCGCT CGGAGGATCC AAATGTGTGT GCGGAGTTTT TGCGCAAGAC GAGCGCGCGC
GTCGCGCCCG CCGCCATCGG CGAGTTACTC GGCTCGCCCG ACGCCGACGC GCTCGTCGTC
ATGCGCGCGT ACGTCCACAG ATTCGACTTT GCGTCGATGA GCATCGATGA TGCCATGCGC
TTGTTTTTGG GTGGGTTCAA GCTCCCAGGG GAGGCACAAA AGATTGATCG ATTGGTCGAG
GCGTTCGCGG CGAGGTTTTG CGCGTGCAAT CCGGGGGCGT ACCCGTCCGC GGACGCCGCG
TACATCTTAG CCTTCGCGAT CGTCATGTTG AACACGGATG CGCACAACCC CCTCACAGAC
GCCGCGATGA AGATGAGCGA AGGTGATTTC GTGCTCATGG CCACCGCAGC CGAGGCGACA
AAAGATTTGG ATGTGGAAGC CGTCGCCGCG ATTTACGCGC GCGTCACTGC CGAGGAAATC
AAAATGCACG CCGCGGAGCC ATCGACGGCG ACCAAAGCGA ACGGTGGCGA TAACGCGCGG
GCGAAGAAGA CGATGGCGCA AGTACTAAAC TTTGCAGCTC CCTGGAAGAA CCGATCTACG
CTCAAAGAGG CGAGCGACGA AACGGTAGAG TTACTCAAGT CCACGAAGGC GATGTTCAAG
CACGCCGAGG AGAGCGACGA AGCCGCGAGC GCGCTCTTCG TCCGAGCCTC CGAACCAGGC
TTGGCGCGAC CGATGCTCGA GGCGGCGGGG AAATGCATGT TAATCGCGTT ATCCAGCGCG
TTTGACTCCG CCCCGGATGA AGCGCACGCA GCGATGCCGC TCGAAGGCGC GAGAGCGATG
CTGTCGCTCG CGGCGCGACT GCAACTCCCG ATGTTGCGCG ACGATATCTG CACCTTCTTA
GTCTCTGCTC CTGGGTTTGG ACGCCGCGAA GGCATCGCAA CTCAGAGTAA AGAAGCTTTG
AGCACGCTCC TCGAGCTCGC GGCGAGCGAA TCCAATCTAG GCGGCGTCCA GGCGTGGGCG
AGCGTGTTGG AAATGGTGAC GCGGTTGGAA AACTTGCGCG CCGTCGTCGG CGCCGGCGTA
TCCTTCGATA CCGCGCGCGC GAAAGATATT TTTTGTGCAC CGCTTCGCAT GCAAGAACTA
GTGGCGTCAT CGAAATCGGC GACGCAATCG GGCGGCGATG TTAGCCCCGA CGCGCTCACC
CCGGCGGAGC TTTCGGTGAC GCAGTGGTTG TCCACCGCGG GCGGCGAAGC CATCGAGCGC
GTATTCGCGT TGAGCACTCG ATTCGATTCC GATGAGATCA TCGCCTACGC CTCCGCCATC
GCCACAGTAT CTCGTCATGA ACTCTGGGAC GGTGCGGGCG GTAACGTGTC GGCGTTACTA
CGTCTCACCG AAGTCGCGGC GACGAATATG ACGCGCGTGC GCCTGGTGTG GTCCAAGCTT
TGGAACGTCG TCGCCGAACA CCTGGTGGAG AGCGTGAAGC ACCCCGACGA TAAAGTCGTG
TTACACGCCA CAGATTCGCT ACGACAAGTG GCGAATAGGT TGCTTTTACG CGCGCGCGCG
ACGCGTTCGG CCACGCAGGT GGACGCGATG AAGCCATTCG TCGCGGCGAT CGAAAACGCC
CCAAACGCGC ACGCGAGAGA TTTGATTTCG TCGTGCGTCG CTCAAGCGCT CCAGCGATTC
GGGGACTCGC TGGATTTGGG TTGGGATCCA GCGTTAGAAG TGCTCGAACA CGTGTACGGT
GACGGTTCGT CGAGCGATGT CGCTTTGGGA GATGCGGAAG CCGCGGCGTG CGAGGCGTTG
GAGAAGGCGC TGGCCGCGGC GTTGGAAAAA AATGGCGACA GCGCGTCGAA GGTGGCGACG
GAGGACGACG ACGACTACCT CGGCCTACCG CTTGCGTGCG TCCCACGAGC GATGTGCTTG
CTCGGGACGT TCGCGCGACG CCGACGACGC GCGAGCGCTG GCTCGGACGA CGAGGATTCG
CCGCCGCGAC GCGCCGTTGC GACGATCGCC GCGGCGTGCC GCCGAGGACT CGCCATGGGC
GACGAGGACG CGAACGCGTG GTTGAAGACG ACTTGGGCGG CAACGTGCGA AACTATCGGC
GCGCTCGCGC GCGAAGACGA TCGCGCTCTC GACGCGTTGT TCCGCGTTCT CGAAGACGAC
GACGTCGAGC GACTGAGCGC CGAGGCGTGG GGGGTGGCGC GCGCTGGAGC GGTCGAGGGC
TTATTGGAGA CCAAGCTCGA CGCCACGCGG GCGTTGGATC TCATTCTTCC ACGCCTGA
 
Protein sequence
MIADDEATRR AAKTALTQII NAAFKRAERG FDAIARGDAV EDDGADADAT RDVSLLLTTL 
CKIAAREGAV DVDAYLAHSK ALALDILRQL MDGPRATVWL ECFHAELRQP LSIALMRNAL
LQVPRGSEAE QSVGILVSIA RMAYGTLVVR ARATWKQQVA ALYPIMSLHP LESGDASAAM
RVSALRLVRR LASDSQVLVD MFVNYDCDLH AANLYERTVM ALAQSAQVAD VLERDAVLTC
LFSILRSLQS WHARGENGVD DASVDIDDND ADVSMEDEDG FDGELRPAVS RRALRKLKSG
GGATTPIGSI AAAAAAATVA VVAGDDGPST PTSPTHREEK IATAPTSPSD SPSSPSCVNS
PSAAVESEAE RFQKAKKTKA SMEKAVEAFN VDPSTQTLRV AARSEDPNVC AEFLRKTSAR
VAPAAIGELL GSPDADALVV MRAYVHRFDF ASMSIDDAMR LFLGGFKLPG EAQKIDRLVE
AFAARFCACN PGAYPSADAA YILAFAIVML NTDAHNPLTD AAMKMSEGDF VLMATAAEAT
KDLDVEAVAA IYARVTAEEI KMHAAEPSTA TKANGGDNAR AKKTMAQVLN FAAPWKNRST
LKEASDETVE LLKSTKAMFK HAEESDEAAS ALFVRASEPG LARPMLEAAG KCMLIALSSA
FDSAPDEAHA AMPLEGARAM LSLAARLQLP MLRDDICTFL VSAPGFGRRE GIATQSKEAL
STLLELAASE SNLGGVQAWA SVLEMVTRLE NLRAVVGAGV SFDTARAKDI FCAPLRMQEL
VASSKSATQS GGDVSPDALT PAELSVTQWL STAGGEAIER VFALSTRFDS DEIIAYASAI
ATVSRHELWD GAGGNVSALL RLTEVAATNM TRVRLVWSKL WNVVAEHLVE SVKHPDDKVV
LHATDSLRQV ANRLLLRARA TRSATQVDAM KPFVAAIENA PNAHARDLIS SCVAQALQRF
GDSLDLGWDP ALEVLEHVYG DGSSSDVALG DAEAAACEAL EKALAAALEK NGDSASKVAT
EDDDDYLGLP LACVPRAMCL LGTFARRRRR ASAGSDDEDS PPRRAVATIA AACRRGLAMG
DEDANAWLKT TWAATRGGWR ALERSRAYWR PSSTPRGRWI SFFHA
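
The sequence blocks above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; the function names `clean_sequence` and `gc_fraction` are hypothetical, and the input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence block, copied from the record.
first_line = "ATGATCGCCG ACGACGAGGC GACGCGTCGC GCGGCGAAGA CGGCGCTGAC GCAGATAATC"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases per full line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Applying the same two functions to the whole gene sequence block would allow the reported gene length and GC content to be cross-checked against the printed sequence.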