Gene OSTLU_43793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43793 
Symbol 
ID5006579 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp207042 
End bp209411 
Gene Length2370 bp 
Protein Length763 aa 
Translation table 
GC content58% 
IMG OID640422000 
Productpredicted protein 
Protein accessionXP_001422521 
Protein GI145356611 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.848996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0246757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCA CAGACGCGTT CGAGCACCAA AACTTTAACG CCGTGGATTT CATCAATCGA 
GTGCTCCCGG ACGAACGCGC GCTCGCGGGC GTCGATAAGA TGATCGCGAA GCTCCGCGCG
CGCGTGAAAC TGGTGGACGC GGAAATATTG GGCGCGCTGC GAGCGCAACA CGGGAGCGAA
GCGCGAGCGA GGGACGATTT CGAGGTGATC GTGAGCGGGA TCGACGCGCT CGCGGAACGC
GCGACGGAGA CGGAGAGAAA GGCGGCGGCG ACGGAGGCGA ACGTGCGAGA GATATGCGCG
GATATAGTGC GATTAGATAG GGCGAAGAAT CACTTGACGA ATTCCATCAC GACGCTGCGA
CGGTTGTCGA TGTTCGTGAG CGGGATGGAA CAGTTGGAGT TGTTCGCGTT GCGGAGGCAG
TACGGGGACG CGGCGAATTT GTTGCAGGCG GCGTCGCAGT TGGCGACGCA CTTCGAGGGG
TACTCGCAGA TTCCGAAGAT TGCGGAGTTG CAGGAAAAGT ATCGCGGGGT GAAGAATCAG
TTGCGCGCGG CGGTGTTTGA TGATTTTCAC ACGACGTGGC TGCCGCACGT GATGGACGGC
GACGCCGCGG CGCAGAAGAA ATTACGCGAC GCGTGCCTCG TCGTGAACGC GCTCGAACCG
AGCGTGCGCG AAGAGTTAGT CGGCAACCTC ACAAACAGAG AGCTGACCAA CTACGCGTCA
GTTTTCAGCG CGCACGAGAG TGGAGATTTC CTCGGTCGCA TCGCGAGACG GTATGATTGG
ATCACGCGAC AGTTACAATC GAAGGAGTCC ATGTGGGCGG TATTTCCAGC GCATTGGCGC
GTGCCACAAC TTTTAAGCGT GTCTCTTTGC AAGCTCACGC GCGCACAACT CGCGGAGGCG
CTTGATGCCC GCGGGCCGCA CGACGTACAA AAGCTCTTGC ACGCGATGCA CGTCACCATA
GAATTTGAGA TGGAGTTAGA CGAGCGTTTC GGCACTGGCG CGGGTGTGGA AGACGACGAG
CTCGAAGGTG ACAGTGCGTC GGCTTCGATG CTCCGGCAGA AACTCGAGCG CGCTGAGCGC
GAGAAACAGA CAGAAAACTT GCGCGGAGGG CGCGTCCTGC CCATGGATTC GGCCGCGGAA
GCCGCGGCGA CGTTCATGTT TCGGGGGAGC GTTGGATCGT GCTTCGAAGA TCATCTCGCC
GATTACGTCG CACTCGAGCG ACGTCAATTA TTTGAGCAGA TCAACGAAAG TATTCGAAAC
GAAACTTGGC AGGGTGACGA AACAAACCCA CGAATTTTGG CGAGCGCGAC GAGCGTGTTT
TTGAACATAA AGAAAGTGTT CAAGCGATGC TCCAATTTGA CGCGCGGTAA GACGCTCTTC
GCCGTGCACC AGGTGTTCGT ACAAGTTCTC ATCGCGTACG CCAAGGCTTT GAACGAACGC
ATCGACGTCG CAGCGTTGAA CGCGACGGAC GCTCGCCGTC CCGAGGCGCA GCGAGCGGCG
GAAATCAAGT GCATATGTCT CATCGTCAAT ACGGCTGAGT ATTGCAACGA AACCGTCGGT
CCACTCGGTG ATTCTATGGT CAAATCGTTG GAAGACAATT TCAAGGAGAA AGTCGACATG
ATGGACGTCG AAGATGCGTT CAGCACGACA CTGTCTGAGG CACTGAACAA ACTCATCGGC
GTGGTGGAGG CGAAATCAAA CCTCGTCTCT GGGATGCTTC GCGTGAACTG GGGCGCGCTC
GACGTCGTCG GCGACCAGAG CGAGTACGTA GACACGTTCG AACGCGCTAT TGCGCACGCG
ATGCCTGTGC TTCGCGCTTC AGTGAGCGAC ATCCATCACA CATTCTTTTG CGAGAAACTG
GCGTCGTCAA TCGCGCCGAA ATTGTACATC GCAGTGTTCA AGTGCAAGCG CTTTTCGGAA
ATCGGCGGTC AGCAACTTTT GCTCGACATG CACGCGGTGA AAGCAATTTT ACTGTCCTTG
CCCGCCATAG CCGCTGCCGG TACGGACGTC ACCGCCGAAC CATCGGCGCC GCCGATGAGT
TACGCGAAGA TGATCGCTCG CGAGATGGGC AAAGTCGAAG CGCTCGTGAA AACTATCCTC
TCCCCGAACG ACGGATTGGC GGAAACGTTC AAAGCCCTCC TTCCCATGAC AGCCAACGCC
ACGGATTTCA AAGCTATTTG CCTGCTCAAG GGCATGAAAC CAAACGAAAT CTCCGAGCCC
CCGTTCGGGC TCTTCGCCTC GGTCGGCGCT CCCGCCAGCT CCAAGCCGCT CGAGGATTTA
CCCAACGTCC CGAACAGACC CAAGGCGCCG CGCATGGACA ACGTCACCGC AAAAATGTCT
GGCATGTTCA AGCAGGGCAC CAAACAATAG
 
Protein sequence
MSSTDAFEHQ NFNAVDFINR VLPDERALAG VDKMIAKLRA RVKLVDAEIL GALRAQHGSE 
ARARDDFEVI VSGIDALAER ATETERKAAA TEANVREICA DIVRLDRAKN HLTNSITTLR
RLSMFVSGME QLELFALRRQ YGDAANLLQA ASQLATHFEG YSQIPKIAEL QEKYRGVKNQ
LRAAVFDDFH TTWLPHVMDG DAAAQKKLRD ACLVVNALEP SVREELVGNL TNRELTNYAS
VFSAHESGDF LGRIARRYDW ITRQLQSKES MWAVFPAHWR VPQLLSVSLC KLTRAQLAEA
LDARGPHDVQ KLLHAMHVTI EFEMELDERF GTGAGVEDDE LEGDSASASM LRQKLERAER
EKQTENLRGG RVLPMDSAAE AAATFMFRGS VGSCFEDHLA DYVALERRQL FEQINESIRN
ETWQGDETNP RILASATSVF VQVLIAYAKA LNERIDVAAL NATDARRPEA QRAAEIKCIC
LIVNTAEYCN ETVGPLGDSM VKSLEDNFKE KVDMMDVEDA FSTTLSEALN KLIGVVEAKS
NLVSGMLRVN WGALDVVGDQ SEYVDTFERA IAHAMPVLRA SVSDIHHTFF CEKLASSIAP
KLYIAVFKCK RFSEIGGQQL LLDMHAVKAI LLSLPAIAAA GTDVTAEPSA PPMSYAKMIA
REMGKVEALV KTILSPNDGL AETFKALLPM TANATDFKAI CLLKGMKPNE ISEPPFGLFA
SVGAPASSKP LEDLPNVPNR PKAPRMDNVT AKMSGMFKQG TKQ