Gene OSTLU_16119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16119 
Symbol 
ID5002997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp351862 
End bp353262 
Gene Length1401 bp 
Protein Length466 aa 
Translation table 
GC content64% 
IMG OID640418418 
Productpredicted protein 
Protein accessionXP_001418910 
Protein GI145348962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.485455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0341809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAGC GCGCGCGTCG AGCGGATCGC GCGCCCACCG ACGCCGTCAA GCGGCGCATA 
TGGGACCCGC GCGTGCGCGC CTTGGACGTG GACGACCGGA GGAGCTCGGA CACCCCCACG
GGCGTGCGCG TGCGCGGCGT GCGCGAGCGC GCCGACGTCG GCGACGCGCG TCGGGCGGCT
GCGCGCGAGC TCGCGCTGTT TCGCGGCGTG AAGCGCGCGA GGAGCGCGCT GCAGCGCATC
GCGCGGGCGA GGACGGGGCG GACCGACGAG GGAGGGACGC CGCTGCTGCT GTTTGAGCGT
TGGTTGGCGA GGTGCGCCGT CTCGGGCGCG CTCGCGGCGC CGGTGTTGCC GGCGCAGGGG
CTCGGTTTGG CGAAAGATTT GATTAGACAC GGGGCGAGCG AGCGAGACGG TGCGGAAGGG
GCGGCGGAGG CGCTCGCGGT CGCCAAGGAG AGCGCGGCTC GATGGGCCGA GGCGCGCGAC
GACGGCGGAG AGGCGCGCGA CGCCGTCGTC GTTCGCGACA AAGGTGCTTT CTTGACGATG
CAATTGGGAA CAGAGAAACC GTACGTGAAG TGTGCGAAAG CACATCTCGG TAAACTGCGC
GCTTTATATT GCAGAACAGT GCGAGGCTGT GAACCGTTGA CGGAGGACGT CGATTCAGAC
GAGTACCAAA AGTTCGCGTG CGCCGTGTTC GCATTGTTGA TGCGATACGA ATCGCTGGGC
GGGGCTGGAT ACCAAAGCGC GCTCGCCGAG GATGCGTTCG ATGTTTTGAA CGAAAAGTTG
GGCGTGTCGT GCGAGTGTTT CGCGTCGCCG CTCAACGCTC GGTACGGGCA ATTTTGTTCG
CAATTTGGTT TTGACGAGGA CCGCGCGCCA GACGTCGACG CGTTCTTCGG ATCGCTCGGA
AGCTTTTTTA GCGACGACTT TGCACCAAAA CGCGGATCGT TCGAGATGAA CCCGCCTTTC
GTCCCGGAAA CGATGTCGCG CGCGGTCGAA AAGGCGAACG ATTTGCTCGA TCGCGCCGCG
AACGCGAACG AGGCGCTCAG TTTCGTAGTC ATCGTACCGC TGTGGAAGGA ATGCCATTAC
TGGAGTGCGC TCTTGGAAAG TCGACATCTG CAGCACGGTC CAGACATCAT CGATGCGCAG
TCTCACGGCT TTTGCGACGG CGCCCAGCAC GCTCGTCCGA GTCACGAGCG CCATCGCGTG
TCGAGTTTCG ACACGGGCGT CTTCTACCTG CGAACGTCGC GCGCCGAGCG CGAGCGACCG
GTGGATGAGG AAATCAGAAA GCGCGTGTTG CGCGGCATGA AGACCGCGTT GGGGTCGTGC
AAAGACGTGC AAGAGTTGGA AGTGCGATAT CGCGGCGAGC GGGCGCGCGG CGGGCCCGCT
AAAATAGAAG ATAGAAAATA G
 
Protein sequence
MPKRARRADR APTDAVKRRI WDPRVRALDV DDRRSSDTPT GVRVRGVRER ADVGDARRAA 
ARELALFRGV KRARSALQRI ARARTGRTDE GGTPLLLFER WLARCAVSGA LAAPVLPAQG
LGLAKDLIRH GASERDGAEG AAEALAVAKE SAARWAEARD DGGEARDAVV VRDKGAFLTM
QLGTEKPYVK CAKAHLGKLR ALYCRTVRGC EPLTEDVDSD EYQKFACAVF ALLMRYESLG
GAGYQSALAE DAFDVLNEKL GVSCECFASP LNARYGQFCS QFGFDEDRAP DVDAFFGSLG
SFFSDDFAPK RGSFEMNPPF VPETMSRAVE KANDLLDRAA NANEALSFVV IVPLWKECHY
WSALLESRHL QHGPDIIDAQ SHGFCDGAQH ARPSHERHRV SSFDTGVFYL RTSRAERERP
VDEEIRKRVL RGMKTALGSC KDVQELEVRY RGERARGGPA KIEDRK