Gene OSTLU_14901 details


Gene Information

Locus tag: OSTLU_14901
Symbol:
ID: 5001152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 119282
End bp: 120421
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table:
GC content: 67%
IMG OID: 640416573
Product: predicted protein
Protein accession: XP_001417150
Protein GI: 145345294
COG category:
COG ID:
TIGRFAM ID:
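
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. The minimal Python sketch below checks the arithmetic; the helper names `gene_length` and `protein_length` are illustrative and not part of any database API. The protein length assumes an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends with a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(119282, 120421)
print(length, protein_length(length))  # 1140 379
```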


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.0674562
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCCCG GCGACGCGCG CGACGGCGCG TCCGCGCTCG ACGTCGCCGC GCGCGCGGCG 
TCGCGCGCGG AAGACGACGC GTGGGACGCG CACCACGACG CCCTGCTCGC GTGGTCCGTC
GCGGACGGCC GCGACGGCGC GTGCAACGCG CCCTGGCGCG CGACGCACGC GGGACTCAAC
ATCGGCGCGT GGCTGCAGAA CCAGCGCGCG AAGCTAAGGG CGAAAAAGAT GCCGAGGGAG
CGGGCGACGA GGCTGGACGC GCTGACGGCG GCGGGACGAT TGTGGATCGA CGCGCCGGGA
CGGAAGGGAT GGAACGAACA GCTCGAGAAG CTCGCGGCGT GGGCGGAAAA GACGAACGGA
GGGGTTGATT ATAACGCGCC GGTCGGGACG ACGCACGAGG GGGCGAAGAT CGGGGCGTGG
TTGGCGACGC AGCGAACGCG GCGAAGAGAC GGCGAGAACG CGCGGCGACC GCTCAAGCCG
GAGCAAGCGG CGGCGCTGGA CGCGCTCGTC CTGAGGGGGG TGCTGAGGTG CGAGAAAGCG
GATCCTTGGC CGAGGAAGTG GGCGCTGGTG CTGAAGTGGG GGGAAGAGCG CGCGAACGGG
GAGCACTTTA ACGTGCCGTA CGATTACAAG GATGGCGACG AACGCGTGGG GGTGTGGTTG
AACACGCAGC GACAGCGGTT CCGCGGAGGG ACGACGAAGA ATTTGCCGCT CACGCCTTGG
CAGACAGAGC AAATGCAAGC GATGATCGAC GCCGGTAAGC TCTGGGTGCA CGCCCCGGAC
GACGTGTGGG AGAAGAAGTT CGCGCTGCTC TTGCGATGGG GCAAAGAGAA GACGCGAGGC
GTCAACTACA ACGTGCCGCA GGGCGAGGAG TACGAGGGCG TCAACCTCGG CTCGTGGTTG
AGCACGCAGC GCGCTCGACT GTTGCACGAG ACTCTCGGGA AAAACAGACC GCTCAGCGAT
GACGAACGTC GAAAGTTGCA GAAACTCATC GACGACGGCA AGCTGCGACC GTCGACGCCG
CGAGGCAAGA ACGCGGCGAA GGGACAGGGC AAGCGCGCGC CGCGAGGCAA TCTCGACGAC
GTCGACGCCG CGCTCAACGT CGACCTGCCG CACACGTCGA AGAGAGGACG TAAAACGTAA
 
Protein sequence
MPPGDARDGA SALDVAARAA SRAEDDAWDA HHDALLAWSV ADGRDGACNA PWRATHAGLN 
IGAWLQNQRA KLRAKKMPRE RATRLDALTA AGRLWIDAPG RKGWNEQLEK LAAWAEKTNG
GVDYNAPVGT THEGAKIGAW LATQRTRRRD GENARRPLKP EQAAALDALV LRGVLRCEKA
DPWPRKWALV LKWGEERANG EHFNVPYDYK DGDERVGVWL NTQRQRFRGG TTKNLPLTPW
QTEQMQAMID AGKLWVHAPD DVWEKKFALL LRWGKEKTRG VNYNVPQGEE YEGVNLGSWL
STQRARLLHE TLGKNRPLSD DERRKLQKLI DDGKLRPSTP RGKNAAKGQG KRAPRGNLDD
VDAALNVDLP HTSKRGRKT
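
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The record leaves the translation table field blank, so the sketch below assumes the standard genetic code (NCBI translation table 1). The function name `translate` is illustrative. Unrecognized codons are reported as `X`, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered by T, C, A, G
# at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCCCCCCG GCGACGCGCG CGACGGCGCG"))  # MPPGDARDGA
```

Translating the full gene sequence and comparing the result with the listed protein sequence, after removing spaces, gives a quick consistency check on the record.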