Gene OSTLU_16028 details

Gene Information

Locus tag: OSTLU_16028
Symbol:
ID: 5003052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 143456
End bp: 144661
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table:
GC content: 59%
IMG OID: 640418473
Product: predicted protein
Protein accession: XP_001418621
Protein GI: 145348366
COG category: [T] Signal transduction mechanisms
COG ID: [COG0589] Universal stress protein UspA and related nucleotide-binding proteins
TIGRFAM ID:
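
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. Neither convention is stated in the record, and the helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 143456
end_bp = 144661
gene_length_bp = 1206
protein_length_aa = 401


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons print True for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions the fields agree: 144661 - 143456 + 1 = 1206 bp, and 1206 / 3 - 1 = 401 aa.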


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0596791
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0287728
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGGCA TGGGCGACGA GGCGGAACGA GGCGCGAACG CGGCGACGAT CGAGCCGGCG 
CGCGAGGCGC GGGCGAAGAA ACGGATCTTG TTGCTGCTGA GCGGCGCCCA AATGCAAGAC
GTCAAGATGG TGGAATTCGC GCGAGCGTTC GCGCTTGAAC CGAACGATGA CGTGTGGTGC
GTGCATTACT CGAGAGGCGC GACGCGCTAC GCGAGCGCGA TGTACGGACC GGGACGCGTC
GCGGGGTCGT ACAAAGCGGA CGAGGATTCG GTGACGGAGC GGTTGGGGGT GCGCGTCGTG
CCGCTGCGAC GAGACGACGC CGAGGGCGCG GAATCGGAAC GCGAGTCTTC GTGGTTGCCA
GAGTACGTTC GAGCGGAACT GCTCGATCGG CGGTTGAGCG GGAAATCGTT CGTGCAGGCT
GTGGAGATTT CGGGATCGAG CGGCAAGTTT TCGGTCGAAG AGGCCGTGCA GGCCATCGTC
GACGGCGAGT TCGCGAGCGC CGAGAACGAT AATTGGGTGT GCATTCCGAA GCCCGATCTC
ATCGTCATGG GATGCCGCGG ACACGGTCTC GTCCGGCGTG CGCTGCTCGG AAGCGTGACA
CAAAATGTTT TGAACCGCAT CCCGGTGTCC ACGCTCTTTT TTCGATCGTC TCTACCGAAG
ATTGTCGGAG CGAGCGATTT AGTCAAGCAA AAACTCGGAG GCGTCGATCA ACGCGTGGTG
TGCATTTGCA TGAGCGGCTC CAATTCCAGT CGGCGACTGT GCGAATACTT CGTAAAGGAG
CACATACGGT CCACCGACGT CGTTCTCTTG TTACACTGTC TCGCAGAGTC GCAACGCAAG
CAAAAGGATC TGAGCGAGGC GGATGTCGAA GAGAATCTCT CTCGATCGTA CGACCTCGTC
CAAAATTTTC AAAAGGAGCA CCCTGGCCAC GGACGCGTCA TTCGCATGGT TTTACAGAAG
GAGACGGGAA GTGCGTCGGA CATACGCGAT CGCACGATCG ACTTTTTGAA CATGACTGAT
GTGAACCTTG CCGTCGTCGG TCGAGCGATT TCGTCCGGGC ACTTGCGATC GCGCTTTATG
TCGCCGTACC CTCAGTATTG CGTCACGCAC GCGCCGTGCC CAGTTTTGGT GTGGAATCCA
CCTCCCTCGT ACATTCGCTC GCAGTCGCAA ACATCCGCCG CGGAAAAATC CACCGTCGCG
GCGTGA
 
Protein sequence
MEGMGDEAER GANAATIEPA REARAKKRIL LLLSGAQMQD VKMVEFARAF ALEPNDDVWC 
VHYSRGATRY ASAMYGPGRV AGSYKADEDS VTERLGVRVV PLRRDDAEGA ESERESSWLP
EYVRAELLDR RLSGKSFVQA VEISGSSGKF SVEEAVQAIV DGEFASAEND NWVCIPKPDL
IVMGCRGHGL VRRALLGSVT QNVLNRIPVS TLFFRSSLPK IVGASDLVKQ KLGGVDQRVV
CICMSGSNSS RRLCEYFVKE HIRSTDVVLL LHCLAESQRK QKDLSEADVE ENLSRSYDLV
QNFQKEHPGH GRVIRMVLQK ETGSASDIRD RTIDFLNMTD VNLAVVGRAI SSGHLRSRFM
SPYPQYCVTH APCPVLVWNP PPSYIRSQSQ TSAAEKSTVA A
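
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. Because the Translation table field is blank in this record, the use of the standard genetic code (NCBI table 1) is an assumption, and the function names `clean`, `gc_fraction` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: the record does not say which table applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises ValueError on any base other than A, C, G or T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAAGGCA TGGGCGACGA GGCGGAACGA"
print(translate(first_blocks))  # MEGMGDEAER
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.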