Gene OSTLU_30877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30877 
Symbol 
ID5001011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp907823 
End bp909193 
Gene Length1371 bp 
Protein Length456 aa 
Translation table 
GC content58% 
IMG OID640416432 
Productpredicted protein 
Protein accessionXP_001417091 
Protein GI145345164 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.242987 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACG CCAAGGTGCA CGCGATGATG GACGACATCG CGGGATGGAA GCGATCGACG 
GGGAAGATGC GACGCTTGCT GCGACAGATG CGCGCGCTTC GAGCGGCGAT CGCGGGAGAG
GTGGAGACGA CGTGGGGGAA GCTGACGCAC AGGACGCGCG GGACGTTTCT GAGGGAGAGC
CTGCCGCGGC TGTGCGAGTT TAAGGAGCGA GAGGCGCAGA TCGCGGCGTT CGGGCTGGTG
CAGACGCTGG CGCTGCACGA TGATTGCGCG ACGGCGCTGG CGACGAAGGA GATGTTTAAA
CTGTGCGTGG CGACGATCAA GGGGAAAGAT AAAGAGCGCG CGGTGGCGGC GAGCGGGGCG
CTGACGACGC TGGTGAATCA CGACGATACG CGGTTTTTAG CGAGCGACGA GGGGCTGGAT
AAGAGCATGA CGGCGCTGAT CACGGAAAAG GGGTTAGGGG TGAGGGTTAA AAGAAATTGC
GTGGTGACGT TCGCGAGAAT CGCGGATGAT CCCGAGGTGG CGTCGCTGAT GAGCGCGAAG
GCGCCGGAGC AATTGATTAA AAACTTTCTC GACTTCGTCG ATAAGACGGA CGACACGGAC
ACGGAGAAGT GGGCGCTCAT CGCCATCGCG CGTTTGGCGA TGAACGACGA ATTTAGTAAT
TTGATGGAGA AAAAAGGTTA CGTGCCTTTT TTGTTCGAAC TCTCGAGAGA TAAGATTCCG
GCTCGGAAGC TGGCGGCGGC GCTCGTCATC GCGCACATGG CGCGCAACAA GGATTTGCGC
GAGACGCTCG TCAAGTATCG CGCCATTCAG TTGTTTTGCA CGATTGCGAT GAACACGTCG
GAACGAATCG ATATGGCGGA GATGCAATTA GTCGCCGCGC TCGGGTTGAA AAATTTGGCG
TCGAATTTCG ATTTGCGCGC GCTCGCCGGA AAAACGGGCG CCATTCAGGC GTGCATCTTC
ATGTTGCGCA GTCCGCAGCA GGAAGTAAAG CGGTTCGCCG CGCTCGCGAT CGCAGAATTA
GCGCTGTACG AGCCAAACGG TGAGCGCTTT TGCAAGCAAG GGGCGTTAAA ATGGATCATT
CAGCTCGCTC GGACCGGGGA CGTGCGCTCG GAAACCGCCG CCATCACCGC GTTATCCAAC
TTGATGTTAT CGCCCGGAAA TCAGTCCATC ATGATTGTCG AGGACGGCAC TAAGGTGGTT
GATTATTTGC AAAACTCGCG CAACCCTCGC GTGGCGCACC TCGCCAAGCA GCTTTTGAAG
CGTTTGCGCA TGGCAAAGCT CCGCGCGGCG TGCAAGTTCG CCGCGCGAAT GAAAGCGACT
GGGAACGCAC TCATCGACGC AGGCATCGAA ATCGGCGAAG GTTACGAGTA G
 
Protein sequence
MTDAKVHAMM DDIAGWKRST GKMRRLLRQM RALRAAIAGE VETTWGKLTH RTRGTFLRES 
LPRLCEFKER EAQIAAFGLV QTLALHDDCA TALATKEMFK LCVATIKGKD KERAVAASGA
LTTLVNHDDT RFLASDEGLD KSMTALITEK GLGVRVKRNC VVTFARIADD PEVASLMSAK
APEQLIKNFL DFVDKTDDTD TEKWALIAIA RLAMNDEFSN LMEKKGYVPF LFELSRDKIP
ARKLAAALVI AHMARNKDLR ETLVKYRAIQ LFCTIAMNTS ERIDMAEMQL VAALGLKNLA
SNFDLRALAG KTGAIQACIF MLRSPQQEVK RFAALAIAEL ALYEPNGERF CKQGALKWII
QLARTGDVRS ETAAITALSN LMLSPGNQSI MIVEDGTKVV DYLQNSRNPR VAHLAKQLLK
RLRMAKLRAA CKFAARMKAT GNALIDAGIE IGEGYE